Patent application number | Description | Published |
20100269071 | INTEGRATING INFORMATION SOURCES - Disclosed is a method and system for integrating heterogeneous fragments of information from at least two collaboration channels, using one or more servers, each adapted to provide fragments from one collaboration channel; one or more adapters, each adapted to adapt a fragment provided by a corresponding server to an interface common to the collaboration channels; and a module adapted to: find a previously stored fragment related to a new adapted fragment, and create a link between the related fragment and the new fragment; and one or more repositories, each adapted to store an adapted fragment from a corresponding server. Other embodiments are also disclosed. | 10-21-2010 |
20100332424 | DETECTING FACTUAL INCONSISTENCIES BETWEEN A DOCUMENT AND A FACT-BASE - Techniques for identifying one or more inconsistencies between an unstructured document and a back-end fact-base are provided. The techniques include automatically parsing a query document and comparing the document with a back-end fact-base comprising facts relevant to the document, identifying one or more inconsistencies between information mentioned in the document and the facts stored in the back-end fact-base, and providing a response to the query document, wherein the response additionally includes the one or more identified inconsistencies. | 12-30-2010 |
20110125729 | ANALYZING XML DATA - Embodiments of the invention disclose a method, a system and a computer program product of discovering automated insights in XML data by generating a query result in response to querying data using a query, wherein the data is in a markup language format, and identifying a pattern associated with the query result, wherein the data in the markup language format is used for pattern identification. | 05-26-2011 |
20110191323 | EFFICIENT MULTIPLE TUPLE GENERATION OVER STREAMING XML DATA - Methods and arrangements for extracting tuples from a streaming XML document. A query twig is applied to the XML document stream, tuples are extracted from the XML document stream based on the query twig, and a quantity of extracted tuples is limited via forgoing extraction of duplicate tuples and extraction of tuples that do not satisfy query twig criteria. | 08-04-2011 |
20110191361 | SYSTEM AND METHOD FOR BUILDING A CLOUD AWARE MASSIVE DATA ANALYTICS SOLUTION BACKGROUND - Embodiments of the invention provide data management solutions that go beyond the traditional warehousing system to support advanced analytics. Furthermore, embodiments of the invention relate to systems and methods for extracting data from an existing data warehouse, storing the extracted data in a reusable (intermediate) form using data parallel and compute parallel techniques over cloud, query processing over the data with/without compute parallel techniques, and providing querying using high level querying languages. | 08-04-2011 |
20110191781 | RESOURCES MANAGEMENT IN DISTRIBUTED COMPUTING ENVIRONMENT - A method, system and a computer program product for determining resources allocation in a distributed computing environment. An embodiment may include identifying resources in a distributed computing environment, computing provisioning parameters, computing configuration parameters and quantifying service parameters in response to a set of service level agreements (SLA). The embodiment may further include iteratively computing a completion time required for completion of the assigned task and a cost. Embodiments may further include computing an optimal resources configuration and computing at least one of an optimal completion time and an optimal cost corresponding to the optimal resources configuration. Embodiments may further include dynamically modifying the optimal resources configuration in response to at least one change in at least one of provisioning parameters, computing parameters and quantifying service parameters. | 08-04-2011 |
20110225116 | SYSTEMS AND METHODS FOR POLICY BASED EXECUTION OF TIME CRITICAL DATA WAREHOUSE TRIGGERS - Embodiments of the invention broadly contemplate systems, methods, apparatuses and program products that employ a policy based method of timely execution of triggers over a data warehouse. Embodiments of the invention utilize methods to convert a trigger condition into differential queries over data sources, to decide when to send queries to data sources, and determine to which data sources to send queries. | 09-15-2011 |
20120150825 | Cleansing a Database System to Improve Data Quality - According to one embodiment of the present invention, a system controls cleansing of data within a database system, and comprises a computer system including at least one processor. The system receives a data set from the database system, and one or more features of the data set are selected for determining values for one or more characteristics of the selected features. The determined values are applied to a data quality estimation model to determine data quality estimates for the data set. Problematic data within the data set are identified based on the data quality estimates, where the cleansing is adjusted to accommodate the identified problematic data. Embodiments of the present invention further include a method and computer program product for controlling cleansing of data within a database system in substantially the same manner described above. | 06-14-2012 |
20120179658 | Cleansing a Database System to Improve Data Quality - According to one embodiment of the present invention, a system controls cleansing of data within a database system, and comprises a computer system including at least one processor. The system receives a data set from the database system, and one or more features of the data set are selected for determining values for one or more characteristics of the selected features. The determined values are applied to a data quality estimation model to determine data quality estimates for the data set. Problematic data within the data set are identified based on the data quality estimates, where the cleansing is adjusted to accommodate the identified problematic data. Embodiments of the present invention further include a method and computer program product for controlling cleansing of data within a database system in substantially the same manner described above. | 07-12-2012 |
20120259836 | Analyzing XML Data - Embodiments of the invention disclose a method, a system and a computer program product of discovering automated insights in XML data by generating a query result in response to querying data using a query, wherein the data is in a markup language format, and identifying a pattern associated with the query result, wherein the data in the markup language format is used for pattern identification. | 10-11-2012 |
20120259837 | Analyzing XML Data - Embodiments of the invention disclose a method, a system and a computer program product of discovering automated insights in XML data by generating a query result in response to querying data using a query, wherein the data is in a markup language format, and identifying a pattern associated with the query result, wherein the data in the markup language format is used for pattern identification. | 10-11-2012 |
20120265764 | FILE SEARCHING ON MOBILE DEVICES - Systems and methods are provided for file searching on mobile devices. A system includes a user interface and a file query system. The user interface is for receiving a user-provided spatio-temporal query for use in searching for a particular file. The user-provided spatio-temporal query is provided by a user of a mobile device. The file query system is for determining information about the particular file responsive to the user-provided spatio-temporal query, and identifying from the information one or more files as a search result for the particular file. | 10-18-2012 |
20130006974 | FILE SEARCHING ON MOBILE DEVICES - Systems and methods are provided for file searching on mobile devices. A system includes a user interface and a file query system. The user interface is for receiving a user-provided spatio-temporal query for use in searching for a particular file. The user-provided spatio-temporal query is provided by a user of a mobile device. The file query system is for determining information about the particular file responsive to the user-provided spatio-temporal query, and identifying from the information one or more files as a search result for the particular file. | 01-03-2013 |
20130006992 | Adapting Data Quality Rules Based Upon User Application Requirements - During application of data quality rules to a data set obtained from a data source, data is retrieved from the data source along with a common set of rules configured to format the retrieved data in a manner in accordance with one or more predefined data quality rules of the common set of rules. At least one predefined data quality rule is adjusted utilizing at least one editable widget to form a modified set of data quality rules adapted for use with a specified application. The modified set of data quality rules is applied to the retrieved data. | 01-03-2013 |
20130007589 | INTERACTION VIA SHORT MESSAGE SERVICE MESSAGES WITH WIRELESS MARKUP LANGUAGE BASED WEBSITES - A computational device receives a first text message in a text messaging format from a mobile device to access a website that stores information in a markup language format. The computational device converts one or more elements of the stored information from the markup language format to the text messaging format. The computational device sends a second text message that indicates how to interact with the website in the text messaging format to the mobile device. | 01-03-2013 |
20130007629 | Adapting Data Quality Rules Based Upon User Application Requirements - During application of data quality rules to a data set obtained from a data source, data is retrieved from the data source along with a common set of rules configured to format the retrieved data in a manner in accordance with one or more predefined data quality rules of the common set of rules. At least one predefined data quality rule is adjusted utilizing at least one editable widget to form a modified set of data quality rules adapted for use with a specified application. The modified set of data quality rules is applied to the retrieved data. | 01-03-2013 |
20130086116 | DECLARATIVE SPECIFICATION OF DATA INTEGRATION WORKFLOWS FOR EXECUTION ON PARALLEL PROCESSING PLATFORMS - A method for receiving a declarative specification including a plurality of stages. Each stage specifies an atomic operation, a data input to the atomic operation, and a data output from the atomic operation. The data input is characterized by a data type. Links between at least two of the stages are generated to create a data integration workflow. The data integration workflow is compiled to generate computer code for execution on a parallel processing platform. The computer code is configured to perform at least one of data preparation and data analysis. | 04-04-2013 |
20130218843 | INTELLIGENT DATA ARCHIVING - Methods and apparatus, including computer program products, implementing and using techniques for archiving an active record in a database. An auxiliary table is created in the database. The auxiliary table can store metadata information about active records to be archived and deleted from the database. An active record to be deleted from the database is selected. Metadata information is stored in the auxiliary table for the selected active record. The selected active record is archived into a data storage device. The selected active record is deleted from the database, based on the update information in the auxiliary table, without comparing column data for the archived record in the data storage device and the active record in the database prior to deletion of the active record from the database. | 08-22-2013 |
20130238610 | Automatically Mining Patterns For Rule Based Data Standardization Systems - Computer program products and systems are provided for mining for sub-patterns within a text data set. The embodiments facilitate finding a set of N frequently occurring sub-patterns within the data set, extracting the N sub-patterns from the data set, and clustering the extracted sub-patterns into K groups, where each extracted sub-pattern is placed within the same group with other extracted sub-patterns based upon a distance value D that determines a degree of similarity between the sub-pattern and every other sub-pattern within the same group. | 09-12-2013 |
20130238611 | Automatically Mining Patterns for Rule Based Data Standardization Systems - Methods, computer program products and systems are provided for mining for sub-patterns within a text data set. The embodiments facilitate finding a set of N frequently occurring sub-patterns within the data set, extracting the N sub-patterns from the data set, and clustering the extracted sub-patterns into K groups, where each extracted sub-pattern is placed within the same group with other extracted sub-patterns based upon a distance value D that determines a degree of similarity between the sub-pattern and every other sub-pattern within the same group. | 09-12-2013 |
20130254237 | DECLARATIVE SPECIFICATION OF DATA INTEGRATION WORKFLOWS FOR EXECUTION ON PARALLEL PROCESSING PLATFORMS - A method for receiving a declarative specification including a plurality of stages. Each stage specifies an atomic operation, a data input to the atomic operation, and a data output from the atomic operation. The data input is characterized by a data type. Links between at least two of the stages are generated to create a data integration workflow. The data integration workflow is compiled to generate computer code for execution on a parallel processing platform. The computer code is configured to perform at least one of data preparation and data analysis. | 09-26-2013 |
20130265326 | DISCOVERING A REPORTING MODEL FROM AN EXISTING REPORTING ENVIRONMENT - Computer software is disclosed for discovering and representing a reporting model of an existing reporting environment. For each report in a plurality of reports, the software searches metadata of the report for descriptive information and dependencies on other reports. The software depicts, in a graphical representation, each report and relationships between the reports. | 10-10-2013 |
20130297412 | DELIVERING PERSONALIZED RECOMMENDATIONS THAT RELATE TO TRANSACTIONS ON DISPLAY - Provided are techniques for providing personalized recommendations. One or more transactions are received from one or more customer interaction channels. The received one or more transactions are stored in an incremental data store. One or more predictive rules are generated based on the received one or more transactions and based on one or more transactions previously stored in the incremental data store. In real-time, one or more personalized recommendations specific to a user and to the received one or more transactions are generated using the one or more generated predictive rules. | 11-07-2013 |
20130325881 | Supplementing Structured Information About Entities With Information From Unstructured Data Sources - According to a present invention embodiment, a system supplements structured information within a data system for entities based on unstructured data. The system analyzes a document with unstructured data and extracts attribute values from the unstructured data for one or more entities of the data system. Entity records with structured information are retrieved from the data system based on the extracted attribute values. Entity references for corresponding entities of the data system are constructed based on a comparison of the retrieved entity records and the extracted attribute values. The entity references are linked to the corresponding entities within the data system, wherein the entity references include extracted attributes from the unstructured data for corresponding linked entities. Embodiments of the present invention further include a method and computer program product for supplementing structured information within a data system for entities based on unstructured data in substantially the same manner described above. | 12-05-2013 |
20130325882 | SUPPLEMENTING STRUCTURED INFORMATION ABOUT ENTITIES WITH INFORMATION FROM UNSTRUCTURED DATA SOURCES - A method for supplementing structured information within a data system for entities based on unstructured data analyzes a document with unstructured data and extracts attribute values from the unstructured data for one or more entities of the data system. Entity records with structured information are retrieved from the data system based on the extracted attribute values. Entity references for corresponding entities of the data system are constructed based on a comparison of the retrieved entity records and the extracted attribute values. The entity references are linked to the corresponding entities within the data system, with the entity references including extracted attributes from the unstructured data for corresponding linked entities. | 12-05-2013 |
20140095527 | EXPANDING HIGH LEVEL QUERIES - A received query is modified by identifying one or more words or phrases in the query that have known associations with stored data upon which the query can be executed, determining whether each of the one or more words or phrases is associated with an attribute name corresponding to stored data, determining whether each of the one or more words or phrases is associated with any constraints regarding an attribute name, determining whether each of the one or more words or phrases is associated with a rule regarding structure or syntax of the query, substituting any determined attribute names into the query for each of the respective associated one or more words or phrases, adding any determined constraints to the query limiting a determined attribute name, and applying any determined rules associated with the one or more words or phrases. | 04-03-2014 |
20140358956 | EXPANDING HIGH LEVEL QUERIES - One or more computer processors identify words in a natural language query that have previously-established associations with stored data upon which the query will be executed. The natural language query is formatted to a query template that includes at least one first component and at least one second component. One or more computer processors modify the at least one first component of the query template based on at least one link between the one or more words in the natural language query and the at least one first component. One or more computer processors generate a new query by adding one or more values to the query template. The one or more values are added to the query template based on the modification of the at least one first component. One or more computer processors execute a search based on the new query. | 12-04-2014 |
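
The query expansion described in the abstract for application 20140095527 (EXPANDING HIGH LEVEL QUERIES) can be illustrated with a minimal sketch. It is not the claimed implementation: the `Association` class, the `ASSOCIATIONS` table, the attribute names, the constraint strings, and the `expand_query` function are all hypothetical, and the structure or syntax rules mentioned in the abstract are left out. The sketch only shows the two simplest steps, substituting attribute names for known phrases and collecting the constraints associated with them.

```python
import re
from dataclasses import dataclass
from typing import Optional


@dataclass
class Association:
    """What a known word or phrase maps to (all values below are made up)."""
    attribute: Optional[str] = None   # attribute name to substitute into the query
    constraint: Optional[str] = None  # constraint to add to the query


# Hypothetical association table; a real system would derive it from the stored data.
ASSOCIATIONS = {
    "customers": Association(attribute="customer_name"),
    "recent": Association(constraint="order_date >= '2014-01-01'"),
    "big spenders": Association(attribute="customer_name",
                                constraint="total_spend > 10000"),
}


def expand_query(query, associations):
    """Return (rewritten_query, constraints) for a high-level query string."""
    rewritten = query
    constraints = []
    # Try longer phrases first so multi-word phrases are matched before single words.
    for phrase in sorted(associations, key=len, reverse=True):
        pattern = re.compile(r"\b" + re.escape(phrase) + r"\b", re.IGNORECASE)
        if not pattern.search(rewritten):
            continue
        assoc = associations[phrase]
        if assoc.constraint:
            constraints.append(assoc.constraint)
        # Substitute the attribute name, or drop a phrase that only carries a constraint.
        rewritten = pattern.sub(assoc.attribute or "", rewritten)
    # Collapse any double spaces left behind by removed phrases.
    return " ".join(rewritten.split()), constraints


if __name__ == "__main__":
    print(expand_query("list recent customers", ASSOCIATIONS))
```

With the made-up table above, `expand_query("list recent customers", ASSOCIATIONS)` returns `("list customer_name", ["order_date >= '2014-01-01'"])`: the word "customers" is replaced by an attribute name, and "recent" is removed from the text and turned into a constraint.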