Patent application number | Description | Published |
20080243832 | Method and System for Parsing Languages - Embodiments of systems and methods for comparing attributes of a data record are presented herein. In some embodiments, a weight is based on a comparison of the name (or other) attributes of data records. In some embodiments, an information score may be calculated for each of two name attributes to be compared to get an average information score for the two name attributes. The two name attributes may then be compared against one another to generate a weight between the two attributes. This weight can then be normalized to generate a final weight between the two business name attributes. Comparing attributes according to embodiments disclosed herein can facilitate linking data records even if they comprise attributes in languages which do not use the Latin alphabet. | 10-02-2008 |
20080244008 | METHOD AND SYSTEM FOR DATA EXCHANGE AMONG DATA SOURCES - Systems and methods for joining results in loosely collaborative data exchanges which include a plurality of nodes in communication with each other wherein a node is constrained from sharing payload data regarding a first subject, is allowed to share matching data regarding the first subject, and is further constrained to be separate from the other nodes. In some embodiments, a method includes receiving a set of virtual results from the constrained node including the matching data and excluding the payload data. The method also includes receiving results from the other nodes and determining whether any of the results correspond to the same subject. The method also includes, when any of the results correspond to the same subject, joining the corresponding portions in a hub separate from the constrained node. The joined results can be output. | 10-02-2008 |
20090089317 | Method and system for indexing, relating and managing information about entities - Systems and methods for use in association with a master entity index system may allow data records to be grouped together into various entities, where each of the entities may represent a logical or physical item. These entities may also be associated with one another in a manner such that relationships between entities may likewise be represented. In one embodiment, an interface may be provided for use in conjunction with the master entity index system such that these various entities and relationships may be better managed, manipulated or visualized. This interface may allow a user to search for or otherwise obtain an entity, where a representation of this entity and one or more associated entities may be presented to the user along with representations of the relationships between these entities. | 04-02-2009 |
20090089332 | METHOD AND SYSTEM FOR ASSOCIATING DATA RECORDS IN MULTIPLE LANGUAGES - Embodiments disclosed herein provide a system and method for associating data records in multiple languages within a single hub. As a record comes in from an information source coupled to the hub, it is associated with a particular language at a core layer. The hub maps each language one-to-one to a member type. For each data record of a particular member type, unique derivation code is utilized to perform standardization and bucketing at a derived layer. A weight may be used to balance the richness of languages so that data records in different languages can have the same statistical meaning. Since attributes are standardized with respect to a language of a data record, appropriate languages or script can be passed along with the data record. The hub can then match the data record to the optimum algorithm(s) for entity processing at an entity layer. | 04-02-2009 |
20090089630 | METHOD AND SYSTEM FOR ANALYSIS OF A SYSTEM FOR MATCHING DATA RECORDS - Embodiments disclosed herein provide a system and method for analyzing an identity hub. Particularly, a user can connect to the identity hub, load an initial set of data records, create and/or edit an identity hub configuration locally, analyze and/or validate the configuration via a set of analysis tools, including an entity analysis tool, a data analysis tool, a bucket analysis tool, and a linkage analysis tool, and remotely deploy the validated configuration to an identity hub instance. In some embodiments, through a graphical user interface, these analysis tools enable the user to analyze and modify the configuration of the identity hub in real time while the identity hub is operating to ensure data quality and enhance system performance. | 04-02-2009 |
20100114877 | Method and System for Filtering False Positives - Embodiments of systems and methods for reducing false positives during the linking of data records are disclosed herein. Broadly speaking, embodiments of the present invention may be used in the generation of an overall weight from the comparison of various attributes of data records, where the linking of the data records is dependent on the overall weight. More specifically, embodiments of the present invention may calculate a false positive penalty based on a set of results, each of the set of results based on a comparison of an attribute. The false positive penalty may be subtracted from the overall weight generated from the comparison of the attributes of data records to adjust the overall weight. By configuring which attributes of the data records are used as the set of attributes for generating the false positive penalty, and the penalties associated with a particular combination of results for the comparisons of these attributes, the incidence of false positives in the linking of data records may be significantly reduced. | 05-06-2010 |
20100174725 | METHOD AND SYSTEM FOR COMPARING ATTRIBUTES SUCH AS BUSINESS NAMES - Embodiments of systems and methods for comparing attributes of a data record are presented herein. Broadly speaking, embodiments of the present invention generate a weight based on a comparison of the name (or other) attributes of data records. More particularly, embodiments of the present invention generate a weight based on a comparison of name attributes. More specifically, embodiments of the present invention may calculate an information score for each of two name attributes to be compared to get an average information score for the two name attributes. The two name attributes may then be compared against one another to generate a weight between the two attributes. This weight can then be normalized to generate a final weight between the two business name attributes. | 07-08-2010 |
20100175024 | SYSTEM AND METHOD FOR AUTOMATIC WEIGHT GENERATION FOR PROBABILISTIC MATCHING - Embodiments of the invention provide a system and method of automatically generating weights for matching data records. Each field of a record may be compared by an exact match and/or close matches and each comparison can result in a mathematical score which is the sum of the field comparisons. To sum up the field scores accurately, the automatic weight generation process comprises an iterative process. In one embodiment, initial weights are computed based upon unmatched-set probabilities and default discrepancy weights associated with attributes in the comparison algorithm. A bulk cross-match is performed across the records using the initial weights and a candidate matched set is computed for updating the discrepancy probabilities. New weights are computed based upon the unmatched probabilities and the updated discrepancy probabilities. Test for convergence between the new weights and the old weights. Repeat with the new weight table until the weights converge to their final value. | 07-08-2010 |
20110010401 | GRAPHICAL USER INTERFACE FOR THE CONFIGURATION OF AN ALGORITHM FOR THE MATCHING OF DATA RECORDS - Systems and methods for an interface for the configuration of a system for the association of data records are disclosed. Embodiments of the invention provide a graphical interface for manipulating a representation of an algorithm for the configuration of such a system, including the configuration of how a system transforms data records from their native format to a standard data format, selects set of candidates and associates data records. | 01-13-2011 |
20110191349 | Method and System For Indexing, Relating and Managing Information About Entities - Systems and methods for use in association with a master entity index system may allow data records to be grouped together into various entities, where each of the entities may represent a logical or physical item. These entities may also be associated with one another in a manner such that relationships between entities may likewise be represented. In one embodiment, an interface may be provided for use in conjunction with the master entity index system such that these various entities and relationships may be better managed, manipulated or visualized. This interface may allow a user to search for or otherwise obtain an entity, where a representation of this entity and one or more associated entities may be presented to the user along with representations of the relationships between these entities. | 08-04-2011 |