Patent application number | Description | Published |
20110029571 | Query Optimization Over Graph Data Streams - An illustrative embodiment includes a method for executing a query on a graph data stream. The graph stream comprises data representing edges that connect vertices of a graph. The method comprises constructing a plurality of synopsis data structures based on at least a subset of the graph data stream. Each vertex connected to an edge represented within the subset of the graph data stream is assigned to a synopsis data structure such that each synopsis data structure represents a corresponding section of the graph. The method further comprises mapping each received edge represented within the graph data stream onto the synopsis data structure which corresponds to the section of the graph which includes that edge, and using the plurality of synopsis data structures to execute the query on the graph data stream. | 02-03-2011 |
20110074786 | Dimensional Reduction Mechanisms for Representing Massive Communication Network Graphs for Structural Queries - Mechanisms are provided for transforming an original graph data set into a representative form having a smaller number of dimensions that the original graph data set. The mechanisms generate a graph transformation basis structure based on an input graph data structure. The mechanisms further transform an original graph data set based on an intersection of the graph transformation basis structure and the input graph data structure to thereby generate a transformed graph data set data structure. The transformed graph data set data structure has a reduced dimensionality from that of the input graph data structure but represents characteristics of the original graph data set. Moreover, the mechanisms perform an application specific operation on the transformed graph data set data structure to generate an output of a closest similarity record in the transformed graph data set to a target component. | 03-31-2011 |
20110078143 | Mechanisms for Privately Sharing Semi-Structured Data - Mechanisms are provided for anonymizing data comprising a plurality of graph data sets. The mechanisms receive input data comprising a plurality of graph data sets. Each graph data set comprises data for generating a separate graph from graphs associated with other graph data sets. The mechanisms perform clustering on the graph data sets to generate a plurality of clusters. At least one cluster of the plurality of clusters comprises a plurality of graph data sets. Other clusters in the plurality of clusters comprise one or more graph data sets. The mechanisms also determine, for each cluster in the plurality of clusters, aggregate properties of the cluster. Moreover, the mechanisms generate, for each cluster in the plurality of clusters, pseudo-synthetic data representing the cluster, from the determined aggregate properties of the clusters. | 03-31-2011 |
20110213740 | SYSTEM AND METHOD FOR RESOURCE ADAPTIVE CLASSIFICATION OF DATA STREAMS - A system and method for resource adaptive classification of data streams. Embodiments of systems and methods provide classifying data received in a computer, including discretizing the received data, constructing an intermediate data structure from said received data as training instances, performing subspace sampling on said received data as test instances and adaptively classifying said received data based on statistics of said subspace sampling. | 09-01-2011 |
20110295832 | Identifying Communities in an Information Network - Techniques for identifying one or more communities in an information network are provided. The techniques include collecting one or more nodes and one or more edges from an information network, performing a random walk on the one or more nodes to produce a sequence of one or more nodes, creating a sequence database from one or more sequences produced via random walk, and mining the sequence database to determine one or more patterns in the network, wherein the one or more patterns identify one or more communities in the information network. | 12-01-2011 |
20120218908 | System and Method for Finding Important Nodes in a Network - Techniques for optimizing steady state flow of a network are provided. The techniques include determining a first set of two or more nodes in a network, computing a steady-state flow probability of the first set of two or more nodes, and iteratively interchanging nodes from a second set of two or more nodes into the first set of two or more nodes to determine an optimum total steady state flow of the network, wherein determining an optimum total steady-state flow of the network comprises iteratively interchanging nodes until no additional improvements in steady-state flow over the computed steady-state flow probability can be obtained. | 08-30-2012 |
20120269200 | Similarity Searching in Large Disk-Based Networks - Techniques for determining a shortest path in a disk-based network are provided. The techniques include creating a compressed representation of an underlying disk resident network graph, wherein creating a compressed representation of an underlying disk resident network graph comprises determining one or more dense regions in the disk resident graph and compacting the one or more dense regions into one or more compressed nodes, associating one or more node penalties with the one or more compressed nodes, wherein the one or more node penalties reflect a distance of a sub-path within a compressed node, and performing a query on the underlying disk resident network graph using the compressed representation and one or more node penalties to determine a shortest path in the disk-based network to reduce the number of accesses to a physical disk. | 10-25-2012 |
20140047089 | SYSTEM AND METHOD FOR SUPERVISED NETWORK CLUSTERING - A method (and system) for supervised network clustering includes receiving and reading node labels from a plurality of nodes on a network, as executed by a processor on a computer having access to the network, the network defined as a group of entities interconnected by links. The node labels are used to define densities associated with the nodes. Node components are extracted from the network, based on using thresholds on densities. Smaller components having a size below a user-defined threshold are merged. | 02-13-2014 |
20140052673 | SYSTEM, METHOD AND COMPUTER PROGRAM PRODUCT FOR CLASSIFICATION OF SOCIAL STREAMS - A method of labeling an unlabeled message of a social stream. The method including training a training model based on labeled messages, partitioning the training model into a plurality of class partitions, each comprising statistical information and a class label, computing a confidence for each of the class partitions based on information of an unlabeled message and the statistical information of a respective class partition, as executed by a processor in a computer system, and labeling the unlabeled message of the social stream according to respective confidences of the class partitions. | 02-20-2014 |
20140052674 | SYSTEM, METHOD AND COMPUTER PROGRAM PRODUCT FOR CLASSIFICATION OF SOCIAL STREAMS - A system that labels an unlabeled message of a social stream. The system including a memory device storing instructions to execute a training model, the training model being trained based on labeled messages, and partitioned into a plurality of class partitions, each of which comprise statistical information and a class label, and a Central Processing Unit (CPU) that computes a confidence for each of the class partitions based on information of an unlabeled message and the statistical information of a respective class partition, and that labels the unlabeled message according to respective confidences of the class partitions. | 02-20-2014 |
20140122540 | Dimensional Reduction Mechanisms for Representing Massive Communication Network Graphs for Structural Queries - Mechanisms are provided for transforming an original graph data set into a representative form having a smaller number of dimensions that the original graph data set. The mechanisms generate a graph transformation basis structure based on an input graph data structure. The mechanisms further transform an original graph data set based on an intersection of the graph transformation basis structure and the input graph data structure to thereby generate a transformed graph data set data structure. The transformed graph data set data structure has a reduced dimensionality from that of the input graph data structure but represents characteristics of the original graph data set. Moreover, the mechanisms perform an application specific operation on the transformed graph data set data structure to generate an output of a closest similarity record in the transformed graph data set to a target component. | 05-01-2014 |