Patent application number | Description | Published |
20080250265 | SYSTEMS AND METHODS FOR PREDICTIVE FAILURE MANAGEMENT - A system and method for using continuous failure predictions for proactive failure management in distributed cluster systems includes a sampling subsystem configured to continuously monitor and collect operation states of different system components. An analysis subsystem is configured to build classification models to perform on-line failure predictions. A failure prevention subsystem is configured to take preventive actions on failing components based on failure warnings generated by the analysis subsystem. | 10-09-2008 |
20090024618 | SYSTEM AND METHOD FOR INDEXING WEIGHTED-SEQUENCES IN LARGE DATABASES - The present invention provides an index structure for managing weighted-sequences in large databases. A weighted-sequence is defined as a two-dimensional structure in which each element in the sequence is associated with a weight. A series of network events, for instance, is a weighted-sequence because each event is associated with a timestamp. Querying a large sequence database by events' occurrence patterns is a first step towards understanding the temporal causal relationships among the events. The index structure proposed herein enables the efficient retrieval from the database of all subsequences (contiguous and non-contiguous) that match a given query sequence both by events and by weights. The index structure also takes into consideration the nonuniform frequency distribution of events in the sequence data. | 01-22-2009 |
20090077148 | Methods and Apparatus for Perturbing an Evolving Data Stream for Time Series Compressibility and Privacy - Techniques for perturbing an evolving data stream are provided. The evolving data stream is received. An online linear transformation is applied to received values of the evolving data stream generating a plurality of transform coefficients. A plurality of significant transform coefficients are selected from the plurality of transform coefficients. Noise is embedded into each of the plurality of significant transform coefficients, thereby perturbing the evolving data stream. A total noise variance does not exceed a defined noise variance threshold. | 03-19-2009 |
20090086755 | SYSTEMS AND METHODS FOR CORRELATION OF BURST EVENTS AMONG DATA STREAMS - Systems and methods are disclosed for identifying correlated burst events among two or more data streams, given one or more specific query time spans. Also broadly contemplated is the act of finding, from one or more data streams, those streams that have correlated burst events with another given data stream within a time span. | 04-02-2009 |
20090094265 | SYSTEMS AND METHODS FOR RIGHTS PROTECTION OF DATASETS WITH DATASET STRUCTURE PRESERVATION - A system and method for rights protection of a dataset that includes multiple trajectory objects includes determining an intensity power for embedding a watermarking key in a data trajectory. The data trajectory is modified to embed a watermarking key at the intensity power such that the intensity power guarantees an original pair-wise relationship between distance-based neighboring objects before and after embedding of the key such that a modified trajectory provides a watermarked version of the data trajectory. | 04-09-2009 |
20090100014 | Methods and Apparatus for Adaptive Source Filtering and Load Shedding for Data Stream Processing - Techniques are disclosed for adaptive source filtering and load shedding in data stream processing systems. For example, in one aspect of the invention, a method for use in filtering data in a distributed data stream processing system, wherein a server receives and processes one or more data streams from one or more data sources, comprises the steps of the server periodically re-configuring one or more filters and sending the one or more periodically re-configured filters to the one or more data sources, and the one or more data sources performing data filtering based on the one or more periodically re-configured filters received from the server. | 04-16-2009 |
20090204574 | SYSTEMS AND METHODS FOR COMPUTATION OF OPTIMAL DISTANCE BOUNDS ON COMPRESSED TIME-SERIES DATA - There are provided a method and a system for computation of optimal distance bounds on compressed time-series data. In a method for similarity search, the method includes the step of transforming sequence data into a compressed sequence represented by top-k coefficients of the sequence data and a sum of the energy of omitted coefficients of the sequence data. The method further includes the step of computing at least one of a lower bound and an upper bound on a distance range between a query sequence and the compressed sequence, given a first and a second constraint. The first constraint is that a sum of squares of the omitted coefficients is less than a sum of the energy of the omitted coefficients. The second constraint is that the energy of the omitted coefficients is less than the energy of a lowest energy one of the top-k coefficients. | 08-13-2009 |
20090248749 | System and Method for Scalable Processing of Multi-Way Data Stream Correlations - A computer implemented method, apparatus, and computer usable program code for processing multi-way stream correlations. Stream data are received for correlation. A task is formed for continuously partitioning a multi-way stream correlation workload into smaller workload pieces. Each of the smaller workload pieces may be processed by a single host. The stream data are sent to different hosts for correlation processing. | 10-01-2009 |
20090319526 | Method and Apparatus for Variable Privacy Preservation in Data Mining - Improved privacy preservation techniques are disclosed for use in accordance with data mining. By way of example, a technique for preserving privacy of data records for use in a data mining application comprises the following steps/operations. Different privacy levels are assigned to the data records. Condensed groups are constructed from the data records based on the privacy levels, wherein summary statistics are maintained for each condensed group. Pseudo-data is generated from the summary statistics, wherein the pseudo-data is available for use in the data mining application. Principles of the invention are capable of handling both static and dynamic data sets. | 12-24-2009 |
20100268734 | SYSTEM AND METHOD FOR DISTRIBUTED PRIVACY PRESERVING DATA MINING - Distributed privacy preserving data mining techniques are provided. A first entity of a plurality of entities in a distributed computing environment exchanges summary information with a second entity of the plurality of entities via a privacy-preserving data sharing protocol such that the privacy of the summary information is preserved, the summary information associated with an entity relating to data stored at the entity. The first entity may then mine data based on at least the summary information obtained from the second entity via the privacy-preserving data sharing protocol. The first entity may obtain, from the second entity via the privacy-preserving data sharing protocol, information relating to the number of transactions in which a particular itemset occurs and/or information relating to the number of transactions in which a particular rule is satisfied. | 10-21-2010 |
20100281028 | SYSTEM AND METHOD FOR INDEXING A DATA STREAM - There are provided methods, computer program products, and systems for indexing a data stream. A method for indexing a data stream having attribute values includes the steps of parsing the data stream, and forming an index of tuples for a subset of attribute values of the data stream. The index is configured for retrieving the top-K tuples that optimize linearly weighted sums of at least some of the attribute values in the subset. | 11-04-2010 |
20100329563 | System and Method for Real-time New Event Detection on Video Streams - Techniques are disclosed for detecting new events in a video stream that yield improved detection efficiency in real time. For example, a method determines whether a given event is a new event in a video stream. The video stream includes a plurality of events. A first step extracts a first set of features (e.g., text features) from the given event. The first set of features is computationally less expensive to process as compared to a second set of features (e.g., image features) associated with the given event. A second step computes one or more first dissimilarity values between the given event and one or more previous events in the video stream using only the first set of features when one or more first dissimilarity criteria exist. A third step determines whether the given event is a new event based on the one or more computed first dissimilarity values. | 12-30-2010 |
20110213740 | SYSTEM AND METHOD FOR RESOURCE ADAPTIVE CLASSIFICATION OF DATA STREAMS - A system and method for resource adaptive classification of data streams. Embodiments of systems and methods provide classifying data received in a computer, including discretizing the received data, constructing an intermediate data structure from said received data as training instances, performing subspace sampling on said received data as test instances and adaptively classifying said received data based on statistics of said subspace sampling. | 09-01-2011 |
20130013873 | SYSTEM AND METHOD FOR OPTIMIZING DATA IN VALUE-BASED STORAGE SYSTEM - A storage system includes a plurality of data vats, and a processor including an optimizing unit that optimizes a value of data stored in the storage system. The optimizing unit optimizes the value by computing and implementing an optimal decision for allocating new data to a first data vat of the plurality of data vats, moving existing data from at least a second data vat of the plurality of data vats to the first data vat, and deleting existing data from the first data vat, based on an amount of data in each of the plurality of data vats. | 01-10-2013 |
20130304722 | RANGE QUERY METHODS AND APPARATUS - Range query techniques are disclosed for use in accordance with data stream processing systems. A technique is provided for incrementally processing continual range queries against moving objects. This may be done for location-aware services and applications. For example, a technique for evaluating one or more continual range queries over one or more moving objects comprises maintaining a query index with one or more containment-encoded virtual constructs associated with the one or more continual range queries over the one or more moving objects, and incrementally evaluating the one or more continual range queries using the query index. | 11-14-2013 |
20140344845 | SYSTEM AND METHOD FOR REAL-TIME NEW EVENT DETECTION ON VIDEO STREAMS - Techniques are disclosed for detecting new events in a video stream that yield improved detection efficiency in real time. For example, a method determines whether a given event is a new event in a video stream. The video stream includes a plurality of events. A first step extracts a first set of features (e.g., text features) from the given event. The first set of features is computationally less expensive to process as compared to a second set of features (e.g., image features) associated with the given event. A second step computes one or more first dissimilarity values between the given event and one or more previous events in the video stream using only the first set of features when one or more first dissimilarity criteria exist. A third step determines whether the given event is a new event based on the one or more computed first dissimilarity values. | 11-20-2014 |
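
As a rough illustration of the compressed representation described in application 20090204574 above (top-k coefficients plus the summed energy of the omitted coefficients), the sketch below keeps the k largest-magnitude coefficients and derives simple distance bounds from the triangle inequality. It is not the optimal bound computation the application claims. It assumes real-valued coefficients in an orthonormal basis, so that Euclidean distance is preserved, and the function names `compress` and `distance_bounds` are hypothetical.

```python
import math


def compress(coeffs, k):
    """Keep the k largest-magnitude coefficients and the total energy of the rest.

    Assumes `coeffs` are real coefficients in an orthonormal basis.
    """
    order = sorted(range(len(coeffs)), key=lambda i: abs(coeffs[i]), reverse=True)
    kept = {i: coeffs[i] for i in order[:k]}
    omitted_energy = sum(coeffs[i] ** 2 for i in order[k:])
    return kept, omitted_energy


def distance_bounds(query, kept, omitted_energy):
    """Return (lower, upper) bounds on the Euclidean distance between the
    uncompressed query and the sequence that was compressed.

    These are plain triangle-inequality bounds, not the optimal bounds
    described in the application.
    """
    # Exact contribution from positions where both coefficients are known.
    known = sum((query[i] - c) ** 2 for i, c in kept.items())
    # Norm of the query over the omitted positions, and of the omitted part itself.
    q_rest = math.sqrt(sum(q ** 2 for i, q in enumerate(query) if i not in kept))
    x_rest = math.sqrt(omitted_energy)
    lower = math.sqrt(known + (q_rest - x_rest) ** 2)
    upper = math.sqrt(known + (q_rest + x_rest) ** 2)
    return lower, upper


if __name__ == "__main__":
    sequence = [4.0, -3.0, 1.0, 0.5, -0.5]  # illustrative coefficients only
    query = [3.5, -2.0, 0.0, 1.0, 0.0]
    kept, energy = compress(sequence, k=2)
    print(distance_bounds(query, kept, energy))
```

The exact distance always lies between the two returned values. Tighter bounds need the additional constraint on the omitted coefficients that the abstract mentions.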