Clearwell Systems, Inc. Patent Applications
Patent application number | Title | Published |
20120296891 | METHODS AND SYSTEMS FOR AUTOMATIC EVALUATION OF ELECTRONIC DISCOVERY REVIEW AND PRODUCTIONS - Techniques are provided for automatic sampling evaluation. An automatic sampling evaluation system enables users to evaluate convergence of one or more search processes. For example, given a set of searches that were validated by human review, a system can implement a retrieval process that samples one or more non-retrieved collections. Each document in the one or more non-retrieved collections is automatically evaluated for similarity to the documents in any retrieved sets. Given a goal of achieving high recall, documents with high similarity can then be analyzed for additional noun phrases that may be used in the next iteration of a search. Convergence can be expected if the information gain in the new feedback loop is less than in previous iterations, and if the number of additional documents identified is below a certain threshold document count. | 11-22-2012 |
20120209853 | METHODS AND SYSTEMS TO EFFICIENTLY FIND SIMILAR AND NEAR-DUPLICATE EMAILS AND FILES - A set of trigrams can be generated for each document in a plurality of documents processed by an e-discovery system. Each trigram in the set of trigrams for a given document is a sequence of three terms in the given document. A set of trigrams for each similar document is then determined based on the set of trigrams for the original document. To facilitate identification of the similar documents, a full text index is then generated for the plurality of documents, and the set of trigrams for each document is indexed into the full text index as individual terms. Queries can be generated into the full text index based on the trigrams of a document to determine other similar or near-duplicate documents. After a set of potentially similar documents is identified, a separate distance criterion can be applied to efficiently evaluate the level of similarity between the documents. | 08-16-2012 |
20120209847 | METHODS AND SYSTEMS FOR AUTOMATICALLY GENERATING SEMANTIC/CONCEPT SEARCHES - In various embodiments, a semantic space associated with a corpus of electronically stored information (ESI) may be created and used for concept searches. Documents (and any other objects in the ESI, in general) may be represented as vectors in the semantic space. Vectors may correspond to identifiers, such as, for example, indexed terms. The semantic space for a corpus of ESI can be used in information filtering, information retrieval, indexing, and relevancy rankings. | 08-16-2012 |
20120158728 | SYSTEMS AND METHODS FOR TAGGING EMAILS BY DISCUSSIONS - The invention provides techniques to process and produce email documents. The techniques provide for organizing a first plurality of email documents into a plurality of document groups, reviewing a document group from the plurality of document groups, and associating review content with the document group. The techniques provide ways to propagate the review content to one or more email documents associated with the document group and to produce a second plurality of email documents. The techniques provide for annotating one or more email documents in accordance with the review content. Depending on the embodiment, review content may include text, graphics, audio, tags, and multimedia information. Produced documents can be searched and browsed in accordance with information in the review content. Email documents can be grouped by meta information and/or header information associated with the email documents into various groups, including threads or conversations, for example. | 06-21-2012 |
20100030798 | Systems and Methods for Tagging Emails by Discussions - The invention provides techniques to process and produce email documents. The techniques provide for organizing a first plurality of email documents into a plurality of document groups, reviewing a document group from the plurality of document groups, and associating review content with the document group. The techniques provide ways to propagate the review content to one or more email documents associated with the document group and to produce a second plurality of email documents. The techniques provide for annotating one or more email documents in accordance with the review content. Depending on the embodiment, review content may include text, graphics, audio, tags, and multimedia information. Produced documents can be searched and browsed in accordance with information in the review content. Email documents can be grouped by meta information and/or header information associated with the email documents into various groups, including threads or conversations, for example. | 02-04-2010 |
20090265609 | Method and System for Producing and Organizing Electronically Stored Information - The invention provides techniques for efficiently organizing and reviewing electronic documents to be produced in the course of a discovery process. The technique provides for marking a master or pivot document with review information, and identifying a plurality of duplicate documents related to the master or pivot document. The technique provides for reviewing the master or pivot document and propagating the review information to the set of related documents. The technique provides for producing a plurality of electronic documents where each of the electronic documents is marked up in accordance with the review information. The method provides for organizing the plurality of electronic documents so that they can be presented and searched in an efficient manner. | 10-22-2009 |
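
The trigram approach summarized in application 20120209853 can be illustrated with a minimal sketch. This is not the patented implementation. It assumes whitespace tokenization, uses an in-memory inverted index as a stand-in for the full text index, and uses Jaccard similarity as the separate distance criterion. The function names (trigrams, build_index, find_similar) and the 0.5 threshold are hypothetical choices made for illustration.

```python
from collections import defaultdict


def trigrams(text):
    """Return the set of word trigrams (three consecutive terms) in text."""
    terms = text.lower().split()  # assumption: simple whitespace tokenization
    return {" ".join(terms[i:i + 3]) for i in range(len(terms) - 2)}


def build_index(docs):
    """Index every trigram as an individual term: trigram -> set of doc ids."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for tri in trigrams(text):
            index[tri].add(doc_id)
    return index


def jaccard(a, b):
    """Assumed distance criterion: overlap of two trigram sets, 0.0 to 1.0."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def find_similar(doc_id, docs, index, threshold=0.5):
    """Query the index with a document's trigrams, then score the candidates."""
    query = trigrams(docs[doc_id])
    candidates = set()
    for tri in query:
        candidates |= index.get(tri, set())
    candidates.discard(doc_id)  # a document is trivially similar to itself
    scored = [(other, jaccard(query, trigrams(docs[other]))) for other in candidates]
    # Keep candidates at or above the threshold, most similar first.
    return sorted(
        [(other, score) for other, score in scored if score >= threshold],
        key=lambda pair: (-pair[1], pair[0]),
    )


docs = {
    "a": "please review the attached contract before friday",
    "b": "please review the attached contract before monday",
    "c": "lunch menu for the team offsite",
}
index = build_index(docs)
result = find_similar("a", docs, index)
```

In this toy corpus, documents "a" and "b" share four of six distinct trigrams, so "b" is returned as a near-duplicate of "a", while "c" shares no trigram and is never scored. Querying the index first keeps the more expensive pairwise comparison limited to documents that share at least one trigram.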