Kolcz
Aleksander Kolcz, Fairfax, VA US
| Patent application number | Description | Published |
|---|---|---|
| 20080319995 | RELIABILITY OF DUPLICATE DOCUMENT DETECTION ALGORITHMS - In a single-signature duplicate document system, a secondary set of attributes is used in addition to a primary set of attributes so as to improve the precision of the system. When the projection of a document onto the primary set of attributes is below a threshold, then a secondary set of attributes is used to supplement the primary lexicon so that the projection is above the threshold. | 12-25-2008 |
| 20100299290 | Web Query Classification - A query phrase may be automatically classified to one or more topics of interest (e.g., categories) to assist in routing the query phrase to one or more appropriate backend databases. A selectional preference query classification technique may be used to classify the query phrase based on a comparison between the query phrase and patterns of query phrases. Additionally, or alternatively, a combination of query classification techniques may be used to classify the query phrase. Topical classification of a query phrase also may be used to assist a search system in delivering auxiliary information to a user who entered the query phrase. Advertisements, for instance, may be tailored based on classification rather than query keywords. | 11-25-2010 |
| 20110276646 | RELIABILITY OF DUPLICATE DOCUMENT DETECTION ALGORITHMS - In a single-signature duplicate document system, a secondary set of attributes is used in addition to a primary set of attributes so as to improve the precision of the system. When the projection of a document onto the primary set of attributes is below a threshold, then a secondary set of attributes is used to supplement the primary lexicon so that the projection is above the threshold. | 11-10-2011 |
Aleksander Kolcz, Fairfax, VA US
| Patent application number | Description | Published |
|---|---|---|
| 20100191819 | Group Based Spam Classification - An e-mail filter is used to classify received e-mails so that some of the classes may be filtered, blocked, or marked. The e-mail filter may include a classifier that can classify an e-mail as belonging to a particular class and an e-mail grouper that can detect substantially similar, but possibly not identical, e-mails. The e-mail grouper determines groups of substantially similar e-mails in an incoming e-mail stream. For each group, the classifier determines whether one or more test e-mails from the group belong to the particular class. The classifier then designates the class to which the other e-mails in the group belong based on the results for the test e-mails. | 07-29-2010 |
Aleksander Kolcz, Kirkland, WA US
| Patent application number | Description | Published |
|---|---|---|
| 20090157720 | RAISING THE BASELINE FOR HIGH-PRECISION TEXT CLASSIFIERS - The claimed subject matter provides systems and/or methods for normalizing document representations for use with Naïve Bayes. The system can include devices and components that determine norms associated with documents by aggregating absolute term weight values associated with the documents, further ascertain term weights for features associated with the documents, and thereafter divide the term weights for the features by the norms associated with the documents to produce a normalized document representation that can be utilized by arbitrary linear classifiers. | 06-18-2009 |
| 20090222917 | DETECTING SPAM FROM METAFEATURES OF AN EMAIL MESSAGE - Detecting spam from metafeatures of an email message. As a part of detecting spam, the email message is accessed and a distribution of numerical values is accorded to a set of features of the email message. It is determined whether the distribution of numerical values accorded the set of features of the email message is consistent with that of spam. Access is provided to the determination of whether the email message has a distribution of numerical values accorded the set of features that is consistent with that of spam. | 09-03-2009 |
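
The normalization described in application 20090157720 (sum the absolute term weights of a document to get its norm, then divide each term weight by that norm) can be illustrated with a minimal sketch. The function name `l1_normalize`, the dictionary representation, and the sample weights are assumptions made for illustration only; the application does not specify any of them.

```python
from typing import Dict


def l1_normalize(term_weights: Dict[str, float]) -> Dict[str, float]:
    """Divide each term weight by the sum of absolute term weights.

    Hypothetical helper: the name and the dict-based document
    representation are illustrative, not taken from the application.
    """
    norm = sum(abs(w) for w in term_weights.values())
    if norm == 0:
        # An all-zero document has no direction to normalize; return a copy.
        return dict(term_weights)
    return {term: w / norm for term, w in term_weights.items()}


# Made-up term weights for a single document.
doc = {"offer": 3.0, "free": 1.0, "meeting": -1.0}
normalized = l1_normalize(doc)
```

After normalization the absolute weights of a document sum to 1, so long and short documents contribute on a comparable scale to a linear classifier.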
Aleksander Kolcz, Colorado Springs, CO US
| Patent application number | Description | Published |
|---|---|---|
| 20110125578 | FILTERING SYSTEM FOR PROVIDING PERSONALIZED INFORMATION IN THE ABSENCE OF NEGATIVE DATA - Systems and methods are provided for personalizing advertising for a user. In accordance with certain implementations, information is accessed indicating which documents were selected by a user and which documents were not selected by a user. At least one positive word vector is generated using words contained in at least one of the selected documents, and at least one negative word vector is generated using words contained in at least one of the unselected documents. Document word vectors are generated, and a document rank order is established based on a vector space relationship analysis. Categories associated with the documents are ranked based on the document rank order, and the ranked categories are sent to an ad server. Advertising material associated with the ranked categories may then be received from the ad server in a selected context. | 05-26-2011 |
Aleksander R. Kolcz, Kirkland, WA US
| Patent application number | Description | Published |
|---|---|---|
| 20110296524 | Campaign Detection - Campaign detection techniques are described. In implementations, a signature is computed for each of a plurality of emails to be communicated by a service provider to respective intended recipients. A determination is made that two or more of the plurality of emails are similar based on the respective signatures. Responsive to a finding that a number of similar emails exceeds a threshold, an indication is output that the similar emails have a likelihood of being involved in a spam campaign. | 12-01-2011 |
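
The flow described in application 20110296524 (compute a signature per email, group emails whose signatures match, and flag groups larger than a threshold) can be sketched roughly as follows. The signature used here, a SHA-1 hash of the body with case, digits, and punctuation stripped, is an assumption chosen for simplicity; the application does not say how signatures are computed, and the names `signature` and `flag_campaigns` are hypothetical.

```python
import hashlib
import re
from collections import defaultdict
from typing import Dict, List


def signature(body: str) -> str:
    """Hash a crudely normalized body (assumed scheme, not the patented one)."""
    # Keep only lowercase letters and spaces, then collapse whitespace.
    letters_only = re.sub(r"[^a-z ]+", " ", body.lower())
    normalized = " ".join(letters_only.split())
    return hashlib.sha1(normalized.encode("utf-8")).hexdigest()


def flag_campaigns(emails: List[str], threshold: int) -> List[List[int]]:
    """Return index groups whose size exceeds the threshold."""
    groups: Dict[str, List[int]] = defaultdict(list)
    for index, body in enumerate(emails):
        groups[signature(body)].append(index)
    return [indices for indices in groups.values() if len(indices) > threshold]


# Made-up outgoing messages.
emails = ["Win $100 now!!", "win $250 NOW", "Lunch at noon?", "WIN $5 now"]
suspected = flag_campaigns(emails, threshold=2)
```

With these sample messages, the three "win ... now" variants share one signature and form the only group larger than the threshold. An exact-match hash only catches near-duplicates that normalize to identical text; a production system would likely need a fuzzier signature.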
