Patent application number | Description | Published |
20080222125 | ANALYZING A QUERY LOG FOR USE IN MANAGING CATEGORY-SPECIFIC ELECTRONIC CONTENT - Providing category-specific electronic content includes receiving a request for electronic content. The request has an attribute. The attribute of the received request is compared to an attribute related to a query included in a log of search queries. An indication of a category that represents a search query from a log of search queries that is related to an attribute that matches the attribute of the received request is accessed, and electronic content that is representative of the identified category is accessed and provided. | 09-11-2008 |
20080319995 | RELIABILITY OF DUPLICATE DOCUMENT DETECTION ALGORITHMS - In a single-signature duplicate document system, a secondary set of attributes is used in addition to a primary set of attributes so as to improve the precision of the system. When the projection of a document onto the primary set of attributes is below a threshold, then a secondary set of attributes is used to supplement the primary lexicon so that the projection is above the threshold. | 12-25-2008 |
20090222444 | QUERY DISAMBIGUATION - A search query is resolved prior to being submitted to one or more search engines. The query is resolved such that the query unambiguously corresponds to a category included in a query ontology that relates search queries to query categories. The query may be resolved by supplementing the query with additional information corresponding to the category. For example, the query may be formatted into a canonical form of the query for the category. Alternatively or additionally, the query may be supplemented with one or more keywords that are associated with the category and that represent words or phrases that appear in a high percentage of search results for queries from the category. Resolving the query yields search results that more closely reflect search results desired by a user submitting the query. | 09-03-2009 |
20100088322 | REAL TIME QUERY TRENDS WITH MULTI-DOCUMENT SUMMARIZATION - A list of “hot topics” may be provided to a user to indicate information that is currently popular. A topic may be deemed popular when a large number of search queries related to the topic are entered by users. A search system may receive and analyze an electronic source of published information to determine a reason for why a particular popular topic is popular. If content related to why a particular popular topic is popular exists in multiple electronic sources of published information, text summarization techniques may be used to determine a reason for why the popular topic is popular by from among the multiple electronic sources of published information. | 04-08-2010 |
20100169329 | SYSTEM FOR SIMILAR DOCUMENT DETECTION - A document is compared to the documents in a document collection using a hash algorithm and collection statistics to detect if the document is similar to any of the documents in the document collection. | 07-01-2010 |
20100235375 | TEMPORAL SEARCH QUERY PERSONALIZATION - A user is made able to configure a search query to be responsive to temporal factors in order to adjust the search query to more accurately reflect the user's true information need. By adjusting the search query in this way, the user is more likely to receive satisfactory search results. | 09-16-2010 |
20100299290 | Web Query Classification - A query phrase may be automatically classified to one or more topics of interest (e.g., categories) to assist in routing the query phrase to one or more appropriate backend databases. A selectional preference query classification technique may be used to classify the query phrase based on a comparison between the query phrase and patterns of query phrases. Additionally, or alternatively, a combination of query classification techniques may be used to classify the query phrase. Topical classification of a query phrase also may be used to assist a search system in delivering auxiliary information to a user who entered the query phrase. Advertisements, for instance, may be tailored based on classification rather than query keywords. | 11-25-2010 |
20110270818 | DOMAIN EXPERT SEARCH - Expert domains for a query category represent domains from which a high percentage of search results for queries associated with the query category are retrieved. The expert domains are identified by establishing a base statistical model that indicates frequencies of appearance for domains in search results retrieved for queries corresponding to multiple categories. In addition, frequencies of domain appearance are determined for search results retrieved for queries associated with a category. Domains that appear more frequently in the search results corresponding to the category are identified as expert domains for the category. A user may be allowed to customize expert domains related to one or more categories by adding or removing expert domains for the category. | 11-03-2011 |
20110276646 | RELIABILITY OF DUPLICATE DOCUMENT DETECTION ALGORITHMS - In a single-signature duplicate document system, a secondary set of attributes is used in addition to a primary set of attributes so as to improve the precision of the system. When the projection of a document onto the primary set of attributes is below a threshold, then a secondary set of attributes is used to supplement the primary lexicon so that the projection is above the threshold. | 11-10-2011 |
20120150907 | AUDIO AND/OR VIDEO SCENE DETECTION AND RETRIEVAL - Movie video trailers for a particular movie quote may be created and provided to a user. The Internet may be searched to identify documents that likely include references to a movie. A reference to the movie within an identified document may be detected and determined to be a movie quote. The movie quote and related information may be extracted from the identified document. A location of the movie quote within the movie may be determined. A movie video trailer that includes the movie quote may be created based on the location of the movie quote. A request for a movie video trailer that includes a movie quote or a partial movie quote, specified by the user, may be received from the user. A movie video trailer that includes the movie quote or the partial movie quote may be identified and provided to the user. | 06-14-2012 |
20120173560 | QUERY ROUTING - A search query is submitted to one or more information sources associated with a category of the query. The category of the query is indicated by a query ontology that relates queries to query categories. The information sources represent information sources from which a high percentage of search results for queries associated with the category are retrieved. For instance, the category of the query is identified by identifying categories corresponding to variations of the query, where each variation represents a combination of the terms within the query, and where the categories of the variations are assumed to be the categories of the query. Information sources associated with the query categories are identified, and the query is submitted to the identified information sources. Submitting the query to the identified information sources may cause search results retrieved for the query to more closely reflect search results desired by a user that specified the query. | 07-05-2012 |
20120197913 | SYSTEM FOR SIMILAR DOCUMENT DETECTION - A document is compared to the documents in a document collection using a hash algorithm and collection statistics to detect if the document is similar to any of the documents in the document collection. | 08-02-2012 |
20120209870 | WEB QUERY CLASSIFICATION - A query phrase may be automatically classified to one or more topics of interest (e.g., categories) to assist in routing the query phrase to one or more appropriate backend databases. A selectional preference query classification technique may be used to classify the query phrase based on a comparison between the query phrase and patterns of query phrases. Additionally, or alternatively, a combination of query classification techniques may be used to classify the query phrase. Topical classification of a query phrase also may be used to assist a search system in delivering auxiliary information to a user who entered the query phrase. Advertisements, for instance, may be tailored based on classification rather than query keywords. | 08-16-2012 |
20130007026 | Reliability of Duplicate Document Detection Algorithms - In a single-signature duplicate document system, a secondary set of attributes is used in addition to a primary set of attributes so as to improve the precision of the system. When the projection of a document onto the primary set of attributes is below a threshold, then a secondary set of attributes is used to supplement the primary lexicon so that the projection is above the threshold. | 01-03-2013 |
20130124556 | Real Time Query Trends with Multi-Document Summarization - A list of “hot topics” may be provided to a user to indicate information that is currently popular. A topic may be deemed popular when a large number of search queries related to the topic are entered by users. A search system may receive and analyze an electronic source of published information to determine a reason for why a particular popular topic is popular. If content related to why a particular popular topic is popular exists in multiple electronic sources of published information, text summarization techniques may be used to determine a reason for why the popular topic is popular by from among the multiple electronic sources of published information. | 05-16-2013 |
20130173518 | Simplifying Lexicon Creation in Hybrid Duplicate Detection and Inductive Classifier System - A classification system includes a signature-based duplicate detector and an inductive classifier that share attribute information. To perform the duplicate detection and the classification, the duplicate detector and inductive classifier are first initialized by generating a lexicon of attributes for the duplicate detector and a classification model for the classifier. To develop a classification model, a training set of documents of known class are used by the classifier to determine the attributes of the documents that are most useful in classifying an unknown document. The model is developed from these attributes. Attribute information containing the attributes determined by the classifier is then passed to the duplicate detector and the duplicate detector uses the attribute information to generate the lexicon of attributes. | 07-04-2013 |
20130173562 | Simplifying Lexicon Creation in Hybrid Duplicate Detection and Inductive Classifier System - A classification system includes a signature-based duplicate detector and an inductive classifier that share attribute information. To perform the duplicate detection and the classification, the duplicate detector and inductive classifier are first initialized by generating a lexicon of attributes for the duplicate detector and a classification model for the classifier. To develop a classification model, a training set of documents of known class are used by the classifier to determine the attributes of the documents that are most useful in classifying an unknown document. The model is developed from these attributes. Attribute information containing the attributes determined by the classifier is then passed to the duplicate detector and the duplicate detector uses the attribute information to generate the lexicon of attributes. | 07-04-2013 |
20130173563 | RELIABILITY OF DUPLICATE DOCUMENT DETECTION ALGORITHMS - In a single-signature duplicate document system, a secondary set of attributes is used in addition to a primary set of attributes so as to improve the precision of the system. When the projection of a document onto the primary set of attributes is below a threshold, then a secondary set of attributes is used to supplement the primary lexicon so that the projection is above the threshold. | 07-04-2013 |
20130173599 | QUERY DISAMBIGUTION - A search query is resolved prior to being submitted to one or more search engines. The query is resolved such that the query unambiguously corresponds to a category included in a query ontology that relates search queries to query categories. The query may be resolved by supplementing the query with additional information corresponding to the category. For example, the query may be formatted into a canonical form of the query for the category. Alternatively or additionally, the query may be supplemented with one or more keywords that are associated with the category and that represent words or phrases that appear in a high percentage of search results for queries from the category. Resolving the query yields search results that more closely reflect search results desired by a user submitting the query. | 07-04-2013 |
20130173608 | TEMPORAL SEARCH QUERY PERSONALIZATION - A user is made able to configure a search query to be responsive to temporal factors in order to adjust the search query to more accurately reflect the user's true information need. By adjusting the search query in this way, the user is more likely to receive satisfactory search results. | 07-04-2013 |
20140172836 | Audio and/or Video Scene Detection and Retrieval - Video trailers for a video quote may be created and provided to a user. The Internet may be searched to identify documents that likely include references to a video. A reference to the video within an identified document may be detected and determined to be a video quote. The video quote and related information may be extracted from the identified document. A location of the video quote within the video may be determined. A video trailer that includes the video quote may be created based on the location of the video quote. A request for a video trailer that includes a video quote or a partial video quote, specified by the user, may be received from the user. A video trailer that includes the video quote or the partial video quote may be identified and provided to the user. | 06-19-2014 |