Patent application number | Description | Published |
20080250033 | SYSTEM AND METHOD FOR DETERMINING AN EVENT OCCURENCE RATE - Described are a system and method for determined an event occurrence rate. A sample set of content items may be obtained. Each of the content items may be associated with at least one region in a hierarchical data structure. A first impression volume may be determined for the at least one region as a function of a number of impressions registered for the content items associated with the at least one region. A scale factor may be applied to the first impression volume to generate a second impression volume. The scale factor may be selected so that the second impression volume is within a predefined range of a third impression volume. A click-through-rate (CTR) may be estimated as a function of the second impression volume and a number of clicks on the content item. | 10-09-2008 |
20080275890 | System and method for smoothing hierarchical data using isotonic regression - An improved system and method is provided for detecting a web page template. A web page template detector may be provided for performing page-level template detection on a web page. In general, the web page template classifier may be trained using automatically generated training data, and then the web page template classifier may be applied to web pages to identify web page templates. A web page template may be detected by classifying segments of a web page as template structures, by assigning classification scores to the segments of the web page classified as template structures, and then by smoothing the classification scores assigned to the segments of the web page. Generalized isotonic regression may be applied for smoothing scores associated with the nodes of a hierarchy by minimizing an optimization function using dynamic programming. | 11-06-2008 |
20080275901 | System and method for detecting a web page - An improved system and method is provided for detecting a web page template. A web page template detector may be provided for performing page-level template detection on a web page. In general, the web page template classifier may be trained using automatically generated training data, and then the web page template classifier may be applied to web pages to identify web page templates. A web page template may be detected by classifying segments of a web page as template structures, by assigning classification scores to the segments of the web page classified as template structures, and then by smoothing the classification scores assigned to the segments of the web page. Generalized isotonic regression may be applied for smoothing scores associated with the nodes of a hierarchy by minimizing an optimization function using dynamic programming. | 11-06-2008 |
20090043597 | System and method for matching objects using a cluster-dependent multi-armed bandit - An improved system and method for matching objects using a cluster-dependent multi-armed bandit is provided. The matching may be performed by using a multi-armed bandit where the arms of the bandit may be dependent. In an embodiment, a set of objects segmented into a plurality of clusters of dependent objects may be received, and then a two step policy may be employed by a multi-armed bandit by first running over clusters of arms to select a cluster, and then secondly picking a particular arm inside the selected cluster. The multi-armed bandit may exploit dependencies among the arms to efficiently support exploration of a large number of arms. Various embodiments may include policies for discounted rewards and policies for undiscounted reward. These policies may consider each cluster in isolation during processing, and consequently may dramatically reduce the size of a large state space for finding a solution. | 02-12-2009 |
20090112865 | HIERARCHICAL STRUCTURE ENTROPY MEASUREMENT METHODS AND SYSTEMS - Methods and apparatuses are provided for accessing taxonomic data associated with an item as classified into a taxonomy having a hierarchical structure, establishing dependency data associated with a distribution represented in the taxonomic data, and determining entropic data for the item based, at least in part, on the distribution and established dependency. | 04-30-2009 |
20090177959 | AUTOMATIC VISUAL SEGMENTATION OF WEBPAGES - To provide valuable information regarding a webpage, the webpage must be divided into distinct semantically coherent segments for analysis. A set of heuristics allow a segmentation algorithm to identify an optimal number of segments for a given webpage or any portion thereof more accurately. A first heuristic estimates the optimal number of segments for any given webpage or portion thereof. A second heuristic coalesces segments where the number of segments identified far exceeds the optimal number recommended. A third heuristic coalesces segments corresponding to a portion of a webpage with much unused whitespace and little content. A fourth heuristic coalesces segments of nodes that have a recommended number of segments below a certain threshold into segments of other nodes. A fifth heuristic recursively analyzes and splits segments that correspond to webpage portions surpassing a certain threshold portion size. | 07-09-2009 |
20090248608 | METHOD FOR SEGMENTING WEBPAGES - A method of segmenting a webpage into visually and semantically cohesive pieces uses an optimization problem on a weighted graph, where the weights reflect whether two nodes in the webpage's DOM tree should be placed together or apart in the segmentation; the weights are informed by manually labeled data. | 10-01-2009 |
20090287672 | Method and Apparatus for Better Web Ad Matching by Combining Relevance with Consumer Click Feedback - A method and apparatus are provided for better web ad matching by combining relevance with consumer click feedback. In one example, the method includes receiving a query page, extracting features from the query page, re-weighting the query page, evaluating the query page in light of each ad in order to score each ad and pick substantially best ad matches of the indexed ads, and returning the substantially best ad matches to the consumer computer. | 11-19-2009 |
20100049709 | Generating Succinct Titles for Web URLs - Methods, computer programs, and systems for generating a link title for a URL (Uniform Resource Locator) within a context webpage to be shown as a web result are provided. The method evaluates generation parameters for a plurality of sources for picking words from the link title. Further, the method generates candidates for the link title, and a likelihood is computed for each candidate. When computing the likelihood, the generation parameters, the context webpage and the words are considered. In addition, the method selects a candidate with the highest likelihood from all the computed likelihoods, and presents the URL with the selected candidate as the title. | 02-25-2010 |
20100250528 | QUICKLINK SELECTION FOR NAVIGATIONAL QUERY - According to techniques described herein, the best set of quicklinks is picked to maximize the benefits for a majority of the users of a search engine, since the “real estate” on a search results page is constrained and valuable. Quicklinks are navigational shortcuts that are displayed below the website homepage on a search results page. Using user browsing trails obtained from browser toolbars, and a simple probabilistic model, the quicklink selection program is formulated as a combinatorial optimization problem. Two techniques are proposed herein: a greedy technique and a tree-based technique. The tree-based technique finds an optimal solution, but may do so in a greater amount of time than the greedy technique takes to find a solution that is not guaranteed to be optimal. The tree-based technique may incorporate natural constraints on the set of chosen quicklinks. | 09-30-2010 |
20110035345 | AUTOMATIC CLASSIFICATION OF SEGMENTED PORTIONS OF WEB PAGES - Exemplary methods and apparatuses are provided which may be used for classifying and indexing segmented portions of web pages and providing related information for use in information extraction and/or information retrieval systems. | 02-10-2011 |
20110153550 | SYSTEM AND METHOD FOR DETERMINING AN EVENT OCCURRENCE RATE - Described are a system and method for determined an event occurrence rate. A sample set of content items may be obtained. Each of the content items may be associated with at least one region in a hierarchical data structure. A first impression volume may be determined for the at least one region as a function of a number of impressions registered for the content items associated with the at least one region. A scale factor may be applied to the first impression volume to generate a second impression volume. The scale factor may be selected so that the second impression volume is within a predefined range of a third impression volume. A click-through-rate (CTR) may be estimated as a function of the second impression volume and a number of clicks on the content item. | 06-23-2011 |
20120166445 | METHOD, APPARATUS AND COMPUTER READABLE MEDIUM FOR INDEXING ADVERTISEMENTS TO COMBINE RELEVANCE WITH CONSUMER CLICK FEEDBACK - A method and apparatus are provided for better web ad matching by combining relevance with consumer click feedback. In one example, the method includes receiving a query page, extracting features from the query page, re-weighting the query page, evaluating the query page in light of each ad in order to score each ad and pick substantially best ad matches of the indexed ads, and returning the substantially best ad matches to the consumer computer. | 06-28-2012 |
20140108417 | METHOD AND APPARATUS FOR WEB AD MATCHING - A method and apparatus are provided for better web ad matching by combining relevance with consumer click feedback. In one example, the method includes receiving a query page, extracting features from the query page, re-weighting the query page, evaluating the query page in light of each ad in order to score each ad and pick substantially best ad matches of the indexed ads, and returning the substantially best ad matches to the consumer computer. | 04-17-2014 |