Query augmenting and refining (e.g., inexact access)

Subclass of:

707 - Data processing: database and file management or data structures

707001000 - DATABASE OR FILE ACCESSING

707003000 - Query processing (i.e., searching)

Entries
Document | Title | Date
20080313165 | SCALABLE MODEL-BASED PRODUCT MATCHING - Aspects of the subject matter described herein relate to matching product information to products. In aspects, a product matching component receives product information. The product matching component normalizes the product information and obtains keywords from the product information. By querying a database of recognized products, the keywords are used to obtain a list of products that potentially match the product information. A confidence level is assigned to each of the potential matches in the list. A match may be returned for the highest matched product or for a selectable number of products whose confidence level(s) exceed a selectable threshold. | 12-18-2008
20090327264 | Topics in Relevance Ranking Model for Web Search - Described is a technology by which topics corresponding to web pages are used in relevance ranking of those pages. Topics are extracted from each web page of a set of web pages that are found via a query. For example, text such as nouns may be extracted from the title, anchor texts and URL of a page, and used as the topics. The extracted topics from a page are used to compute a relevance score for that page based on an evaluation of that page's topics against the query. The pages are then ranked relative to one another based at least in part on the relevance score computed for each page, such as by determining a matching level for each page, ranking pages by each level, and ranking pages within each level. Also described is training a model to perform the relevance scoring and/or ranking. | 12-31-2009
20080294620 | User-defined relevance ranking for search - Detailed herein is a technology which, among other things, allows a search engine to utilize a user-defined relevance function. In one approach to the technology, a method of applying a user-defined relevance function is described. In this approach, a complex search query is decomposed into a simple operator. The simple operator is associated with the user-defined relevance function. A document which matches the search query is retrieved, and a rank is calculated for the document, using the user-defined relevance function. | 11-27-2008
20090216749 | IDENTITY BASED CONTENT FILTERING - A system for determining the context in which a content filtering decision is made makes use of user identity information and user usage patterns. This decision making process can be downloaded to the user to allow the use of detailed identity information that the user prefers not to release to third parties. The number of factors used in the determination of the context is increased by providing access to resources otherwise not available to an in-the-cloud decision making process. | 08-27-2009
20090216747 | System and method for detecting, collecting, analyzing, and communicating event-related information - A system and method involves detecting operational social disruptive events on a global scale, assigning disease event staging and warnings to express data in more simplistic terms, modeling data in conjunction with linguistics analysis to establish responsive actions, generating visualization and modeling capabilities for communicating information, and modeling disease propagation for containment and forecasting purposes. | 08-27-2009
20090210408 | METHOD AND SYSTEM FOR ROLE BASED SITUATION AWARE SOFTWARE - A method for role based situation aware software includes: mapping one or more users to one or more communities of practice (CoP); aggregating a series of tags from the one or more CoP to form an initial set of role-based tags; filtering the initial set of role-based tags based on user context elements to form context sensitive user-role tags; querying one or more information services based on the context sensitive user-role tags; aggregating information obtained from querying the one or more information services; and providing the aggregated information to the user as dynamic context related content. | 08-20-2009
20090198672 | CONTEXT-SENSITIVE QUERY EXPANSION - A method for processing a search query having a plurality of search terms for searching for documents includes segmenting the query to identify two or more units, expanding the query by selecting one or more substitutable units for at least one unit in the query, and calculating a substitution probability for each substitutable unit. For each substitutable unit, a co-occurrence probability is calculated with each of the remaining units in the search query. An occurrence probability is then calculated for each substitutable unit, and a score is calculated based on the combination of the substitution probability, the co-occurrence probability, and occurrence probability. The documents are ranked in an order determined by the score. | 08-06-2009
20100030768 | CLASSIFYING DOCUMENTS USING IMPLICIT FEEDBACK AND QUERY PATTERNS - Methods and apparatus are described for classifying documents using a document representation model based on implicit user feedback obtained from search engine queries. The model may be used to achieve better results in non-supervised tasks such as clustering and labeling through the incorporation of usage data obtained from the search engine queries. | 02-04-2010
20100017402 | Method, Apparatus, and Data Processor Program Product Capable of Enabling Management of Athleticism Development Program Data - Various aspects of one or more methods, apparatuses and data processor program products capable of enabling management of data associated with an athleticism development program are disclosed herein. These various aspects include maintaining a database including subscriber performance data for a plurality of athleticism development program subscribers and facilitating preparation of a subscriber performance report for a specified one of the plurality of athleticism development program subscribers. The subscriber performance data is capable of enabling an attained standardized athleticism level to be determined for each one of the athleticism development program subscribers. The implementation of standardized athleticism levels is advantageous as it supports a measurable plan of progress for motivating a subscriber and trainer to meet their individual and mutual goals. | 01-21-2010
20100017391 | POLARITY ESTIMATION SYSTEM, INFORMATION DELIVERY SYSTEM, POLARITY ESTIMATION METHOD, POLARITY ESTIMATION PROGRAM AND EVALUATION POLARITY ESTIMATIOM PROGRAM - An evaluation polarity of reputation information with an unknown evaluation polarity is estimated by utilizing reputation information with a known evaluation polarity. The present polarity estimation system is a polarity estimation system for estimating an evaluation polarity indicating whether reputation information is positive or negative, and includes a reputation information storage part that precedently stores reputation information with a known evaluation polarity; and a polarity estimating means for estimating an evaluation polarity of reputation information with an unknown evaluation polarity on the basis of the reputation information with the known evaluation polarity precedently stored in the reputation information storage part. | 01-21-2010
20090006363 | Granular Data for Behavioral Targeting - A method of targeting receives several granular events and preprocesses the received granular events thereby generating preprocessed data to facilitate construction of a model based on the granular events. The method generates a predictive model by using the pre-processed data. The predictive model is for determining a likelihood of a user action. The method trains the predictive model. A system for targeting includes granular events, a preprocessor for receiving the granular events, a model generator, and a model. The preprocessor has one or more modules for at least one of pruning, aggregation, clustering, and/or filtering. The model generator is for constructing a model based on the granular events, and the model is for determining a likelihood of a user action. The system of some embodiments further includes several users, a selector for selecting a particular set of users from among the several users, a trained model, and a scoring module. | 01-01-2009
20080208841 | CLICK-THROUGH LOG MINING - Click-through log mining is described. Raw search click-through log data is processed to generate ordered query keywords, utilizing an algorithm to expand user-submitted keywords to include high frequency user queries, managing the keywords for a keyword expansion file, analyzing the algorithm performance on a bidding criteria, and identifying related phrases with similar page-click behaviors for advertisements. | 08-28-2008
20090077052 | Historical media recommendation service - A media recommendation system for recommending media content that is historically related to seed media content is provided. The recommended media content may be songs, television programs, movies, or a combination thereof, and the seed media content may be a song, television program, or movie. | 03-19-2009
20090138465 | TECHNICAL DOCUMENT ATTRIBUTE ASSOCIATION ANALYSIS SUPPORTING APPARATUS - Data on a group of technical documents having an attribute X and an attribute Y is acquired and a score corresponding to the data on the technical documents belonging to the combination of the attribute X and attribute Y is calculated. The attribute X is placed on the horizontal axis and the attribute Y is placed on the vertical axis. The scores are placed in a matrix manner. According to the scores belonging to each column of the arrangement in the matrix, a group of vectors X | 05-28-2009
20090157658 | COMMUNICATIONS SYSTEM AND METHOD FOR SERVING ELECTRONIC CONTENT - A method and system for serving electronic content for placement in a user interface provided by an online service provider system is described. The system stores user data in a user database based on user interaction with the user interface and stores filters associated with the electronic content. The user data is compared to the filters associated with the electronic content and it is determined if the user data matches conditions of the filters. If a match is determined the electronic content is provided for presentation by the user interface. | 06-18-2009
20080301116 | Search Ranger System And Double-Funnel Model For Search Spam Analyses and Browser Protection - An exemplary method for protecting web browsers from spam includes providing a multi-layer model that includes a doorway layer, a redirection domain layer, an aggregator layer, a syndicator layer and an advertiser layer; identifying domains as being associated with at least one of the layers; and, based at least in part on the identifying, taking one or more corrective actions to protect web browsers from search spam. An exemplary method for identifying a bottleneck layer in a multi-layer spam model includes providing a multi-layer spam model, collecting spam advertisements, associating a block of IP addresses with the collected spam advertisements and identifying a bottleneck layer based on the block of IP addresses. Other methods, systems, etc., are also disclosed. | 12-04-2008
20090049039 | MECHANISM FOR IMPROVING THE EFFECTIVENESS OF AN INTERNET SEARCH ENGINE - Large websites employ internal search engines to assist visitors of the site to access pages relevant to the visitor's needs. Such internal search engines generally use a specialist database containing information relevant to the website. | 02-19-2009
20090193019 | SYSTEM AND METHOD FOR SEARCHING A REMOTE DATABASE - A device is in communication with a server over a wireless network. Items are stored locally in the device. The device receives, from a user of the device, a request to search for items matching a search parameter. The device finds items among the locally-stored items that match the parameter. The device sends a request to the server for the server to search among items stored remotely on the server for items matching the parameter. The device receives, from the server, a list of the remotely-stored items that match the parameter without receiving the items themselves. The device displays, to the user, a composite list of both the locally-stored items and remotely-stored items matching the parameter. | 07-30-2009
20090193017 | Methods and Systems for Corporate Discovery, Investigation, and Implementation of Emerging Technology - This disclosure provides a system and method of creating a searchable company database, the method comprising the steps of: a) having at least one company enter company information using a user interface; b) creating a company profile database, where the database categorizes the company into at least two tiers of a hierarchical classification system based on the information entered in step a); and c) permitting users to identify a company of interest based on the hierarchical classification system. | 07-30-2009
20090193016 | METHOD AND SYSTEM FOR ACCESS TO RESTRICTED RESOURCES - A method and system of providing a search result to a user based on information indicated in a restricted access resource is described. A search system utilizing the assistance of human searchers or guides may obtain a search result using information included in a restricted resource. Access to a restricted resource is granted to guides based on access information provided to the search service. A guide may access information indicated in a restricted resource in order to obtain a search result. A search result obtained based on information indicated in a restricted resource may be returned to a user. | 07-30-2009
20090193015 | METHOD AND APPARTUS FOR ADAPTIVELY UPDATING RECOMMEND USER GROUP - A method of adaptively updating a first recommended user group list of a first user connected to a network, in which a predetermined number of second users are selected from the first recommend user group list that is a list of users having a high similarity to the first user in consuming contents. A predetermined number of third users having a high similarity to the first user is selected from second recommend user group lists that the selected second users respectively possess. The first recommend user group list is updated to include the selected third users. As the preference of a user changes, the recommend user group may be reconfigured with updated recommend users by reflecting a corresponding preference. Also, quality contents can be provided by recommending, to a user of a corresponding terminal, the contents preferred by other users in the updated recommend user group list. | 07-30-2009
20090193014 | APPARATUS AND METHODS FOR TRACKING, QUERYING, AND VISUALIZING BEHAVIOR TARGETING PROCESSES - Disclosed are apparatus and methods for providing information that is related to user on-line behavior, which was also used at least partly to generate user scores by one or more behavior targeting processes. A query client may select to receive information from a plurality of different data feeds that are retained within a plurality of different databases by a plurality of different behavior targeting processes. The selectable data feeds generally correspond to different types or aggregations of user on-line behavior. In certain embodiments, information from the selected data feeds for a particular user is compiled and presented in a single, interactive user interface that allows the client to easily view various aspects of such information. | 07-30-2009
20090193012 | Inheritance in a Search Index - Embodiments of the present invention perform bulk updates of a search index with a representation that includes upward and downward inheritance. In various embodiments of the invention, a batched set of update requests is applied to a search index to modify existing indexed objects. A representation of the inheritance consequences of the updates is created, and that representation is used to construct a second batched set of update requests. The second batched set of update requests is applied to propagate the updates to the objects that have inheritance relationships to the modified existing indexed objects. | 07-30-2009
20090193011 | Phrase Based Snippet Generation - Disclosed herein is a method, a system and a computer product for generating a snippet for an entity, wherein each snippet comprises a plurality of sentiments about the entity. One or more textual reviews associated with the entity is selected. A plurality of sentiment phrases are identified based on the one or more textual reviews, wherein each sentiment phrase comprises a sentiment about the entity. One or more sentiment phrases from the plurality of sentiment phrases are selected to generate a snippet. | 07-30-2009
20090193010 | METHOD AND SYSTEM FOR REDUCING COMPLEX TREE STRUCTURES TO SIMPLE TREE STRUCTURES BASED ON RELEVANCE - The present invention discloses a method for reducing a tree structure in a processing system. The method includes providing a plurality of nodes in a tree structure. The method also includes querying each of the plurality of nodes based upon a threshold value, wherein the threshold is related to relevance; when a count of a particular node matches the threshold, then a next child node is queried to determine if the next child node matches the threshold; if a child node does not exist for the queried node, then the node is displayed. The method further includes visiting all of the parent nodes based on the querying step until all of the plurality of nodes have been queried. The method finally includes displaying the nodes that satisfy the threshold value. | 07-30-2009
20090193008 | TERM SYNONYM GENERATION - Synonyms for a term to be indexed are dynamically generated by applying one or more rules (e.g., splitting, deletion or addition of characters, and concatenation of portions) to the term, each synonym generated either including only a portion and not all of the term or differing from the term by at least one additional character in a position between a first character and a last character (i.e., not at either end of the term). The term and some or all of the synonyms are then indexed for subsequent searching. | 07-30-2009
20090193007 | Systems and methods for ranking search engine results - Systems and methods for ranking search engine results based at least in part on user access to the results of previous search inquiries. Results to a search inquiry appearing on a search engine results page are ranked according to their relevance with respect to the search inquiry, and the ranking is based at least in part on an evaluation of user data associated with actions taken by one or more users in response to other search inquiries. The systems and methods retain data associated with search results for future use on a user specific or multi-user basis, and may access this data from local storage or centralized storage within a network. | 07-30-2009
20100010990 | PROGRESS INFORMATION OUTPUT METHOD, MEDIUM STORING PROGRESS INFORMATION OUTPUT PROGRAM, AND PROGRESS INFORMATION OUTPUT APPARATUS - A progress information output method executed by a computer includes acquiring a set of similar applications, the contents of which are similar to the contents of a given application, by searching a first database that stores the contents of each application using information regarding the contents of the given application as a search key, acquiring a set of progress information corresponding to the set of similar applications by searching a second database that stores progress information that is an accumulation of information on procedures performed for applications and indicates a progress of a given application, counting the number of procedures included in the set of progress information, and outputting the number of procedure types counted or an analysis result obtained using the counted number. | 01-14-2010
20100010987 | Searching system having a server which automatically generates search data sets for shared searching - A system has a primary server storing search data sets (“staks”) and a linked social network server. Interaction with the server is via a client and a Web site. There is a software agent code executing on the client. The software agent component provides full integration with underlying search engines so that users can continue to search in the normal way, using their favourite search engines, while benefiting from management of search staks, automatic stak selection, and result promotion. The system allows users to manage their staks and provides a range of social networking style services to help users make contact with other like-minded searchers. In addition, it allows users to search for relevant staks. The software agent component provides browser-based access to allow users to manage and share their searches directly from their browser, as well as providing the benefits of search promotions as they search normally. The Web site provides a wide range of additional features to users and allows them to monitor their own activity and stak activities in more detail, and includes a wide range of social networking style features based around the sharing of search information. The engine provides back-end functionality needed to drive a search service including: the management, storage and indexing of stak information; the generation of search promotions; user management; stak search and recommendations. | 01-14-2010
20090006370 | ADVANCED TECHNIQUES FOR SQL GENERATION OF PERFORMANCEPOINT BUSINESS RULES - Computer-implemented methods and computer-readable storage media are disclosed to facilitate the application of business rules. A rule is received, the rule defining one or more calculations to be performed on specified data stored in a multidimensional database to yield at least one result. At least one database query is generated seeking the specified data to be retrieved from the multidimensional database. An intermediary table is created to accommodate the specified data retrieved from the multidimensional database. The specified data is stored in the intermediary table and the specified data is manipulated when the data is retrieved or after the specified data is stored in the intermediary table. | 01-01-2009
20090024619 | Processing video files using metadata and time stamp - A method for processing video data involves receiving data from a series of images and analyzing the data to identify geometric forms. The forms are stored as metadata of a first data level and are linked by time stamps to the images in which the forms were identified. The metadata from an image and the previous image are compared, and delta metadata is generated from the difference. Delta metadata is also marked with time stamps. Metadata and delta metadata are analyzed, and objects are extracted from the geometric forms. The objects are stored as time-stamped metadata and delta metadata of a second data level. The process is repeated for higher data levels. A user inputs a database query to identify from among the stored input images that particular image sequence in which the extracted object is recorded. Queries started at higher data levels are quicker but less accurate. | 01-22-2009
20090070321 | USER SEARCH INTERFACE - A search mechanism for users of search engines includes a back-end information retrieval system, which accepts terms and weights thereof as an input set from a front end and processes said set; a front-end system interacting with said back-end information retrieval system; and a database that is searchable by the back-end information retrieval system. The search mechanism further includes a visual search interface module (VSI) implemented through the front-end system, where the graphic user interface module is used to change suggested terms and refine the query of a multimedia search. | 03-12-2009
20090077054 | Cardinality Statistic for Optimizing Database Queries with Aggregation Functions - Embodiments of the invention provide techniques for generating predicted cardinality statistics for grouped aggregation functions included in database queries. In general, characteristics of a database query are determined, and are then supplied to a probability function configured to generate a predicted cardinality statistic. The generated statistic represents a prediction of the probable cardinality of the results of a grouped aggregation function in the event that the query is executed. The predicted cardinality statistic may be used by a query optimizer to determine an efficient query plan for executing the database query. | 03-19-2009
20090043761 | AUTONOMIC COMPUTING SYSTEM, EXECUTION ENVIRONMENT CONTROL PROGRAM - To realize an autonomic system and method of improving the quality of a piece of software and solving problems with respect to operations in stages of a software life cycle. There are provided a pattern catalog | 02-12-2009
20080294631 | DESIRE POSTING SYSTEM AND METHOD - Systems and methods are provided herein that provide for desire posting. | 11-27-2008
20090327265 | RELEVANCE SCORE IN A PAID SEARCH ADVERTISEMENT SYSTEM - Described is a paid search advertising technology in which advertisements associated with bidding keywords are ranked by relevance when returning one or more advertisements in a response to a query. A relevance score is computed for an advertisement based on the bidding keyword and page data (text and/or other page content) of the advertisement. The relevance score may be based on a similarity vector score computed from a keyword vector and page data vector relationship, combined with a proximity score computed from the keyword's bigram set and the page data bigram set. When a query is received, advertisements are selected based on the proximity of the query to each advertisement's bidding keyword, providing candidate scores. Each candidate score is modified (e.g., multiplied) into a final score based on its respective advertisement's relevance score. The final scores are then used to re-rank the advertisements relative to one another. | 12-31-2009
20100030764 | Recommender System Utilizing Collaborative Filtering Combining Explicit and Implicit Feedback with both Neighborhood and Latent Factor Models - Example collaborative filtering techniques provide improved recommendation prediction accuracy by capitalizing on the advantages of both neighborhood and latent factor approaches. One example collaborative filtering technique is based on an optimization framework that allows smooth integration of a neighborhood model with latent factor models, and which provides for the inclusion of implicit user feedback. A disclosed example Singular Value Decomposition (SVD)-based latent factor model facilitates the explanation or disclosure of the reasoning behind recommendations. Another example collaborative filtering model integrates neighborhood modeling and SVD-based latent factor modeling into a single modeling framework. These collaborative filtering techniques can be advantageously deployed in, for example, a multimedia content distribution system of a networked service provider. | 02-04-2010
20090037404 | EXTENDED CURSOR SHARING - Techniques for sharing cursors are provided. When a new query is issued, a database server determines whether the new query is semantically equivalent to a previous query. If so, then the database server computes statistics associated with the new query. Based on the statistics, the database server determines whether compiling the new query would produce an execution plan that satisfies certain criteria. If so, then the cursor is used to execute the new query. In another approach, one cursor sharing technique (CST) is used to determine which cursor to use to execute a first set of semantically-equivalent queries. Statistics are gathered during execution of the first set of queries. The database server determines, based on the statistics, when to switch from using the first CST to a different CST. The different CST is used to determine which cursor to use to execute a second set of queries that are semantically-equivalent to the first set. | 02-05-2009
20090187565 | SYSTEM AND METHOD FOR HANDLING ITEM LISTINGS WITH GENERIC ATTRIBUTES - A system for storing a plurality of items across different categories in a database includes a database that stores a data structure that has item entries for items of different categories. Each item entry includes one or more associated attributes. The attributes may be shared by multiple items across more than one category. | 07-23-2009
20090187563 | Methods for Generating and Indicating a Time Relevant Status for an Operating Entity - A method for generating an operating status at a defined time for an operating entity, comprising the steps of: generating one or more time rules for hours of operation of the operating entity; determining an operating status value by comparing the time with the time rules; and matching an operating status to said operating status value. | 07-23-2009
20090182729 | LOCAL QUERY IDENTIFICATION AND NORMALIZATION FOR WEB SEARCH - Computer-implemented methods and systems for processing user entered query data to improve results of a search of pages using a local search database are provided, when searching the internet. The method includes receiving the user entered query data and parsing each word of the query data and examining each word to determine if the word is associated with one of a business name, a city name or a state name. The examining uses probabilistic dictionaries to determine a likelihood that the word is one of the business name, the city name or the state name. Then, associating the words that were determined to be: (i) the business name with a business name tag to create one or more tagged business terms; (ii) the city name with a city name tag to create one or more tagged city terms; and (iii) the state name with a state name tag to create one or more tagged state terms. The method further includes normalizing each of the tagged business terms, the tagged city terms and the tagged state terms. The normalizing includes boosting information if found in the local search database and determining proximity between selected ones of the tagged business, city or state terms. Then, generating an optimized internal search query that incorporates constraints and ranking based on at least the boosting information and the determined proximity between the selected tagged business, city or state terms. The optimized internal search query is applied to the internet to enable search results to be produced and displayed to the user in response to the entered query data. | 07-16-2009
20090171949 | Linguistic Assistance Systems And Methods - Systems and methods determine a linguistic preference between two or more phrases. Each of the phrases is submitted to at least one search engine as a search string. Search results are retrieved from each of the at least one search engine for each submitted search string and total hit values of each search result are compared. The one of the two or more phrases associated with the greatest total hit value is displayed to a user as the preferred phrase. | 07-02-2009
20090171939 | USER-GENERATED ACTIVITY MAPS - Apparatus and computer-readable media for associating metadata with a geographic location are provided. The apparatus includes logic for detecting that a mobile device is present at a geographic location relevant to a user of the mobile device, logic for retrieving context information associated with the location, logic for selecting a program code module based upon a contextual relevancy of the location, logic for providing the program code module for execution, where the program code module is capable of performing processing specific to at least one aspect of the location, the processing is based upon the context information, and the program code module is further capable of receiving at least one input data item from the mobile device, where the at least one input data item describes an activity of the user at the location, and logic for associating the at least one input data item with the location. | 07-02-2009
20100042619METHOD AND SYSTEM OF TRIGGERING A SEARCH REQUEST - A method and system are described for creating a recurring or triggered search request. A search request is associated with a condition which initiates an information search. A search result, including a search query associated with a search request and a condition may be provided to a user via any or all communication services and/or devices associated with the user. A tool is provided to enable a user to select an existing search request and/or search result which may be used to create a triggered or ‘favorite’ search query which may be triggered as designated. Triggered search requests may be suggested to a user using automated and/or human assisted techniques.02-18-2010
20100042618SYSTEMS AND METHODS FOR COMPARING USER RATINGS - A rating submitted by a user may be compared to ratings submitted by other users in a user community. The users within the user community may be identified using respective descriptive tags. A subset of users within the community may be defined using the descriptive tags. A tag-specific comparison may be made between the rating submitted by the user and a particular subset of the user community. The user may add, edit, and/or remove descriptive tags responsive to the comparisons. Cohesive groups may be identified within the user community. Ratings submitted by members of a cohesive group may be used to suggest content to other members of the group.02-18-2010
20100042617METHOD AND SYSTEM FOR DOWNLOADING ADDITIONAL SEARCH RESULTS INTO ELECTRONIC DICTIONARIES - In one embodiment, the invention provides a method for a system to provide information based on a query, the method comprising: performing a first search of at least one first source for information responsive to the query; providing a result of said search to a user; based on user input, performing a second search of at least one second source for information responsive to the query; and providing a result of said second search to the user.02-18-2010
20100042615SYSTEMS AND METHODS FOR AGGREGATING CONTENT ON A USER-CONTENT DRIVEN WEBSITE - One or more items submitted by user-contributors of a website may be aggregated according to aggregation criteria. The aggregation criteria may specify a topic or type of item to be included in the aggregation. The aggregation criteria may be generated a priori by a user of the website and/or may be generated on-the-fly based on a search, inbound link, or the like. User contributed items may be associated with metadata and/or tags. The items and, importantly the metadata associated therewith, may be rated by users of the website. The item ratings, the metadata, and/or the metadata ratings may be used to aggregate relevant items, thereby increasing the probability that the aggregation closely matches the aggregation criteria. The aggregated items may be presented in a user interface to one or more users of the website.02-18-2010
20100042614DEFERRED 3-D SCENEGRAPH PROCESSING - Processing a scenegraph for a client, including: creating a stack of filters, wherein each filter of the stack of filters is configured to edit or create a property on an object within the scenegraph; presenting a query by the client to the stack of filters for a first property on a first object within the scenegraph to determine whether a filter of the stack of filters edits or creates the first property on the first object; and returning a value for the first property if the filter of the stack of filters edits or creates the first property.02-18-2010
20100042613METHOD AND SYSTEM FOR AUTOMATED SEARCH ENGINE OPTIMIZATION - A system and method receives identification of a web page. The web page is added to a web page library. Upon occurrence of an analysis event, the web page is provided to an automated analysis framework. Analyses representing SEO best practices are run against the web page. Each of the analyses is associated with input data.02-18-2010
20100042621Methods, Systems, And Computer Program Products For Characterizing Links To Resources Not Activated - Methods, systems, and computer program products for characterizing links to resources that are not activated are disclosed. According to one aspect, a page is presented via a user interface of a client device, the page including a link to a resource accessible via a network through activation of the link. The client device determines whether the link on the page is not activated. The link is characterized based on at least one of information associated with the link, the resource, and the page responsive to determining that the link is not activated.02-18-2010
20100042620System and Methods for Managing Complex Service Delivery Through Coordination and Integration of Structured and Unstructured Activities - The invention allows for the integration of structured and unstructured human activities in the context of delivering one or more services. The systems and methods described improve efficiency and quality of service delivery by increasing overall productivity and by providing better accountability for the actual cost of delivery.02-18-2010
20100042612Method and system for ranking journaled internet content and preferences for use in marketing profiles - A method and system for ranking and categorizing journaled internet data sources for use in marketing and advertising. Journaled internet data sources are identified and examined. Journal data is retrieved from one or more of the data sources and a voting algorithm is applied to classify the journaled data. The journaled data is associated with one or more content categories of a monitoring taxonomy that specifies content categories and relationships between the content categories. Based on the associations, an interest level, an interaction level, a direction level, or authority level is computed and used to rank the journaled data. The rankings are stored and can be provided for use in targeted marketing and advertising.02-18-2010
20100042610RANK DOCUMENTS BASED ON POPULARITY OF KEY METADATA - Ranking of documents by metadata popularity provides relevant search results in response to user search queries received by a search engine. Metadata popularity is determined by comparing metadata from a document with popularity data from one or more sources. In some embodiments, metadata popularity is determined based on a frequency with which extracted metadata appears in query logs. Search results are ordered based on metadata popularity and returned in response to the user search queries.02-18-2010
20100042609SHARING ITEM IMAGES USING A SIMILARITY SCORE - In some example embodiments, a system and method are illustrated to associate an item listing with one or more corresponding images. The system and method include receiving an item listing for an item from a user device. The system and method include generating a similarity score for a respective existing image associated with one or more existing item listings. The generating may be done by comparing the item listing with the existing item listings received from the user device. The similarity score may indicate a degree of similarity between the item listing and the existing item listings associated with the respective existing image. The system and method include proposing a specified number of existing images with the highest similarity scores to the user device. The system and method further include associating the item listing with one or more of the specified number of existing images accepted by the user device.02-18-2010
20100042607METHOD, APPARATUS, AND COMPUTER PROGRAM PRODUCT FOR ADAPTIVE QUERY PARALLELISM PARTITIONING WITH LOOK-AHEAD PROBING AND FEEDBACK - A database query is partitioned into an initial partition including a plurality of parallel groups, and is executed, via an execution plan, based on the initial partition. A sampling subset of data is identified from the plurality of parallel groups. Substantially in parallel with the executing of the query, the execution plan is executed on the sampling subset of data as a sampling thread. The execution plan is modified based on feedback from the execution of the execution plan on the sampling subset of data.02-18-2010
20080215563Pseudo-Anchor Text Extraction for Vertical Search - A search method uses pseudo-anchor text associated with search objects to improve search performance. The pseudo-anchor text may be extracted in combination with an identifier of the search objects (such as a pseudo-URL) from a digital corpus such as a collection of documents. Pseudo-anchor texts for each object are preferably extracted from candidate anchor blocks using a machine learning based approach. The pseudo-anchor texts are made available for searching and used to help rank the objects in a search result to improve search performance. The method may be used in vertical search of objects such as published articles, products and images that lack explicit URL and anchor text information.09-04-2008
20090248654SYSTEM AND METHOD FOR PROCESSING MAIL USING SENDER AND RECIPIENT NETWORKED MAIL PROCESSING SYSTEMS - Systems and methods for allowing the sender of a mail piece to obtain an accurate recipient address for the mail piece when the mail piece is being prepared are provided. The mail processing systems of the sender and recipient businesses are networked such that communication can occur between them. When the sender is preparing a mail piece for delivery to a recipient, the sender can participate in an interactive session with the recipient's mail room, utilizing the networked mail processing systems, to obtain a correct recipient address for a mail piece based on a database of recipient addresses maintained by the recipient's mail processing system. Since the mail piece is provided with an accurate recipient address, upon receipt of the mail piece by the recipient's mail room the mail piece can be properly delivered without requiring significant work by the recipient's mailroom to determine the appropriate intended recipient.10-01-2009
20090006360SYSTEM AND METHOD FOR APPLYING RANKING SVM IN QUERY RELAXATION - An enterprise-wide query relaxative support vector machine ranking algorithm approach provides enhanced functionality for query execution in a heterogeneous enterprise environment. Improved query results are obtained by adjusting ranking functions using machine learning methods to automatically train ranking functions. The improved query results are obtained using a list of document-query pairs that are modeled as a binary classification training problem, combination function which requires ranking and learning functions to be implemented representing document attributes and metadata utilizing query relaxation techniques and adjusted ranking functions. Machine learning methods implement user feedback to automatically train ranking functions.01-01-2009
20090157672METHOD AND SYSTEM FOR MEMORY AUGMENTATION - In a method for memory augmentation, a contextual factor is detected for a user, a query is selected for a database based upon the contextual factor, a query result is received with one or more reminders relevant to the user and a reminder has information relevant to at least one of recalling and reinforcing a memory, any number of reminders are selected from the query result based upon an estimation of information beneficial for the user to receive, and sending information from the selected reminders for presentation on a communication channel with at least one of the information for presentation and the communication channel is selected based upon at least one of a situation and a state for the user.06-18-2009
20090157663MODELING QUALITATIVE RELATIONSHIPS IN A CAUSAL GRAPH - The invention relates to a system (06-18-2009
20090157671System And Method For Providing Full-Text Search Integration In XQuery - A system and method for providing full-text search integration in XQuery is presented. A built-in search function defined in an XQuery language is implemented, and a full-text search is initiated. The search function includes one or more search terms and a relation logic. Variants for each search term in the search function are identified. Posting lists are obtained for one or more of the variants. Each posting list includes values offset from elements containing the search term associated with the variant to which the posting list corresponds. The relation logic is applied to the offset values of the posting lists. Those elements with offset values that satisfy the relation logic are selected. The elements that satisfy the relation logic are provided as results of the full-text search.06-18-2009
20090157662Electroencephalography based systems and methods for selecting therapies and predicting outcomes - A method and system for utilizing neurophysiologic information obtained by techniques such as quantitative electroencephalography (QEEG), electrode recordings, MRI in appropriately matching patients with therapeutic entities is disclosed. The present invention enables utilization of neurophysiologic information, notwithstanding its weak correlation with extant diagnostic schemes for mental disorders, for safer and expeditious treatment for mental disorders, discovering new applications for therapeutic entities, improved testing of candidate therapeutic entities, inferring the presence or absence of a desirable response to a treatment, and deducing the mode of action of one or more therapeutic entities. In particular, methods for effectively comparing neurophysiologic information relative to a reference set are disclosed along with database-based tools for deducing therapeutic entity actions on particular patients such that these tools are readily accessible to remote users.06-18-2009
20090157670CONTENTS-RETRIEVING APPARATUS AND METHOD - An image database stores data of variable images as contents, and at least a keyword is attached to each image. A degree of relevancy between every pair of keywords of the images stored in the image database is calculated at constant time intervals, to produce time-sequential data on inter-keyword relevancy of each pair. When a search keyword is entered, a basic relevancy is calculated by smoothing the time-sequential data on the inter-keyword relevancy between the search keyword and a keyword that is attached to an image extracted on the basis of the search keyword. If other keywords are attached to the extracted image, a total relevancy of the extracted image is calculated by averaging the basic relevancies of the respective keywords of the extracted image to the search keyword. Among many extracted images, those having higher relevancies to the search keyword are output as a search result.06-18-2009
20090157669SEARCH SUPPORTING APPARATUS AND METHOD UTILIZING EXCLUSION KEYWORDS - Facilitating a user determination of an exclusion keyword in order to specify an efficient exclusion of an unwanted piece of data when the user narrows searching objects. Exclusion is accomplished in a system having a searching object data storage for storing pieces of searching object data, a searcher for performing a primary narrowing of the search, a common keyword extractor for extracting the common keywords associated with a piece of data, and an input/output device for passing a selected keyword selected from the extracted common keywords while receiving and displaying a result from an exclusion efficiency calculator. The exclusion efficiency calculator calculates exclusion efficiency and indicates a level of exclusion efficiency of data that is not associated with a selected keyword for an individual common keyword.06-18-2009
20090157668METHOD AND SYSTEM FOR MEASURING AN IMPACT OF VARIOUS CATEGORIES OF MEDIA OWNERS ON A CORPORATE BRAND - A method and system for determining influence of various categories of content sources on a selected brand is disclosed. The method defines a brand profile using terms and URLs associated with the selected brand and queries popular search engines using the terms and URLs as search terms. The results are classified according to their category of content sources and a brand ownership score is calculated from the classified results and from other weights associated to the ranks of the results, to the category of content sources and to the search engines. The category of content sources having ownership of the selected brand is then identified from the brand ownership score.06-18-2009
20090157666METHOD FOR IMPROVING SEARCH ENGINE EFFICIENCY - In a method for improving the efficiency of a search engine in accessing, searching and retrieving information in the form of documents stored in document or content repositories, the search engine comprises an array of search nodes hosted on one or more servers. An index of the stored documents is created. The search engine processes a user search query and returns a result set of query-matching documents. The index of the search engine is configured on the basis of one or more document properties and partitioned, replicated and distributed over the array of the search nodes. The search queries are processed on the basis of the distributed index. The method realizes a framework for distributing the index of a search engine across several hosts in a computer cluster, relying on three orthogonal mechanisms for index distribution, namely index partitioning, index replication, and assignment of replicas to hosts. In this manner, different ways of configuring the index of a search engine are obtained and provide a much improved resource usage and performance, combined with any desired level of fault tolerance.06-18-2009
20090157665DEVICE AND METHOD FOR AUTOMATICALLY EXECUTING A SEMANTIC SEARCH REQUEST FOR FINDING CHOSEN INFORMATION INTO AN INFORMATION SOURCE - A device (D) is intended to work for at least one communication terminal (T) arranged for searching information in at least one information source (IS) when it is connected to it. This device (D) comprises i) a first means (FM) arranged for storing at least one semantic search request defining chosen information to be searched into the information source(s) (IS) at chosen instants for a chosen terminal user, ii) a second means (SM) arranged for interpreting the stored semantic request at these chosen instants, and iii) a third means (TM) arranged for automatically executing the semantic search request interpreted at the chosen instants, and for warning a user when information corresponding to his executed request has been found.06-18-2009
20090157661DIGITAL CONTENT SEARCHING TOOL - Embodiments of the invention may include a method for searching digital content in a data processing system. The method may include providing a set of sample digital resources. Each sample digital resource may be associated with metadata describing its content, including fields having associated metadata values. A user may select at least one sample digital resource from the set. One or more metadata values of the sample digital resource may be displayed to the user. The user may then select at least a portion of the metadata values. A digital resource having one or more metadata values substantially matching the selected metadata value of the sample digital resource may then be retrieved.06-18-2009
20090157660Methods and systems employing a cohort-linked avatar - Avatars, methods, apparatuses, computer program products, devices and systems are described that carry out obtaining at least one item description; determining an indication of fit between at least one aspect of the item description and at least one cohort-linked avatar; and transmitting the indication of fit to at least one entity.06-18-2009
20090157659SYSTEMS, METHODS, AND COMPUTER PRODUCTS FOR INFORMATION SHARING USING PERSONALIZED INDEX CACHING - Systems, methods, and computer products for information sharing using personalized index caching. Exemplary embodiments include a method including receiving a search query history from a node X in a node A, extracting characteristics of an index of the node A, searching the extracted characteristics, which include a file ID that is included in the index of the node A, adding metadata information to the index of the node A, in response to a determination that the node A includes at least one additional local metaindex, searching the at least one additional metaindex with the search query history from the node X in the node A, and merging search results with the metaindex of the node A, wherein the one additional metaindex merged to the metaindex of the node A includes an acquisition path, and sending the metaindex of the node A to the node X.06-18-2009
20090157657METHOD AND SYSTEM FOR TRANSCODING WEB PAGES BY LIMITING SELECTION THROUGH DIRECTION - Signature schema documents, pre-defined in a query language, provide one or more instructions for application by an engine to transcode web pages of respective web sites. The instructions identify a web page family for the web page and extract a subset of data from the web page using one or more signatures previously identified within web pages of the same web page family (e.g. in accordance with a shared template for each family) of the web site. The instructions may include one or more directional references relative to the signatures to locate and extract the subset of data within the web page. Signatures may comprise text strings within the code of the web page and the directional references indicate positions of respective data relative to the location of the text strings. Transcoding may facilitate use of e-commerce web sites by wireless mobile devices.06-18-2009
20090157654System and method for presenting mixed media - Disclosed is a system and method for presenting mixed media. The system comprises at least a location component, a time component, and an event component. The event component provides specified records or incidents. The location component provides specified places or areas. The time component provides specified times or time intervals. Disclosed embodiments provide multi-dimensional information retrieval, which is useful for a user to obtain the relevant location and time while inquiring about a specific event of interest. Disclosed embodiments also present the interaction and relation of multi-dimensional information retrieval.06-18-2009
20090157650OUTBOUND CONTENT FILTERING VIA AUTOMATED INFERENCE DETECTION - One embodiment of the present invention provides a system that facilitates filtering outbound content via inference detection. During operation, the system identifies content sent to a first address and extracts keywords from the identified content. The system then issues queries based on these keywords and extracts expected-content keywords from the hits returned in response to the queries. The system then searches the outbound content for occurrences of the expected-content keywords and produces a result which allows a user to determine whether the outbound content is proper. In a further embodiment, the system extracts keywords from a piece of outbound content, and issues queries based on these keywords. The system then extracts keywords from the hits, and presents at least one keyword to a user, thereby allowing the user to determine whether the outbound content is proper.06-18-2009
20090157649Hybrid Method and System for Content-based 3D Model Search - The present disclosure concerns a hybrid content-based 3D model search and retrieval method for queries in generic 3D model datasets. The hybrid nature of the method is two-fold. First, a combination of 2D and 3D features is used as the shape descriptor of a 3D model and second, two alternative alignment techniques, CPCA and NPCA, are employed for rotation normalization. The 2D features are Fourier coefficients extracted from three pairs of depth buffers which are computed for each Cartesian plane capturing the model's thickness along each axis. The 3D features are spherical harmonic coefficients extracted from a spherical function based representation that captures the model's surface as well as volume information.06-18-2009
20090157648Method and Apparatus for Discovering and Classifying Polysemous Word Instances in Web Documents - A method and apparatus for discovering polysemous words and classifying polysemous words found in web documents. All document corpora in any natural language have words that have multiple usage contexts or words that have multiple meanings. Semantic analysis is not feasible for classifying all word occurrences in all documents on the web, which contain trillions of words in total. In addition, semantic analysis typically cannot distinguish multiple usages of a given meaning of a given word. In one embodiment of this invention, polysemous words in natural languages can be discovered by analyzing the co-occurrence of other words with the polysemous word in web documents. In one embodiment, the multiple meanings and usages of a polysemous word can be determined by analyzing the co-occurrences of other words with the polysemous word. In one embodiment, counting overcorrelations is achieved probabilistically to minimize use of network bandwidth.06-18-2009
20090157646MITIGATION OF SEARCH ENGINE HIJACKING - The subject matter disclosed herein relates to mitigation of search engine hijacking.06-18-2009
20090157645RELATING SIMILAR TERMS FOR INFORMATION RETRIEVAL - A resource analyzer selects a resource (e.g., document) from a grouping of resources. The grouping of resources can be any type of social tagging system used for information retrieval. The selected resource has an assigned uncontrolled tag and an assigned controlled tag. The controlled tag is a term derived from a controlled vocabulary of terms. Having selected the resource for analyzing, the resource analyzer identifies a first set of resources in the grouping of resources having also been assigned a same value as the uncontrolled tag as the selected resource. Similarly, the resource analyzer identifies a second set of resources in the grouping of resources having also been assigned a same value as the controlled tag. With this information, the resource analyzer then produces a comparison result indicative of a similarity between the first set of resources and the second set of resources.06-18-2009
20090157644EXTRACTING SIMILAR ENTITIES FROM LISTS / TABLES - Large numbers of lists of entities may be mined for similar entities to related searches. A representation for each list may be determined to provide for a comparison between lists and to support membership checks. A score for an element in a list may be computed that represents the validity of an item in the corpus of lists. Thus, a spurious element would receive a very low score, where a valid element would receive a higher score. A list weight is then computed using the constituent element weights, and the element and list weight are used to compute the nearest neighbors of a given query element.06-18-2009
20090157643SEMI-SUPERVISED PART-OF-SPEECH TAGGING - Relevant search results for a given query may be determined using click data for the query and the number of times the query is issued to a search engine. The number of clicks that a result receives for the given query may provide a feedback mechanism to the search engine on how relevant the result is for the given query. The frequency of a query along with the associated clicks provides the search engine with the effectiveness of the query in producing relevant results. Edges in a graph of queries versus results may be weighted in accordance with the click data and the efficiency to rank the search results provided to a user.06-18-2009
20080301125EVENT PROCESSING QUERY LANGUAGE INCLUDING AN OUTPUT CLAUSE - An event processor can use queries to operate on event streams. Event processing queries can include an output clause to restrict the output of the query.12-04-2008
20080319971Phrase-based personalization of searches in an information retrieval system - An information retrieval system uses phrases to index, retrieve, organize and describe documents. Phrases are identified that predict the presence of other phrases in documents. Documents are then indexed according to their included phrases. Related phrases and phrase extensions are also identified. Phrases in a query are identified and used to retrieve and rank documents. Phrases are also used to cluster documents in the search results, create document descriptions, and eliminate duplicate documents from the search results, and from the index.12-25-2008
20090144254AGGREGATE SCORING OF TAGGED CONTENT ACROSS SOCIAL BOOKMARKING SYSTEMS - Embodiments of the present invention address deficiencies of the art in respect to social bookmarking and provide a method, system and computer program product for aggregating scoring of tagged content across social bookmarking systems. In an embodiment of the invention, a method for aggregating scoring of tagged content across social bookmarking systems can be provided. The method can include combining tag scores for a tag in content across multiple different social bookmarking systems into a single aggregate tag score and applying the single aggregate tag score to the tag in the content. In this regard, combining tag scores for a tag in content across multiple different social bookmarking systems into a single aggregate tag score can include computing either a simple or a weighted average of the tag scores for the tag to produce the single aggregate score.06-04-2009
20090100037SUGGESTIVE MEETING POINTS BASED ON LOCATION OF MULTIPLE USERS - A system, method, and computer readable medium are provided for suggesting meeting locations to multiple users based, at least in part, on their current locations. In one example, a method includes receiving location information associated with at least two users, determining a center location with respect to the received location information, causing a search for a meeting location based on the determined center location, and causing communication of the meeting location(s) to at least one of the users. The method may further include receiving search criteria, where the search includes searching point-of-interest locations based on the center location and filtering the search results based on the search criteria. The method may further include receiving or using additional context information in addition to location information, such as time of day, day of the week, traffic conditions, weather conditions, and the like, to filter or order the search results.04-16-2009
20100030767METHODS AND SYSTEMS THAT PROVIDE UNIFIED BILLS OF MATERIAL - A method for presenting a user with a unified bill of material from a plurality of bills of materials respectively stored in multiple databases is disclosed. The method includes receiving at least one keyword from the user, querying a taxonomy associated with the at least one keyword, utilizing a semantic based ontology model to generate queries for forwarding to the databases, the queries based on the taxonomy and the at least one keyword, receiving from the databases, a listing of the available information stored in the databases that includes the at least one keyword, presenting the listing to the user, receiving from the user, based on the listing, a selection of the information they wish to retrieve from the databases, generating a query requesting retrieval of the user selected information from the databases, receiving the retrieved information from the databases, and providing the retrieved information in an organized format.02-04-2010
20100030762REDUCING LAG TIME WHEN SEARCHING A REPOSITORY USING A KEYWORD SEARCH - Embodiments of the invention provide systems and methods for searching a repository of information such as a database using a keyword search and/or an attribute search in near real time. According to one embodiment, a method of searching a repository of information can comprise receiving a set of search criteria for performing the search and selectively performing one or more of an attribute search and a keyword search of the information in the repository based on the received search criteria.02-04-2010
20090125512SCHEMA MAPPER - Embodiments of the present invention provide the ability to effectively visualize the mapping between two schemas, referred to herein as a source schema (or first schema) and a destination schema (or second schema), regardless of the size or complexity of the schemas and mappings. According to one aspect of the present invention a method for visually representing a mapping between a first schema and a second schema is provided. The method includes receiving a selection of an object, emphasizing the selected object and identifying a plurality of objects that are relevant to the selected object. The objects that are identified as being relevant to the selected object are also emphasized.05-14-2009
20090125511PAGE RANKING SYSTEM EMPLOYING USER SHARING DATA - Standard web content search result relevance and ranking is improved by considering certain social reference data, such as the number of times an item of content is shared, normalized for the number of times it is viewed. A system and method for improving the relevance and ranking includes a system and method for tracking the social references and a system and method for operating on search engine results to either re-order the results based on social reference data, re-order the search results based on a combination of the social reference data and the web search engine's ordering, and/or display the social reference data either with the search results reordered or in the order provided by the web search engine. Many different forms of data constitute social reference, including sharing content or a link thereto by email, SMS, posting to a link-sharing site, blog, and bookmarking in a web browser.05-14-2009
20090125510DYNAMIC PRESENTATION OF TARGETED INFORMATION IN A MIXED MEDIA REALITY RECOGNITION SYSTEM - A context-aware targeted information delivery system comprises a mobile device, an MMR matching unit, a plurality of databases for user profiles, user context and advertising information, a plurality of comparison engines and a plurality of weight adjusters. The mobile device is coupled to deliver an image patch to the MMR matching unit, which in turn performs recognition to produce recognized text. The recognized text is provided to first and second comparison engines to produce relevant topics and relevant ads. The relevant topics and relevant ads are adjusted with information from a user context database including information such as location, date, time, and other information from a user profile. The weight-adjusted relevant topics and relevant ads are provided to a third comparison engine. The third comparison engine compares the weighted relevant topics and relevant ads to produce a set of final ads that are most related to the topics of interest for the user and are delivered for display on the mobile device.05-14-2009
20090125509DOCUMENT RECOGNIZING APPARATUS AND METHOD - A document recognizing apparatus includes a display control unit which displays a document data including a character string related to a character string selected by a user, and an area that includes at least a character string of the document data.05-14-2009
20090125508SYSTEMS AND METHODS FOR FILE TRANSFER TO A PERVASIVE COMPUTING SYSTEM - Embodiments of the invention described herein provide a system and method for transferring files in a pervasive computing system. The method comprises the steps of selecting at least a sub-set of files to be made available to a remote computing system and determining a relevance score for each of the sub-set of files to a remote computing system. Information regarding the relevance of each of the files in the sub-set of files is made available with the file.05-14-2009
20090125507COMPUTER AIDED DESIGN SYSTEM - A computer aided design system is provided that includes a display, an input unit for inputting a circuit search-range narrowing condition, and a processing unit for, when a circuit topology of a circuit to be designed is changed, finding recommended circuits by searching a database, which stores part data and circuit data, based on the circuit search-range narrowing condition, and displaying a list of the recommended circuits on the display.05-14-2009
20090125506Method of Managing Messages In Archiving System For E-Discovery - Provided is a method for managing messages in an archiving system for E-Discovery. The method includes capturing a message by classifying the message using at least one of a port number, a packet content and a packet pattern at the time of messaging a message transmitted by all communication devices officially recognized in a company, storing the message at an on-line storage through an indexing and a compression after removing a duplicate content of the message for a large capacity retrieval, and backing up the data at an unalterable permanent recording medium in accordance with a priority selectively designated according to the attribute.05-14-2009
20090125505INFORMATION RETRIEVAL USING CATEGORY AS A CONSIDERATION - Category affinity may be used as a consideration in providing search results. A taxonomy of substantive categories is created and/or obtained. A corpus of documents is compared with the taxonomy to determine the category(ies) with which the documents affine. A query is also compared with the taxonomy to determine the category(ies) with which the query affines. A document may receive a category score based on how well the document's category(ies) match the query's category(ies). This document score may be combined with other scores, such as a text score, a link score, and a distance score, and/or any other factors, to determine an overall relevance score. The relevance score may then be used to rank and present search results.05-14-2009
20090125504Systems and methods for visualizing web page query results - Systems and methods for providing search results responsive to a search query are provided. A submitted search query is received from a search requester. Search results relevant to the search query are obtained from a document index. Each search result comprises (i) a source document or a reference to a source document and (ii) a static graphic representation of the source document. The static graphic representation of the source document was obtained from the source document at a time before the submitted search query was received. The static graphic representation of the source document of a first search result in the search results is displayed in a center position of a graphic output device. The static graphic representation of the source document of a second search result in the search results is displayed in an off-center position of the graphic output device. The static graphic representation of the source document of the second search result is displayed rotated about an axis of rotation that lies between the center position and the off-center position.05-14-2009
20090125502SYSTEM AND METHODS FOR GENERATING DIVERSIFIED VERTICAL SEARCH LISTINGS - A method of generating a diversified vertical search results listing, including listing attribute values related to search criteria and their frequency of occurrence to create a plurality of listings; creating a plurality of interval bands based on the plurality of listings; generating a random diversity score for each listing over a substantially uniform distribution within each of the plurality of bands; and sorting a set of search results for diversified listing in response to a user searching for the search criteria according to the diversity score of each listing.05-14-2009
20090125501RANKER SELECTION FOR STATISTICAL NATURAL LANGUAGE PROCESSING - Systems and methods for selecting a ranker for statistical natural language processing are provided. One disclosed system includes a computer program configured to be executed on a computing device, the computer program comprising a data store including reference performance data for a plurality of candidate rankers, the reference performance data being calculated based on a processing of test data by each of the plurality of candidate rankers. The system may further include a ranker selector configured to receive a statistical natural language processing task and a performance target, and determine a selected ranker from the plurality of candidate rankers based on the statistical natural language processing task, the performance target, and the reference performance data.05-14-2009
20090125500OPTIMIZATION OF ABSTRACT RULE PROCESSING - Embodiments of the invention provide techniques for optimizing the processing of abstract rules. In general, the results of executing an abstract query may be used as data inputs for processing an abstract rule. In one embodiment, query results may be sorted according to input field values required for processing a deterministic abstract rule. If a record of the sorted query results includes the same input values as a preceding record, then the rule output of the preceding record may be reused, rather than processing the abstract rule again. Accordingly, the demand load placed on a rule engine may be reduced.05-14-2009
20090125499MACHINE-MODERATED MOBILE SOCIAL NETWORKING FOR MANAGING QUERIES - Systems and methods of machine-moderated mobile social networking for managing queries are disclosed here. In one aspect, embodiments of the present disclosure include a method, which may be implemented on a system, of receiving queries from a mobile device and intelligently distributing the queries among users that are deemed suitable to provide useful insight to the queries. The queries are typically questions asked by potential patrons regarding specific venues, patrons looking for specific businesses and/or events that fit their specific criteria, by way of example but not limitation, geography, locale, type of cuisine, ambience, music, etc. In most instances, a consumer can send the query from a portable device (e.g., cell phone, Blackberry, telephone, iPhone, Treo, etc.) in various formats (e.g., SMS text, voice call, USSD message, IM, and/or email, etc.) to a predetermined phone number and/or other types of address identifiers.05-14-2009
20090125498Doubly Ranked Information Retrieval and Area Search - In a search system, document terms are weighted as a function of prevalence in a data set, the documents are scored as a function of prevalence and weight of the document terms contained therein, and then independently, the documents are ranked for a given search as a function of (a) their corresponding document scores and (b) the closeness of the search terms and the document terms. The steps can all be accomplished using matrices. Subsets of the documents can be identified with various collections, and each of the collections can be assigned a matrix signature. The signatures can then be compared against terms in the search query to determine which of the subsets would be most useful for a given search.05-14-2009
20090307204ADAPTIVE APPLICATION OF SAT SOLVING TECHNIQUES - A computer-implemented method for solving a satisfiability (SAT) problem includes defining a formula, including variables, which refers to properties of a target system. Using a chosen search strategy, a search process is performed over possible value assignments of the variables for a satisfying assignment that satisfies the formula. A performance metric estimating an effectiveness of the search process is periodically evaluated during the search process. The strategy of the search process is modified responsively to the evaluated performance metric. The method determines, using the search process, whether the formula is satisfiable on the target system.12-10-2009
20090307203METHOD OF LOCATING CONTENT FOR LANGUAGE LEARNING - A method is disclosed to gather content for use in language learning, the method comprising combining a user entered search phrase with a user model derived by a language learning program, which user model identifies subject matter that should be learned or practiced. The delivered content then represents topics of interest to a user, and also is optimized to work with the language learning program to teach the target language.12-10-2009
20090307202Method for Automatically Indexing Documents - A method for retrieving based on a search term together with its corresponding meaning from a set of base documents those documents which contain the search term and in which the certain search term has the certain meaning to enable the building of an index on the retrieved documents. The method includes searching for those base documents among the set of base documents which contain the certain search term and evaluating the found base documents as to whether the search term contained in the found base documents, respectively, has a certain meaning. Evaluation includes generating a text document to represent elements surrounding the search term and their corresponding absolute or relative position with respect to the search term; inputting the text document into a trainable classifying apparatus; classifying the inputted text document to judge whether the search term has the inputted meaning.12-10-2009
20090043764AUGMENTING A TRAINING SET FOR DOCUMENT CATEGORIZATION - A method and system for augmenting a training set used to train a classifier of documents is provided. The augmentation system augments a training set with training data derived from features of documents based on a document hierarchy. The training data of the initial training set may be derived from the root documents of the hierarchies of documents. The augmentation system generates additional training data that includes an aggregate feature that represents the overall characteristics of a hierarchy of documents, rather than just the root document. After the training data is generated, the augmentation system augments the initial training set with the newly generated training data.02-12-2009
20090043763System of fast launching network link service and method thereof - A system of fast launching network link service comprising: a search engine server, and a client device. The search engine server, through a receiving unit, receives a key-data produced by the client device, and through an analysis unit of a search unit analyzes the form of key-data, and then searches a database to output at least one network link service. The client device, through a link menu interface, produces a link candidate corresponding to the network link service. Furthermore, the client device can open the network link service represented by the link candidate. Hence, the present invention can achieve the purpose of simplifying the steps of operating search engine.02-12-2009
20090043762INFORMATION RETRIEVAL SYSTEM AND METHOD - An information retrieval method, process, and apparatus are provided which include iterative or parametric data set querying. The result of each query iteration is displayed in an easy to analyze fashion, enabling the user to interactively refine the query with additional iterations. Each field of data in a data set is represented by a filter in a filter tree table. A user may graphically select and de-select filters using the filter tree table. The selections are converted into a filtering query that is run against the data set to produce filtered data. A summary query is then run against the results of the filtering query. The filtered data is displayed, along with the selected filters of the filter tree table. The filter tree table may also include and display other information related to each filter, such as an associated data item count as generated by the summary query. Further user input is accepted, with the user input further selecting or de-selecting data groupings to be displayed. The user input is fed back to generate another filtering iteration. In this manner, when the user makes a single selection or de-selection, all applicable filters are changed, and the user changes are propagated through all appropriate filters.02-12-2009
20090043757METHOD AND SYSTEM FOR CREATING A REPRESENTATION OF A WEB PAGE USING KEYWORDS OR SEARCH PHRASES - The invention provides a method of providing information over a network, comprising preparing a representation of a web page, wherein a computer program may compile the representation from a plurality of information sources that may be included in the representation, the information sources including a plurality of keywords or search phrases, and providing the representation to a search engine.02-12-2009
20090043756COMPUTER PROGRAM, SYSTEM AND METHOD FOR CREATING REPRESENTATIONS OF WEB PAGES AND TRANSMITTING CRAWLER LINKS FOR CRAWLING THE REPRESENTATIONS - The invention provides a method of providing information over a network, comprising utilizing a computer program to create a representation of a web page, utilizing the computer program to store the representation at a representation location, and utilizing the computer program to transmit a crawling link to the search engine, the crawling link being utilized by a crawler to access and copy the representation from the representation location to the search database, to provide the representation to a search engine.02-12-2009
20090043755SYSTEMS AND METHODS FOR DYNAMIC PAGE CREATION - A computer-implemented method of dynamically creating a page module for a word on a display screen is provided. The page module is created by determining a word provided for defining the page module, searching for text directly associated with the word and text contextually associated with the word, and searching for media directly associated with the word and contextually associated with the word. The type of page module layout for the word is then identified. The page module layout includes placeholders for displaying at least some of the text that is directly or contextually associated with the word and at least some of the media that is directly or contextually associated with the word in the page module. The page module is then displayed by drawing the page module layout on the display screen and populating the page module layout with at least some of the media that is directly or contextually associated with the word.02-12-2009
20090043753METHOD FOR GENERATING STRUCTURED QUERY RESULTS USING LEXICAL CLUSTERING - The present invention provides for the generation of structured query results using lexical clustering which includes collecting one or more search queries and data associated with the one or more search queries. The present invention further includes preprocessing the one or more queries into a canonicalized form of each of the one or more queries. The canonicalized form of each of the one or more queries may be accomplished using stemming, punctuation, pluralization, word order or other canonicalization rules. The present invention further includes building a lexical index of the one or more search queries and data associated with the one or more search queries and mining the lexical index of the one or more search queries and data associated with the one or more search queries in order to generate a structured query result set.02-12-2009
20090043748ESTIMATING THE DATE RELEVANCE OF A QUERY FROM QUERY LOGS - Techniques are provided for maintaining data that indicates for a plurality of query terms whether the plurality of query terms are date-qualified query terms. A query is received, and in response to receiving the query, the query is inspected to determine that the query contains a particular date-qualified query term. Then it is determined that the particular date-qualified query term has been associated with a plurality of dates, and it is determined which of the plurality of dates with which to associate the date-qualified query term for the query, based at least in part on the frequency with which each particular date of the plurality of dates has been associated with the particular date-qualified query term.02-12-2009
20090043752Predicting Side Effect Attributes - A method and system for predicting attribute side effects of predisposition modification are presented in which a set of attributes for predisposition modification of an individual is used to modify the individual's attribute profile to generate a modified attribute profile. A set of attribute combinations is compared against the modified attribute profile in order to identify those attribute combinations from the set that also occur in the modified attribute profile. Candidate side effect attributes that are statistically associated with the identified attribute combinations, but not present in the modified attribute profile, are stored as predicted side effect attributes that would result from the intended predisposition modification of the individual.02-12-2009
20090043749EXTRACTING QUERY INTENT FROM QUERY LOGS - Techniques are provided for storing queries received by a search engine in a query log. For a particular query term in the query, it is determined how many queries in the query log contain that particular query term and an intent-indicating term, and determined how many queries in the query log contain that particular query term without an intent-indicating term. Based on the ratio between the number of queries in the query log that contain the particular query term and the intent-indicating term and the number of queries in the query log that contain the particular query term without the intent-indicating term, it is determined whether the particular query term is an intent-qualified query term. In response to determining that the particular query term is an intent-qualified query term, data is stored in a computer-readable medium that identifies the query term as an intent-qualified query term. Implicit-intent queries that contain the intent-qualified query term are processed based, at least in part, on the intent associated with the intent-qualified query term.02-12-2009
20090043759Display and search interface for product database - A technique for displaying and searching databases provides a user interface that displays a list of attribute values of a product along with corresponding user interface elements, each containing a set of clickable sub-elements corresponding to subsets of possible attribute values with different ranks. The sub-element whose corresponding rank matches a corresponding rank of the attribute value is displayed as highlighted. Clicking a sub-element constrains a current selected set of products to those whose attribute values have the same rank as the clicked sub-element. On mouse-over of a selected sub-element, pop-up text is displayed containing a set of possible attribute values whose rank corresponds to that of the selected sub-element, and decision support information associated with each of the displayed possible attribute values, e.g., a percentage of users who have selected the attribute value, a percentage of users who have purchased a product with the attribute value, or a price range of product records having the attribute value.02-12-2009
20100005082SINGLE-TAP INPUT REMOTE SERVER ACCESS - A method for searching a remote server using a portable device including an ambiguity keyboard, the ambiguity keyboard having a plurality of keys where at least one key corresponds to more than one symbol, includes: according to a single-tap input of at least one key of the ambiguity keyboard, generating an ambiguity string including at least a symbol corresponding to the key; sending the ambiguity string to the remote server; utilizing a database in the remote server to match the ambiguity string to an existing keyword in the database; and sending the existing keyword back to the portable device.01-07-2010
20090327276ORGANISING AND STORING DOCUMENTS - A data handling device has access to a store of existing metadata pertaining to existing documents having associated metadata terms. It analyses the metadata to generate statistical data as to the co-occurrence of pairs of terms in the metadata of one and the same document. When a fresh document is received, it is analysed to assign to it a set of terms and determine for each a measure of their strength of association with the document. Then, for each term of the set, a score is generated that is a monotonically increasing function of (a) the strength of association with the document and of (b) the relative frequency of co-occurrence of that term and another term that occurs in the set; metadata for the fresh document are then selected as the subset of the terms in the set having the highest scores.12-31-2009
20090319516Contextual Advertising Using Video Metadata and Chat Analysis - A method of delivering advertising content over the internet to a selected client, the selected client being one of a plurality of clients causing display of media content synchronously. The method includes receiving chat text from at least one of the plurality of clients, generating a set of keywords using the chat text, receiving advertising content selected on the basis of the set of keywords from an advertising system, and delivering the advertising content to at least the selected client over the internet.12-24-2009
20090319512AGGREGATOR, FILTER, AND DELIVERY SYSTEM FOR ONLINE CONTENT - A computing device that filters online content to be delivered to an individual includes a processor, and a computer readable storage medium storing instructions. When the instructions are executed, the computing device is caused to: receive content from a plurality of sources, and to index the content; receive input from the individual to indicate a relevance of certain of the content; filter the content based on the input from the individual and identify relevant content for the individual; and deliver the relevant content to the individual.12-24-2009
20090313235SOCIAL NETWORKS SERVICE - A social network service provides trusted, timely and managed communications between a querying individual and an informed individual by optimizing distribution of queries to reflect a requisite amount of expertise necessary (i.e., interest, background, education, demographic attribute, etc.). Those candidate recipients with a rare level of expertise or specialization can specify a desired level of participation, which is respected. In order not to exhaust their availability, those who are less qualified or part of a larger demographic category appropriate for the query are selected to handle queries of lesser difficulty or less specialization. Anonymity if desired by the recipient party can be supported by increasing the pool of candidate recipients so that the querying party cannot reasonably ascertain who is responding. Timeliness of response, as well as satisfaction in the response, is tracked in order to affect redirection of a query.12-17-2009
20090307217Method, Device and System for Processing, Browsing and Searching an Electronic Documents - A method for processing an electronic document and its corresponding device, a method for browsing an electronic document and its corresponding browser, as well as a method for searching an electronic document and its corresponding searching system are disclosed in the present invention. The method comprises at least the following steps: generating one or more queries according to the content of said document when an author is composing the electronic document; and correspondingly storing information about said one or more queries with said electronic document. The query comprises keywords, a keyword string or questions, and the query has passed verification in order to ensure its reliability.12-10-2009
20090307216Systems and Methods for User-Constructed Hierarchical Interest Profiles and Information Retrieval Using Same - Systems and methods for delivering Web content are provided. The systems and methods include a mechanism for providing interest data that may be applied to filter Web content at the provider side. A hierarchical data set of user-identified interests is received from the user's Web client. The hierarchical data set is parsed, and responsive thereto, one or more keyword attribute values are extracted from the hierarchical data set. The extracted keyword values are applied to filter content for delivery to a requesting Web client.12-10-2009
20090307215NETWORK RESOURCE ANNOTATION AND SEARCH SYSTEM - A method and system for annotation of network resources existing within an electronic network. Further provided is a method for increasing or decreasing the relevance of network resources forming part of a search result of a network through use of annotations associated with network resources.12-10-2009
20090307214COMPUTER SYSTEM FOR PERFORMING AGGREGATION OF TREE-STRUCTURED DATA, AND METHOD AND COMPUTER PROGRAM PRODUCT THEREFOR - A computer system, methods, and programs for creating an index for aggregating data in at least one tree structure including at least one node each including one label indicating node type and values. The system includes a node ID assignment processing unit for assigning IDs to the nodes in a post order; first, second, and third index creation processing units. The first unit creates a first index having one or more sets of data including the node ID and values included in the node; the second unit creates a second index having one or more sets of data including node ID and ID of a descendant node having the minimum ID; and the third unit creates a third index having one or more sets of data including IDs of one or more nodes having specific values.12-10-2009
20090307213Suffix Tree Similarity Measure for Document Clustering - The subject innovation provides for systems and methods to facilitate weighted suffix tree clustering. Conventional suffix tree cluster models can be augmented by incorporating quality measures to facilitate improved performance. Further the quality measure can be employed in determining cluster labels that show improvements in accuracy over conventional means. Additionally “stopnodes” can be defined to facilitate traversing suffix tree models efficiently. Quality measurements can be determined based in part on weighting factors applied to terms in a vector model, said terms being mapped from a suffix tree model.12-10-2009
20090307212SYSTEM AND METHOD FOR EVENT MANAGEMENT - A cooperative scheduling system for cooperative scheduling between large numbers of independent users, the users being divided into several interest groups, comprises: a networked server, a scheduling database associated with the networked server for storing scheduling data, the scheduling database allowing categorization of the data for the interest groups; a multi-user input interface for allowing multiple remotely located users to enter scheduling data to the scheduling database, the data being categorized for the interest groups; and a multi-user output interface for allowing multiple remotely located users to retrieve scheduling data from the scheduling database, the output interface including a configuration for filtering of the retrieval according to category. Thus scheduling data is stored at a central location in a cooperative effort and is retrieved according to the level of relevance to the user.12-10-2009
20090307211INCREMENTAL CRAWLING OF MULTIPLE CONTENT PROVIDERS USING AGGREGATION - A method for incremental crawling of content stored on a plurality of content providers using aggregation is provided. The method comprises receiving a request to crawl content on one or more associated content providers; retrieving one or more first references to content on a first content provider; retrieving one or more second references to content on one or more second content providers during the same request; aggregating the first and second references; and returning the aggregated first and second references. This is done while taking into consideration an opaque timestamp object, which is managed in a distributed manner. The opaque timestamp is filled in by the content providers but stored on the crawler side between crawling sessions.12-10-2009
20090307210Text Mining Device, Text Mining Method, and Text Mining Program - Provided is a text mining device capable of showing a user whether the characteristics extracted by a text mining are either common to all texts independently of the citations, in case the text to be mined is configured with texts of a plurality of kinds of different citations, or deviated toward a text of a predetermined citation. The text mining device includes a citation information creating device for creating the citation information of texts containing characteristics extracted from a text set collected from a plurality of citations, and a mining result output device for outputting the characteristics and the citation information in a corresponding manner.12-10-2009
20090307209TERM-STATISTICS MODIFICATION FOR CATEGORY-BASED SEARCH - An apparatus for searching a document collection is provided. The apparatus includes a memory, which is arranged to store a plurality of documents that are respectively associated with one or more categories and contain terms, a search processor, which is arranged to provide an index of the terms indicating the documents in which the terms appear, to estimate a first statistical distribution of each of at least some of the terms in the index over the documents in the collection, to estimate a second statistical distribution of each of at least some of the categories over the documents in the collection, to accept a query comprising one or more of the terms and a specified category restriction referring to at least one of the categories, to compute a local term distribution, which is indicative of occurrence frequencies of at least one of the terms in the query within the specified category restriction, using the first and second estimated statistical distributions to determine a category-specific score for the at least one of the terms responsively to the local term distribution within the specified category restriction, and to apply the query to the index using the category-specific score so as to return a response, wherein the processor is arranged to construct term histograms of the at least some of the terms in the index, to construct category histograms of the at least some of the categories, and to map the documents in the collection to bins of the histograms, so as to estimate the first and second statistical distributions, and wherein the processor is arranged to determine a category restriction histogram based on the category histogram of the at least one of the categories responsively to the category restriction, and to multiply the category restriction histogram by the term histogram of the at least one of the terms in the query so as to produce a localized term histogram.12-10-2009
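The last step of entry 20090307209 (multiplying a category restriction histogram by a term histogram) can be illustrated with a minimal sketch. The abstract does not say what each bin holds, so the representation here is an assumption: `term_hist[b]` is the number of documents in bin `b` that contain the term, and `category_restriction_hist[b]` is the fraction of documents in bin `b` that satisfy the category restriction. The function names are hypothetical.

```python
from typing import List


def localized_term_histogram(term_hist: List[float],
                             category_restriction_hist: List[float]) -> List[float]:
    """Bin-wise product of a term histogram and a category restriction histogram.

    Assumed semantics (not stated in the abstract): term_hist[b] is a document
    count for bin b, category_restriction_hist[b] is the fraction of bin b that
    falls inside the category restriction.
    """
    if len(term_hist) != len(category_restriction_hist):
        raise ValueError("histograms must have the same number of bins")
    return [t * c for t, c in zip(term_hist, category_restriction_hist)]


def estimated_local_document_frequency(localized_hist: List[float]) -> float:
    """Sum the localized bins to estimate how many restricted documents contain the term."""
    return sum(localized_hist)


# Hypothetical three-bin example.
term_hist = [4, 2, 0]
category_hist = [0.5, 1.0, 0.25]
localized = localized_term_histogram(term_hist, category_hist)
local_df = estimated_local_document_frequency(localized)
```

Under these assumptions the summed result could feed a category-specific term score, for example an inverse document frequency computed over the restricted subset rather than the whole collection.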
20090307208SEMANTIC ENHANCED LINK-BASED RANKING (SEL RANK) METHODOLOGY FOR PRIORITIZING CUSTOMER REQUESTS - One exemplary method embodiment pre-processes customer requests that are maintained in a dataset to create a matrix between products and the customer requests. Each of the customer requests comprises at least a customer identification, a textual request, and a product identification related to the textual request. After such pre-processing of the dataset, the method can respond to queries of the dataset using the matrix.12-10-2009
20090307206SELF VERIFYING ADDRESS UPDATE PROCESS AND SYSTEM - Address information is analyzed and ranked to provide a relative indication of the reliability of a particular address. The system and method perform modeling of address data, which produces a model that can be applied to a particular address. A resulting score is generated for each address discovered relating to a particular individual or entity. Using these scores, multiple addresses can then be ranked to determine which address is most likely to be accurate and reliable.12-10-2009
20090307205FRIENDLY SEARCH AND SOCIALLY AUGMENTED SEARCH QUERY ASSISTANCE LAYER - Community search query technology operable to provide users with the means to collaborate on search queries and share their query results with other users in a community is disclosed. The community search query technology provides a collaborative search engine that utilizes community feedback and personal profiles. The community search query technology also comprises personal task, information management, project creation, listing queries by activity categories, setting deadlines for ongoing search needs, setting up search queues, and annotation of search sessions.12-10-2009
20090112855METHOD FOR ORDERING A SEARCH RESULT AND AN ORDERING APPARATUS - A method for ordering a search result, wherein the search result comprises a plurality of data unit candidates from hierarchical data which further comprises a data unit corresponding to a search requester, the method comprising: calculating a relative distance between the data unit of the search requester and each of the plurality of data unit candidates according to the hierarchical data, the relative distance representing relativity between the data units in the hierarchical data; and ordering the plurality of data unit candidates according to the relative distances. The present invention can order the data unit candidates according to the relativity between the data unit candidates and the data unit of the search requester in the hierarchical data so that the search requester can quickly determine the most concerned data unit candidate. The present invention also provides a corresponding ordering apparatus as well as a searching method based on hierarchical data and a searching engine.04-30-2009
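The ordering step in entry 20090112855 can be sketched as follows. The abstract does not define "relative distance", so this sketch assumes one plausible reading: the number of edges on the path between two nodes through their lowest common ancestor in the hierarchy. The `parent` mapping and all names are hypothetical.

```python
from typing import Dict, List


def ancestors(node: str, parent: Dict[str, str]) -> List[str]:
    """Return the node followed by each of its ancestors up to the root."""
    path = [node]
    while node in parent:
        node = parent[node]
        path.append(node)
    return path


def relative_distance(a: str, b: str, parent: Dict[str, str]) -> int:
    """Edges between a and b via their lowest common ancestor (assumed metric)."""
    depth_from_a = {n: i for i, n in enumerate(ancestors(a, parent))}
    for steps_from_b, n in enumerate(ancestors(b, parent)):
        if n in depth_from_a:
            return depth_from_a[n] + steps_from_b
    raise ValueError("nodes share no common ancestor")


def order_candidates(requester: str, candidates: List[str],
                     parent: Dict[str, str]) -> List[str]:
    """Sort candidates so those closest to the requester come first."""
    return sorted(candidates, key=lambda c: relative_distance(requester, c, parent))


# Hypothetical organization hierarchy: child -> parent.
parent = {"eng": "company", "sales": "company",
          "alice": "eng", "bob": "eng", "carol": "sales"}
ordered = order_candidates("alice", ["carol", "bob", "eng"], parent)
```

Here `eng` is one edge from `alice`, `bob` is two, and `carol` is four, so the candidates come back in that order.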
20090112838ONTOLOGY-BASED NETWORK SEARCH ENGINE - A method and apparatus for searching for documents residing on a network comprises receiving a search request from a user. The search request comprises one or more search terms of an ontology. The ontology includes a plurality of terms. One or more of the plurality of terms includes a plurality of sub-category terms. One or more documents residing on the network are identified based on the one or more search terms and an ontology index. The ontology index comprises a plurality of relationships between the plurality of terms and sub-category terms of the ontology and a plurality of documents residing on the network. One or more search results that describe the one or more documents are presented to the user. The one or more documents contain the one or more search terms, or one of the plurality of sub-category terms of the one or more search terms.04-30-2009
20090106233QUERY ENGINE INTERPRETER AND PRIORITIZATION ENGINE - A method for refining a search query includes receiving a query from a user, submitting at least one question to the user based on information provided in the query, and receiving an answer to the question from the user. The method also includes refining the query based on the answer received from the user, querying the database using the refined query to identify a subset of records tagged with categories relevant to the query, and delivering search results to the user.04-23-2009
20090094234IMPLEMENTING AN EXPANDED SEARCH AND PROVIDING EXPANDED SEARCH RESULTS - Implementing an expanded search and providing expanded search results comprises receiving a search query generated by a user. A type of expansion to apply to the search query is determined. Expanded search queries are automatically generated according to the determined expansion type without intervention from the user. A search is executed on each one of the expanded search queries to retrieve search results, and the search results are provided for presentation to the user in modules. A module comprises search results for one of the expanded search queries.04-09-2009
20090094231Selecting Tags For A Document By Analyzing Paragraphs Of The Document - In one embodiment, assigning tags to a document includes accessing the document, where the document comprises text units that include words. The following is performed for each text unit: a subset of words of a text unit is selected as candidate tags, relatedness is established among the candidate tags, and certain candidate tags are selected according to the established relatedness to yield a candidate tag set for the text unit. Relatedness between the candidate tags of each candidate tag set and the candidate tags of other candidate tag sets is determined. At least one candidate tag is assigned to the document according to the determined relatedness.04-09-2009
20090094224COLLABORATIVE SEARCH RESULTS - Methods, systems, and apparatus, including computer program products, for providing alternative search results for a query. In one aspect, a method includes transmitting a set of one or more search results for a query to a client device for presentation to a user, where each search result refers to a respective resource, receiving from the client device an alternative search result submitted by the user for the query, associating the alternative search result with the query, and storing in a repository the query and the alternative search result, where the alternative search result is transmitted with the set of one or more search results for a new search of the query.04-09-2009
20090094222METHOD AND SYSTEM FOR MULTIFACETED SCANNING - A method and system for multifaceted scanning, the method having the steps of receiving a data source; processing the data source for a plurality of scanning aspects, the processing step utilizing rules and policies for the plurality of scanning aspects to provide transformed, modified or adapted content; and outputting the transformed, modified or adapted content.04-09-2009
20090094218Method and system for improving performance of counting hits in a search - One embodiment of the present invention includes a method for automatically enabling a search system or application to quickly and accurately count hits corresponding to a search expression. For example, a search expression is received or retrieved that may include redundant and/or overlapping search expression components. Each narrow search expression component is removed from the search expression if joined by an “OR” operator to a broader or equivalent search expression component. Additionally, each broad search expression component is removed from the search expression if joined by an “AND” operator to a narrower or equivalent search expression component. By modifying the received search expression in this fashion, a performance gain is typically achieved for calculating the hit count while maintaining its accuracy.04-09-2009
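The two pruning rules in entry 20090094218 can be sketched for a flat list of components joined by a single operator. The abstract does not say how "broader" is decided, so this sketch assumes hypothetical prefix-search semantics: a component matches every term that starts with it, which makes `comp` broader than `computer`. All names are illustrative.

```python
from typing import List


def subsumes(broad: str, narrow: str) -> bool:
    """True if `broad` matches everything `narrow` matches (assumed prefix semantics)."""
    return narrow.startswith(broad)


def simplify(components: List[str], operator: str) -> List[str]:
    """Drop redundant components from a flat OR or AND expression.

    OR: a component is redundant if a broader or equivalent one is present.
    AND: a component is redundant if a narrower or equivalent one is present.
    """
    if operator not in ("OR", "AND"):
        raise ValueError("operator must be 'OR' or 'AND'")
    kept: List[str] = []
    for c in components:
        if operator == "OR":
            if any(subsumes(k, c) for k in kept):
                continue  # something already kept is broader or equal
            kept = [k for k in kept if not subsumes(c, k)]  # c replaces narrower ones
        else:
            if any(subsumes(c, k) for k in kept):
                continue  # something already kept is narrower or equal
            kept = [k for k in kept if not subsumes(k, c)]  # c replaces broader ones
        kept.append(c)
    return kept


or_result = simplify(["comp", "computer", "data"], "OR")
and_result = simplify(["comp", "computer", "data"], "AND")
```

With the OR operator the narrower `computer` is dropped; with AND the broader `comp` is dropped. In both cases the set of matching documents is unchanged, which is why the hit count stays accurate.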
20080270373Method and Apparatus for Content Item Signature Matching - An apparatus for content item signature matching comprises a database (10-30-2008
20090300002Proactive Information Security Management - A method and apparatus for proactive information security management is described. In one embodiment, for example, a computer-implemented method for controlling access to sensitive information, the method comprising: maintaining access constraint data that can be used to control access to the sensitive information, wherein the access constraint data includes match pattern data and apply pattern data; receiving a semantic query from a querier requesting access to the sensitive information; based on the match pattern data, determining whether the semantic query should be constrained according to the apply pattern data; where said semantic query should be constrained according to the apply pattern data, rewriting the semantic query according to the apply pattern data to produce a rewritten query; executing the rewritten query against a database that contains the sensitive information; and returning any results of executing the rewritten query.12-03-2009
20090300000Method and System For Improved Search Relevance In Business Intelligence systems through Networked Ranking - Method and system for optimizing search results in a business intelligence system. A member is selected in the business intelligence system having a user space, a content space, a data space, a master-data space and a metadata space. A relationship is determined between the member and a plurality of objects in the user space, the content space, the data space, the master-data space, or the metadata space. A ranking of the member is calculated based on the relationship. A relevance of the member in the business intelligence system is calculated using the ranking, thereby optimizing search results of the business intelligence system using the relevance of the object.12-03-2009
20090287699METHOD, DEVICE AND SYSTEM FOR QUALITY CHECK - An embodiment of the present invention discloses a quality check (QC) method, including: determining a QC object to be checked and its QC content; searching a system where QC data needed for the QC is located, according to the determined QC object and its QC content, and obtaining the corresponding QC data from the system; and computing QC result according to the obtained QC data. With the QC method, it may perform a uniform hierarchical QC on service representatives in different systems and QC results can be stored uniformly in the same database, which facilitates query, statistics and analysis of the QC results. An embodiment of the present invention also discloses a QC system including a database and QC device. The QC device includes a user input unit and a master control unit.11-19-2009
20090287695SYSTEMS AND METHODS FOR BIDIRECTIONAL MATCHING - Described herein are systems and methods for bidirectional matching. In overview, various embodiments provide software, hardware and methodologies underlying a bidirectional matching approach that implements a multi-level importance weighting procedure. Generally speaking, potential relationships between parties are scored on the basis of criterion matches. In some embodiments, a value is assigned to each criterion match based on a function of a predefined factor, which is optionally experientially defined, and a further factor, which is defined based on individual preferences.11-19-2009
20090287686PLAYBACK DEVICE - A playback device includes a communication component, an operation component and a playback control component. The communication component is configured to communicate with a network device via a network. The operation component is configured to select a random playback of a plurality of content items that is stored in the network device. The playback control component is configured to control the random playback of the content items. The playback control component acquires only numerical information of the content items from the network device when the operation component selects the random playback of the content items with the numerical information indicating number of the content items. The playback control component randomly determines one of the content items based on the numerical information. The playback control component acquires the one of the content items from the network device to play the one of the content items.11-19-2009
20090271401System for software source code comparison - A system for analyzing similarities between a first and second corpus or between a set of concepts and a corpus uses natural language processing and machine intelligence methods to replace terms or phrases in the corpus with concepts, determine the frequency of each concept in the corpus, and convert the corpus into a concept frequency file to enable easy comparison of the two corpuses or easy retrieval of items from the corpus that contain a concept. Difference analysis and a combination of content and spectral analysis may be employed.10-29-2009
20090271400Point of Interest Search Device and Point of Interest Search Method - A point of interest (POI) search device includes: a static POI data storage means for storing therein a static POI data registered in advance; an added POI data storage means for storing therein an added POI data added or changed; a deleted POI data storage means for storing therein a deleted POI data for identifying a POI data which is not subjected to a search anymore, among the static POI data; a search keyword setting means; a computation switch means for computing the number of POI data matching the search keyword from among the static POI data and switching a computation processing of computing the number of POI data according to the computed number of POI data; and a POI search means for computing the number of POI data, using at least the static POI data and the added POI data according to the switched computation processing.10-29-2009
20090271399METHOD AND SYSTEM FOR SEARCHING CONTENT AT A PRIMARY SERVICE PROVIDER THROUGH A PARTNER SERVICE PROVIDER - A method and system for generating a search includes a user device, a partner service provider in communication with the user network device, and a primary service provider in communication with the partner service provider. The user device generates a search request for search data at the user device and communicates the search request to a partner service provider. The partner service provider communicates the search request to a primary service provider. The primary service provider generates search results data and communicates search results data to the user device. The user device displays the search results on a display device associated with the user device.10-29-2009
20090271398METHOD AND SYSTEM FOR RECOGNITION OF VIDEO CONTENT - A method and system is provided for recognizing video content represented by temporally segmented video content. An example system includes a communication module and a search and match module. The communications module may be configured to receive a source table of contents (TOC) related to a temporally segmented video content. The source TOC may include one or more titles and a source playback length. The search and match module may be configured to interrogate a video products database with the source TOC to determine one or more match results, utilizing a fuzzy matching technique.10-29-2009
20090271397STATISTICAL RECORD LINKAGE CALIBRATION AT THE FIELD AND FIELD VALUE LEVELS WITHOUT THE NEED FOR HUMAN INTERACTION - Disclosed is a system for, and method of, calculating parameters used to determine whether records and entity representations should be linked. The system and method apply iterative techniques such that parameters from each linking iteration are used in the next linking iteration. The system and method need no human interaction in order to calibrate and utilize record matching formulas used for the linking decisions.10-29-2009
20090271394DETERMINING THE DEGREE OF RELEVANCE OF ENTITIES AND IDENTITIES IN AN ENTITY RESOLUTION SYSTEM THAT MAINTAINS ALERT RELEVANCE - An entity resolution system and alert analysis system configured to process inbound identity records and to generate alerts based on relevant identities, entities, conditions, activities, or events is disclosed. One process of resolving identity records and detecting relationships between entities may be performed using a pre-determined or configurable entity resolution rules. Further, the entity resolution system may include an alert analysis system configured to allow analysts to review and analyze alerts, entities, and identities, as well as provide comments or assign a disposition to alerts generated by the entity resolution system. Furthermore, the entity resolution system may be configured to handle duplicate alerts, i.e., one or more identical or near-identical alerts generated using the same entities and/or identities as well as assign a relevance score to the particular entities and identities included in the alert.10-29-2009
20090271393System and Method for Utilizing Organization-Level Technology Demand Information - A plurality of subtechnologies may be identified in which each of the plurality of subtechnologies is characterized by a common granularity level. The organization-level demand and/or expertise for each of the identified plurality of subtechnologies may also be identified. Thereafter, a corresponding plurality of subtechnology profiles may be generated, which include a corresponding organization-level demand and/or expertise, as well as other subtechnology attributes. In one embodiment, the generated subtechnology profiles may then be stored in a common technology database. The technology database may be searched based on user queries entered via the common graphical user interface. The subtechnology search results may be ranked based, at least in part, on a quantitative comparison of the subtechnology's general relevance to the user, with the internal relevance to the user's company or organization.10-29-2009
20090271392System and Method for Utilizing Technology Interconnectivities - A plurality of subtechnologies is identified in which each of the plurality of subtechnologies may be defined or characterized by a common granularity level. Thereafter, a plurality of subtechnology interconnectivities, relating to two or more of the identified plurality of subtechnologies, may correspondingly be identified. In one embodiment, a plurality of subtechnology profiles may be generated, wherein the subtechnology profiles include the subtechnology interconnectivities, as well as other subtechnology attributes. In one embodiment, the generated subtechnology profiles may then be stored in a common technology database, to which access may be provided using a common user interface. The technology database may be searched based on user queries entered via the common graphical user interface, as well as the identified subtechnology interconnectivities.10-29-2009
20090271391METHOD AND APPARATUS FOR RATING USER GENERATED CONTENT IN SEACH RESULTS - Generally, a method and apparatus provides for rating user generated content (UGC) with respect to search engine results. The method and apparatus includes recognizing a UGC data field collected from a web document located at a web location. The method and apparatus calculates: a document goodness factor for the web document; an author rank for an author of the UGC data field; and a location rank for the web location. The method and apparatus thereby generates a rating factor for the UGC field based on the document goodness factor, the author rank and the location rank. The method and apparatus also outputs a search result that includes the UGC data field positioned in the search results based on the rating factor.10-29-2009
20090271390PRODUCT SUGGESTIONS AND BYPASSING IRRELEVANT QUERY RESULTS - A computer system, computer media, and computer-implemented method for generating product suggestions and providing product information are provided. The computer system includes a relevance engine, a product database, and a graphical user interface to respond to user queries and to provide product details associated with one or more products included in the user queries. The relevance engine determines which products are similar to products included in the user queries. The graphical user interface displays product suggestions that refine the user queries without executing the query on the product database, where a subset of the product suggestions are linked to product details pages. User selection of any of the product suggestions within the subset directs the user to a product details page for a specific product and bypasses a listing of results having many products that match the refined user queries.10-29-2009
20090271389PREFERENCE JUDGEMENTS FOR RELEVANCE - The claimed subject matter provides a system that trains or evaluates ranking techniques by employing or obtaining relative preference judgments. The system can include mechanisms that retrieve a set of documents from a storage device, combine the set of documents with a query or judgment task received via an interface to form a comparative selection panel, and present the comparative selection panel for evaluation by an assessor. The system further requests the assessor to make a selection as to which document included in the set of documents and presented in the comparative selection panel most satisfies the query or judgment task, and thereafter produces a comparative assessment of the set of documents based on the selections elicited from the assessor and associated with the set of documents.10-29-2009
20090271388ANNOTATIONS OF THIRD PARTY CONTENT - The subject matter disclosed herein relates to creating a search query based on content and subject of a web page, for example. In one particular example, such a search query may be established by a selection of one or more keywords in a web page. Consequently, the search query may be affected by a determination of content and/or a subject of the web page.10-29-2009
20090271387Extraction Method of Interview Relation by Optimal Condition and Record Medium Recording Thereof - A method of selecting the most suitable partner for a date for the purpose of marriage and a recording medium storing the method are provided. A system for arranging the date comprises a member client management unit for managing registration of clients, an account unit for managing membership fees and usage fees, a security authorization unit for security authorization of a client, if necessary, a match calculating unit for performing calculations to select the most suitable partner, a statistical database for storing and recording various statistical data required for the calculations, an input/output unit for inputting and outputting statistical data and data required for operating a server 11 of a marriage consultancy, and the server 11 for providing information on marriage. The system enables the most suitable partner to be selected using a status index table indicating an indexed social and economic status, a physical attraction index table indicating an indexed appearance and physique, a home environment table indicating indexed wealth of parents, incomes, and educational backgrounds of siblings, and a database of registered member clients who have gotten married.10-29-2009
20090271386Iterative Search with Data Accumulation in a Cognitive Control Framework - Searching hypotheses for locations of objects in a playback image corresponding to a recorded image generated by a graphical user interface (GUI) of an application program may be accomplished by capturing the playback image, detecting at least one active object in the recorded image, searching subsets of hypotheses from the playback image for an object according to predetermined criteria, recalculating old actions for the object in the playback image by applying actions according to an execution scenario and loading a next set of data, when the object is found, and checking dynamic conditions.10-29-2009
20090077066METHOD OF BIBLIOGRAPHIC FIELD NORMALIZATION - A method of normalizing a bibliographic field of a structured field relational database is disclosed. The method includes weighting potential candidate records according to the value in the corresponding field in the records, together with other related fields in the candidate record and other related records in the database. Each of the candidate records is successively evaluated and compared against an acceptable threshold. If the weight exceeds the threshold, the candidate record is returned from the query. Otherwise, a new entry in the database is created. Optionally, before creating such a new entry, the highest weighted candidate record may be compared against a minimally acceptable threshold and if the weight exceeds such a lower threshold, the candidate is returned from the query.03-19-2009
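The threshold logic described in entry 20090077066 can be sketched as below. How the weights are computed is not given in the abstract, so the sketch takes already-weighted candidates as input; the function name, the tuple layout, and the `"match"`/`"new"` return tags are all hypothetical.

```python
from typing import List, Optional, Tuple

Candidate = Tuple[str, float]  # (record identifier, precomputed weight)


def normalize_field(candidates: List[Candidate],
                    accept_threshold: float,
                    minimal_threshold: Optional[float] = None
                    ) -> Tuple[str, Optional[str]]:
    """Return ("match", record) or ("new", None) following the abstract's flow."""
    # Evaluate candidates one after another against the acceptable threshold.
    for record, weight in candidates:
        if weight > accept_threshold:
            return ("match", record)
    # Optional fallback: accept the best candidate if it clears a lower bar.
    if minimal_threshold is not None and candidates:
        best_record, best_weight = max(candidates, key=lambda rw: rw[1])
        if best_weight > minimal_threshold:
            return ("match", best_record)
    # No acceptable candidate: the caller would create a new database entry.
    return ("new", None)


weighted = [("rec-1", 0.40), ("rec-2", 0.72), ("rec-3", 0.65)]
strict = normalize_field(weighted, accept_threshold=0.9)
relaxed = normalize_field(weighted, accept_threshold=0.9, minimal_threshold=0.7)
accepted = normalize_field(weighted, accept_threshold=0.6)
```

Note that the first pass returns the first candidate that clears the threshold rather than the best one, mirroring the "successively evaluated" wording; a variant that always picks the highest weight would be an equally reasonable design.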
20090077056CUSTOMIZATION OF SEARCH RESULTS - Methods and apparatus are described which enable the customization of search results. Various embodiments of the invention relate to machine-readable representations of configurations of one or more components of a search results page. The machine-readable representations are operable in conjunction with a search engine to present, in response to a search query, one or more search results in an interface in accordance with the corresponding configuration.03-19-2009
20090070319System and method for offering content on a mobile device for delivery to a second device - A system and method of matching content for a customer and offering the content to the customer on a mobile device and downloading the content on a customer device is disclosed. The matching of the content may include matching a customer with categories and the content with categories. The system may include sizing the matched content in portion to the size of the display of the mobile device.03-12-2009
20090030896INFERENCE SEARCH ENGINE - A system and method for processing an electronic query that includes defining a set of rules accessible to an inference engine and wherein the set of rules are configured for (a) parsing the query into one or more subsequent-queries; and (b) determining whether additional information is necessary to answer each subsequent-query and, if so, (i) accessing a registry containing descriptions of one or more data resources, (ii) analyzing the descriptions of the one or more data resources to locate information responsive to the subsequent-query; and then the system and method could apply the set of rules to the information, aggregate all of the information responsive to the query, and supply an answer to the query that is compliant with the set of rules.01-29-2009
20090030900Information processing apparatus, information processing method and computer readable information recording medium - An information processing apparatus uses a storing unit configured to store search conditions, search results obtained based on the search conditions and importance levels of the search results in association with each other. When an input search condition has been stored in the storing unit, a search result and an importance level are obtained from the storing unit with the use of the search condition as a key, and, when the input search condition has not been stored in the storing unit, a new search result is obtained based on the input search condition. On a display screen, the search results modified according to the importance levels, or the new search results, are displayed.01-29-2009
20090030899Processing a content item with regard to an event and a location - Associating a content item with an event is disclosed. A location associated with a received content item is determined. The received content item is associated with an event, at least in part based on an indicia of relatedness, other than the determined location, between the received content item and the event. A criterion that the indicia of relatedness is required to satisfy for the content item to be determined to be associated with the event has a lower value if the determined location associated with the received content item has a first degree of correspondence to a location associated with the event than if the determined location associated with the received content item has a second, lower degree of correspondence to the location associated with the event.01-29-2009
20090030898FILE SEARCH SYSTEM, FILE SEARCH DEVICE AND FILE SEARCH METHOD - A file search system includes: a file analyze device including: a character string identifying unit; a first calculating unit; a second calculating unit; an analyzing unit; an output unit; and a file search device including a first obtaining unit; a second obtaining unit; an index storage unit; a file identifying unit; a ranking unit; and a notifying unit.01-29-2009
20090030897Assissted Knowledge Discovery and Publication System and Method - A system and method is presented for knowledge discovery that incorporates both humans and computers to index, process, communicate and share knowledge and electronic contents. It also provides a platform for launching an unlimited number of qualified and content-reviewed publishing/broadcasting ventures. The system assists individuals in faster and more efficient discovery/creation of new and useful knowledge and valuable artistic content. It also provides incentives to the owners of the ventures and a method for rewarding or compensating all contributors.01-29-2009
20090030895Method And Apparatus For Detecting Predefined Signatures In Packet Payload - A method and apparatus for detecting predefined signatures in packet payload is disclosed. In one embodiment, a method of string matching in a network packet payload includes performing hash on a current search string received in the network packet payload to generate respective search string hash values, storing the search string hash values in a hash buffer, performing rehash using the search string hash values to generate an associated search string rehashed value, performing a parallel search of the search string rehashed value against Content Addressable Memory (CAM) entries to determine if the search string rehashed value matches with one of the CAM entries, and identifying the current search string in the network packet payload as a match with one of the CAM entries based on the outcome of performing the parallel search.01-29-2009
20090030893Query generation system for an information retrieval system - According to one embodiment of the disclosure, a query generation system generally includes an element rank and inference engine in communication with a computing system and a user interface. The element rank and inference engine is operable to receive a user supplied element from the user interface, the user supplied element being associated with a first filter criterion. The element rank and inference engine is also operable to create, using the first filter criterion, at least one second element and rank according to their relative importance, the at least one first element and the at least one second element according to their associated first filter criterion and second filter criterion. Next, the element rank and inference engine may output the at least one first filter element and the second filter element to the computing system.01-29-2009
20090030891METHOD AND APPARATUS FOR EXTRACTION OF TEXTUAL CONTENT FROM HYPERTEXT WEB DOCUMENTS - Textual content is extracted from hypertext documents by generating for each text document a pruned document model tree of merged text nodes by removing selected tag nodes from a document model tree of the text document, calculating for each merged text node of the pruned document model tree a set of text features which are compared with predetermined feature criteria to decide whether the merged text node is an informative merged text node, and assembling the informative merged text nodes to generate a text file containing the textual content.01-29-2009
20090030890BROADCAST RECEIVING APPARATUS AND CONTROL METHOD THEREOF - A broadcast receiving apparatus and a control method thereof are provided. According to the control method of the broadcast receiving apparatus, the broadcast receiving apparatus collects updated information from a network, and generates a display screen based on a comparison of the collected information with keywords or sentences input by a user. Therefore, a user may be informed whether desired information is registered on the network.01-29-2009
20090030889VIEWING OF FEEDS - Feed selections are automatically arranged into a single publication, and the publication is sent to print.01-29-2009
20090030888TECHNIQUES FOR SCORING AND COMPARING QUERY EXECUTION PLANS - Techniques for scoring and comparing query execution plans are provided. Predefined parameter types are identified in query execution plans and predefined weighted values are assigned to any identified parameters within the query execution plans. The weights are summed on a per-processing-step basis and the sum of the processing steps represents a total score for a particular query execution plan. The total scores or individual step scores from different query execution plans can then be compared or evaluated against one another for optimization and problem detection analysis.01-29-2009
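The scoring scheme described in entry 20090030888 can be pictured with a minimal Python sketch. The parameter type names and weight values below are hypothetical, chosen only for illustration; the abstract does not list any. Each step's score is the sum of the weights of the parameter types found in it, and the plan's total is the sum of its step scores.

```python
# Hypothetical parameter types and weights (not taken from the application).
WEIGHTS = {"full_table_scan": 10, "index_lookup": 1, "hash_join": 4, "sort": 3}


def score_step(step_params):
    """Sum the predefined weights of the recognised parameter types in one step."""
    return sum(WEIGHTS.get(param, 0) for param in step_params)


def score_plan(plan):
    """Return (total score, per-step scores) for a plan given as a list of steps."""
    step_scores = [score_step(step) for step in plan]
    return sum(step_scores), step_scores


# Two illustrative plans for the same query, each a list of processing steps.
plan_a = [["full_table_scan"], ["hash_join", "sort"]]
plan_b = [["index_lookup"], ["hash_join"]]

total_a, steps_a = score_plan(plan_a)
total_b, steps_b = score_plan(plan_b)
```

Comparing `total_a` with `total_b`, or the per-step lists element by element, is the kind of plan-to-plan evaluation the abstract mentions.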
20090259649SYSTEM AND METHOD FOR DETECTING TEMPLATES OF A WEBSITE USING HYPERLINK ANALYSIS - The present invention relates to methods, systems, and computer readable media comprising instructions for detecting templates within one or more web pages comprising a website. The method of the present invention comprises generating one or more groups of hyperlinks within a respective web page of the one or more web pages comprising the website. An in-link score is calculated for a given uniform resource locator associated with the one or more web pages comprising the website. The hyperlink groups in which the uniform resource locators associated with the one or more web pages comprising the website appear are identified. A template score is assigned to the identified hyperlinks groups on the basis of the in-link score associated with the uniform resource locators to which the hyperlinks comprising the hyperlink group correspond. The hyperlink groups with template scores exceeding a given template score threshold are thereafter identified as templates.10-15-2009
20090254550METHOD AND SYSTEM FOR OFFERING SEARCH RESULTS - A method of providing a search result and a system for executing the method are provided. A method of providing a search result includes: setting a grade of a category associated with a keyword based on click information; creating a category list to maintain the category list in association with the keyword, wherein the category list includes the category that is arranged according to the grade; and providing a search result for the keyword in an order of the category in the category list, wherein the click information includes information regarding whether the category is clicked on and a clicked order.10-08-2009
20090248667Learning Ranking Functions Incorporating Boosted Ranking In A Regression Framework For Information Retrieval And Ranking - Embodiments of the present invention provide for methods, systems and computer program products for learning ranking functions to determine the ranking of one or more content items that are responsive to a query. The present invention includes generating one or more training sets comprising one or more content item-query pairs, determining preference data for the one or more query-content item pairs of the one or more training sets and determining labeled data for the one or more query-content item pairs of the one or more training sets. A ranking function is determined based upon the preference data and the labeled data for the one or more content-item query pairs of the one or more training sets. The ranking function is then stored for application to query-content item pairs not contained in the one or more training sets.10-01-2009
20090240687Method of Processing a Collection of Document Sources - In a method of processing a collection of a number of document sources in a computer system to retrieve a number of relevant substrings from the document sources, each relevant substring has relevance with respect to at least one set of selection parameters. The method includes splitting each document source of the collection of document sources into a plurality of source substrings, whereby each source substring comprises at least two concepts, the plurality of source substrings including at least the relevant substrings. The source substrings are stored, and the relevant substrings are uniquely identified among the source substrings. Representations of the sets of relevant substrings may be displayed in a matrix. The sets of selection parameters may be augmented with further selection parameters derived from concepts from a predefined concept hierarchy.09-24-2009
20090240686Thread-based web browsing history - A method and system for cataloguing browsing activity into separate browsing threads. Each browsing thread is an archived set of links that were considered during a specific time-period. The user is also provided with the ability to add metadata to a browsing history. The invention permits the user to reload an archived thread and resume any browsing from the point at which the thread was paused or suspended. In addition, the method and system provides cataloguing of the user browsing activity into separate threads with the ability to add metadata to threads or the individual entries within a thread. The threads may also be archived by date and time and indexed by keyword such that saved threads may be located, resumed, reviewed, and amended at a later point in time, including by other users, if desired.09-24-2009
20090234848SYSTEM AND METHOD FOR RANKING SEARCH RESULTS - A computer-implemented system and method for automatically sorting a plurality of business listings (i.e., business members) returned from a search query based on an overall-relevance value previously assigned to each business member. Business members with corresponding higher-overall-relevance values are ranked as more relevant than business members with lower-overall-relevance values. The overall-relevance value is generated based on weighted scores corresponding to different parameters associated with the business member. These weighted scores are cumulatively combined to generate the overall-relevance value assigned to the business member, which is stored in a database.09-17-2009
20090234847INFORMATION RETRIEVAL APPARATUS, INFORMATIN RETRIEVAL SYSTEM, AND INFORMATION RETRIEVAL METHOD - Provided is an information retrieval method including: retrieving, by a computer, a name including input characters from a database for storing the name, an attribute word associated with the name, and a degree of relevance between the name and the attribute word; outputting the retrieved name as a candidate name; and extracting an attribute word associated with the candidate name, the extracting including: calculating a degree of independency indicating a degree of difference between the extracted attribute words, a degree of coverage indicating an extent to which the combination of the extracted attribute words covers the candidate names, and a degree of equality of a number of corresponding candidate names for each attribute word; and calculating a score of the combination of the attribute words based on at least one of the independency, the coverage and the equality to output the combinations of the attribute words to an output unit.09-17-2009
20090234846SYSTEM TO GENERATE AN AGGREGATE INTEREST INDICATION WITH RESPECT TO AN INFORMATION ITEM - A system is provided to establish a ranking for published data. The system may include ranking and monitoring components. A number of registrations of user interest in an instance of published data may be determined. A ranking for the instance of published data may be generated based on the number of registrations of user interest in the instance of published data. A user of the system may be enabled to activate a monitoring process to monitor activity pertaining to the instance of published data.09-17-2009
20090234845LAWFUL ACCESS; STORED DATA HANDOVER ENHANCED ARCHITECTURE - The present invention relates to methods in a telecommunication system to provide access to data received to a centralized storage medium from interfacing traffic nodes in the system. The centralized storage medium is part of a Mediation and Delivery Function which is associated with a Law Enforcement Monitoring facility. The method comprises the following steps: 09-17-2009
20090234844Systems and Methods for Extracting Application Relevant Data from Messages - Systems and methods are provided for extracting application relevant data from messages. In one embodiment, a system can comprise a message parser that parses messages and builds a message tree having one or more objects, one or more data type templates that define a given data type based on one or more data elements and a comparison engine that matches data elements in the one or more objects with data elements in the one or more data type templates. The comparison engine groups data elements in the one or more objects that match data elements in the one or more data templates as a specific data type corresponding to the associated data type template that is matched.09-17-2009
20090234843RELATIVE DOCUMENT REPRESENTING SYSTEM, RELATIVE DOCUMENT REPRESENTING METHOD, AND COMPUTER READABLE MEDIUM - A relative document representing system includes: a first storage; a receiving unit; a specifying unit; a calculating unit; and a representing unit.09-17-2009
20090234842IMAGE SEARCH USING FACE DETECTION - An image search method and system using face detection begins with receiving a query submitted by a user. Next, a query word in the query is searched for in an image resource using an image search engine to obtain an initial image collection. Any faces are detected in each image in the initial image collection which has been searched. A search for the query word in a text surrounding each image having the face in the initial image collection is performed. A determination is made whether the query word indicates at least one person's name in the surrounding text matching the query word. An image in the initial image collection is returned to a user in which the face is included and the query word in the surrounding text indicates the person's name.09-17-2009
20090234841Retrieving Method for Fixed Length Data - IP addresses included in a route table are segmented so as to be able to be retrieved all together, and are retrieved at a high rate. As means for retrieving the IP address, a pointer table 200, a secondary pointer table, a local table, and a route table are provided, and a table with a numerical value comparing function is also provided when the further segmentation is necessary. In the retrieval for the ACL table, a fixed length data table of fixed length data configured in the ACL table is generated, and the ACL table is retrieved by using a retrieving method for retrieving the route table. Such tables are provided with a table manager 600 as means for efficiently composing and managing the table, and managing to prevent the retrieving operation from being obstructed.09-17-2009
20090234839SMART SENSOR BASED ENVIRONMENT FOR OPTIMIZING A SELECTION OF MEAL PLANS - A computer implemented method, apparatus, and computer program product for selection of meal plans. In one embodiment, a set of prospective guests are identified from at least one of a set of sensors collecting historical attendance data and a calendaring application. A set of nutritional requirements is then identified for the set of prospective guests. Thereafter, a set of meal plans is selected on an availability of ingredients and the nutritional requirements of the set of prospective guests, wherein the availability of ingredients is determined by sensors from the set of sensors monitoring the ingredients.09-17-2009
20090234827CITIZENSHIP FRAUD TARGETING SYSTEM - Methods and systems for biographic scoring are disclosed. One system includes a biographic scoring module that applies one or more scoring rules to each of a plurality of biographic records to generate a score associated with each of the plurality of biographic records. The system also includes a result generation module that generates a report based on one or more scores generated by the biographic scoring module. The methods and systems disclosed can, in certain cases, reflect a likelihood that an individual associated with the biographic record is an unauthorized alien.09-17-2009
20090234826SYSTEMS AND METHODS FOR MANIPULATION OF INEXACT SEMI-STRUCTURED DATA - The data constraint framework solution of the present invention addresses data quality issues by standardizing, verifying, matching, consolidating and merging data records using powerful inexact matching logic and search reduction technologies. The data conditioning framework uses these technologies to more efficiently condition data to improve the quality of data and/or resolve quality data issues such as incomplete, inaccurate and duplicate data records. For example, the data conditioning framework is used to “cleanse” incorrect, incomplete and duplicate data from a data source, such as an information system. The data conditioning framework uses the following approximate searching and matching techniques to improve the efficiency of the approximate matching, reduce the search space for approximate matching, and improve the speed of executing approximate searches and matches: 1) inexact trimmed matching, 2) adaptive search ordering, 3) cascading search space reduction, 4) tiered and metric indexing, and 5) domain knowledge matching.09-17-2009
20090049029Method and system of detecting keyword whose input number is rapidly increased in real time - A method and a system of detecting a keyword whose input number is rapidly increased in real time which can estimate a search number at a future point in time by reflecting an input trend of the keyword in real time at a present point in time and can immediately detect the keyword whose input number is rapidly increased according to a criterion value calculated by the estimated search number. Specifically, the method and system of detecting a keyword whose input number is rapidly increased in real time which can estimate the search number for each keyword at the future point in time in real time and can immediately detect the keyword whose input number is rapidly increased according to a criterion value calculated by the estimated search number.02-19-2009
20090157667Reputation of an Author of Online Content - Methods, computer program products and systems are described for online-content management. Multiple online content items authored by multiple authors are received at one or more first computers for online publication. For each online content item, a reputation score is determined for the author of the online content item. The reputation score is based at least in part on one or more reviews of the online content item provided by one or more reviewers other than the author. In response to a query for online content received from a second computer, a set of search results is generated that includes an online content item from the multiple online content items. A ranking of the online content item in the set is determined based at least in part on the reputation score of the author.06-18-2009
20090024613CROSS-LINGUAL QUERY SUGGESTION - Cross-lingual query suggestions (CLQS) aims to suggest relevant queries in a target language for a given query in a source language. The cross-lingual query suggestion is improved by exploiting the query logs in the target language. The disclosed techniques include a method for learning and determining a similarity measure between two queries in different languages. The similarity measure is based on both translation information and monolingual similarity information, and in one embodiment uses both the query log itself and click-through information associated therewith. Monolingual and cross-lingual information such as word translation relations and word co-occurrence statistics may be used to estimate the cross-lingual query similarity with a discriminative model.01-22-2009
20090024610COMPUTER AIDED AUTHORING, ELECTRONIC DOCUMENT BROWSING, RETRIEVING, AND SUBSCRIBING AND PUBLISHING - Provides methods, apparatus, and systems for computer aided authoring. Included are: a method for browsing an electronic document, an apparatus for aided authoring, an electronic document browser, a method for retrieving an electronic document, a system for retrieving electronic documents, a method for subscribing and publishing an electronic document as well as a system for subscribing and publishing electronic documents. An example method for computer aided authoring includes: generating one or more topic summaries based on an electronic document while a writer is writing said electronic document, wherein the reliability of the topic summary is ensured by the writer; and saving said topic summary information in correspondence with said electronic document.01-22-2009
20090019026Clustering System and Method - An increase in information available to a user of computing technologies has a tendency to increase the number of topics that are similarly related. Given the large amount of information that is now available, it is increasingly likely that a first set of search results generated in response to an initial search query will contain information that is not of interest to the user. What is needed in the art is a technique to enable a search query to be conducted by taking advantage of linguistic feedback. Furthermore, what is needed is a technique to enable the presentation of search results to be refined in a manner based on what is not of interest to a user, either intrinsically or because the user has already seen and evaluated certain information and next wants to see more or different information.01-15-2009
20090144257METHOD OF OPERATING A SEARCH APPLICATION - A method of operating a search application, in which previous search queries have been received and stored, including receiving a current search query, searching for previous search queries including beginning portions thereof, mid-portions thereof and end-portions thereof which match a sequence of an input of the current search query in real-time, and displaying previous search queries found in the searching operation.06-04-2009
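A rough Python sketch of the matching described in entry 20090144257 follows. It assumes, for illustration only, that stored queries are plain strings and that matching is case-insensitive; the function name `match_previous` and the three bucket labels are hypothetical.

```python
def match_previous(current, history):
    """Group stored queries by where the current input occurs in them."""
    hits = {"beginning": [], "middle": [], "end": []}
    needle = current.lower()
    if not needle:
        return hits
    for query in history:
        text = query.lower()
        if text.startswith(needle):
            hits["beginning"].append(query)
        elif text.endswith(needle):
            hits["end"].append(query)
        elif needle in text:
            hits["middle"].append(query)
    return hits


history = ["patent search", "search patent", "best patent tools"]
suggestions = match_previous("patent", history)
```

Calling `match_previous` again on each keystroke would approximate the real-time behaviour the abstract describes; a production system would likely use an index rather than a linear scan.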
20080301112ENABLING SEARCHING OF USER RATINGS AND REVIEWS USING USER PROFILE LOCATION, AND SOCIAL NETWORKS - A system and method are directed towards a free-form search query of user reviews using user profile, location information, and/or social networks, to obtain a result having an associated universal aggregated rating. The user may enter in free-form a search query that may then be transparently modified using the user's profile, social network, and/or current physical location. The search results may then be presented to the user along with aggregated weighted ratings. The user may also enter products and/or services into a data store, including comments, and a universal rating. In one embodiment, the user may provide a tag to another reviewer's comments that may be useable to aggregate ratings. In one embodiment, the user's profile, location, and/or social networking information may be used to further annotate the user's inputs.12-04-2008
20080294634SYSTEM AND ARTICLE OF MANUFACTURE FOR SEARCHING DOCUMENTS FOR RANGES OF NUMERIC VALUES - Provided are a system and article of manufacture for searching documents for ranges of numeric values. Document identifiers for documents include at least one value that is a member of a set of values. A number of posting lists is generated, wherein each posting list is associated with a range of consecutive values within the set of values and includes document identifiers for documents including at least one value within the range of consecutive values associated with the posting list, and wherein each document identifier is associated with one value in the set of values included in the document identified by the document identifier. The generated posting lists are stored, wherein the posting lists are used to process a query on a range of values within the set of values. A query on a query range of values within the set of values is received and a determination is made of a minimum number of posting lists associated with consecutive values that together include the query range of values. The determined posting lists are merged to form a merged posting list including document identifiers of documents including values within the query range. The document identifiers in the merged posting list are returned.11-27-2008
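One way to picture the posting-list scheme in entry 20080294634 is the Python sketch below. It assumes, as an illustrative choice not stated in the abstract, that the ranges are dyadic blocks over integers in [0, 2**bits): a posting list keyed `(level, index)` covers the values from `index << level` to `((index + 1) << level) - 1`. Under that assumption a query range is covered greedily by the largest aligned blocks, and the matching posting lists are merged.

```python
from collections import defaultdict


def build_posting_lists(docs, bits):
    """Map each dyadic range (level, index) to the ids of documents holding a value in it."""
    lists = defaultdict(set)
    for doc_id, values in docs.items():
        for value in values:
            for level in range(bits + 1):
                lists[(level, value >> level)].add(doc_id)
    return lists


def cover(lo, hi, bits):
    """Cover [lo, hi] exactly, always taking the largest aligned block that fits."""
    blocks = []
    while lo <= hi:
        level = 0
        # Grow the block while it stays aligned at lo and does not pass hi.
        while (level < bits
               and lo % (1 << (level + 1)) == 0
               and lo + (1 << (level + 1)) - 1 <= hi):
            level += 1
        blocks.append((level, lo >> level))
        lo += 1 << level
    return blocks


def range_query(lists, lo, hi, bits):
    """Merge the posting lists of the covering blocks into one sorted id list."""
    merged = set()
    for key in cover(lo, hi, bits):
        merged |= lists.get(key, set())
    return sorted(merged)


docs = {"d1": [3], "d2": [8, 12], "d3": [5]}
lists = build_posting_lists(docs, bits=4)
```

The trade-off in this sketch is index size for query speed: each value is stored in `bits + 1` posting lists so that any range query touches only a few of them.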
20080294633COMPUTER-IMPLEMENTED METHOD, SYSTEM, AND PROGRAM PRODUCT FOR TRACKING CONTENT - A system, method, and program product for tracking content are described. Aspects of the invention allow bodies of content, whether from a common channel or from different channels, to be compared for relatedness. Comparison of different bodies of content involves analyzing the actual content, characteristics of the source(s) of the content, and optionally, elapsed time between their respective broadcasts/communications. To this extent, a content similarity value, a source characteristic value and an optional temporal value for the portions of content are determined, and then used to compute a relatedness value of the (bodies of) content.11-27-2008
20080294632Method and System for Sorting/Searching File and Record Media Therefor - A method, a system and a recorded medium for sorting and searching files are disclosed. A method of sorting and searching files has the steps of (a) outputting an annotation interface for an original file selected by a user, (b) receiving annotation details inputted through the annotation interface, (c) generating an annotation file in accordance with the annotation details, and (d) storing the annotation file. With the present invention, the efficient sorting and searching of files can be easily performed by using all kinds of fields stored in a user terminal.11-27-2008
20080294629PROCESS FOR FACILITATING A TELEPHONE-BASED SEARCH - The process for facilitating a telephone-based search includes accepting a telephone inquiry, receiving search criteria through the telephone inquiry and searching an electronic database for information relevant to the search criteria. A portion of the search criteria may be received from a software application installed on a telephone. Such search criteria may include a keyword or a category. Next, a search result relating to the search criteria is conveyed in response to the search inquiry. The search result should include a searchable category, a third party or a list of selectable third parties. Accordingly, the telephone inquiry is routed to a third party associated with the search result.11-27-2008
20080294627Board Recruiting - Methods, systems, and computer program products for recruiting candidates for a position on a board are described. In a computer system, a degree of matching between a profile of a candidate and a profile of the board is determined, and the candidate is introduced to the board after establishing a mutual interest between the candidate and the board based on the determined degree of match.11-27-2008
20080294626Method and apparatus for leveraged search and discovery - leveraging properties of trails and resources within - A method of automatically extracting knowledge from user generated trails. The method of this invention provides: 11-27-2008
20080294625Item recommendation system - To recommend an item which is highly unexpected to a user because its similarity to user preferences is low and which is useful to the user. A rule that modifies a set of keywords for recommending an item is randomly applied, and a keyword which a user does not prefer is added and a keyword which a user prefers is removed, and then this recommendation result is mixed with a recommendation result of a set of keywords before the above modification, and the mixed result is presented to a user and at the same time the application probability of a rule is learned on the basis of the user's evaluation to a recommended item.11-27-2008
20080294624Recommendation systems and methods using interest correlation - A search technology generates recommendations with minimal user data and participation, and provides better interpretation of user data, such as popularity, thus obtaining breadth and quality in recommendations. It is sensitive to the semantic content of natural language terms taken from user profiles at social networking and online dating applications and blogs. The profiles and blogs can include interests, eccentricities, age, gender, and location information associated with the user. The interest information can include music, movies, sports and personality traits. Based on the user's profile information, the system determines which ad from a stock of ads is best suited to a given profile and delivers that ad. The system can enable advertisers to create and manage online advertising campaigns using a campaign manager in which they attach descriptions to ads in their inventory, thereby generating a profile for each ad which is then compared to the profiles in the target online environment. A user interface can be provided to enable the user to fine-tune product and service recommendation results. The system can be used to match user profiles to provide mate-matching in an online dating environment.11-27-2008
20080294623APPARATUS AND METHOD FOR RECOVERING FINAL DISPLAY - An apparatus and method of recovering a final display are provided. The apparatus includes a query-string-creating module creating query strings in response to a cursor-request message, a query-string-controlling module creating a first cursor as a result of processing the query strings, and returning the created first cursor to the query-string-creating module, and a cursor-recovery module storing information about the first cursor and recovering information about a second cursor in response to the cursor-request message.11-27-2008
20080294618SYSTEM AND METHOD FOR ADVANCED HANDLING OF MULTIPLE FORM FIELDS BASED ON RECENT OPERATOR BEHAVIOR - A method, system and computer program product for enhancing the usability of web browsers by analyzing the recent behavior of an operator while executing a search pattern on a computer network. In particular, a browser enhancement utility provides web browsers with the ability to store (for a limited time period) search terms used in a variety of web search patterns. The browser enhancement utility employs ranking algorithms to identify the relationships between searches and a ranking and matching algorithm to utilize stored search terms to find (text) matches in a web document. When the browser displays web pages after a search has occurred, the browser enhancement utility utilizes these matches in order to take actions to enhance document usability. These actions include: Highlighting terms that have been recently searched for; pre-selecting matching terms from drop down boxes or radio buttons; and focusing a web page to relevant sections of text.11-27-2008
20080294617Probabilistic Recommendation System - A recommendations system uses probabilistic methods to select, from a candidate set of items, a set of items to recommend to a target user. Some embodiments of the methods effectively introduce noise into the recommendations process, causing the recommendations presented to the target user to vary in a controlled manner from one visit to the next. The methods may increase the likelihood that at least some of the items recommended over a sequence of visits will be useful to the target user. Some embodiments of the methods are stateless such that the system need not keep track of which items have been recommended to which users.11-27-2008
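The idea in entry 20080294617 of injecting controlled noise without tracking past recommendations can be sketched in Python as below. The Gaussian noise model, the `noise` parameter and the function name `recommend` are assumptions made for illustration; the abstract does not specify how the noise is introduced.

```python
import random


def recommend(candidates, k, noise=0.1, rng=None):
    """Pick k items after perturbing each candidate's score with random noise.

    candidates maps item -> base score. No per-user history is stored, so the
    function is stateless; variation between calls comes only from the noise.
    """
    rng = rng or random.Random()
    noisy = {item: score + rng.gauss(0, noise) for item, score in candidates.items()}
    return sorted(noisy, key=noisy.get, reverse=True)[:k]


candidates = {"a": 0.9, "b": 0.5, "c": 0.7}
picks = recommend(candidates, 2)
```

With `noise=0.0` the function reduces to a plain top-k ranking; larger values let lower-scored items surface more often across visits.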
20080294616SYSTEM AND METHOD FOR DATABASE SEARCHING USING FUZZY RULES - An apparatus and method for database searching using fuzzy rules is presented. The apparatus and method may accept a word or word phrase such as a person's name and return fuzzy rules for database searching. Applicable search rules are selected and word or word phrase equivalents are displayed to a user. The user accepts or rejects each of the word or word phrase equivalents. The word or word phrase equivalents along with the user's acceptance or rejection are stored in a sample database. The fuzzy rules are modified according to the data in the sample database. The database is filtered by training and testing portions of the database for accuracy and purging the least accurate portions.11-27-2008
20080294628ONTOLOGY-CONTENT-BASED FILTERING METHOD FOR PERSONALIZED NEWSPAPERS - The invention is an ontological-content-based method for filtering and ranking the relevancy of items. The filtering method of the invention utilizes a hierarchical ontology, which considers the distance, or similarity between concepts representing each user to concepts representing each item, according to the position of related concepts in the hierarchical ontology. Based on that, the filtering algorithm computes the similarity between the items and users and rank-orders the items according to their relevancy to each user. The method finds general use in the fields of information filtering and publishing, specifically the production of electronic newspapers for which the invention provides methods of filtering and ranking the relevance of news content to specific readers in order to allow production of personalized electronic newspapers.11-27-2008
20090119282DECISION SUPPORT SYSTEM WITH EMBEDDED CLINICAL GUIDELINES - A context-aware decision-support system automatically selects the clinical guideline pertaining to the patient's medical care and automatically deduces the current stage in the guideline (S05-07-2009
20090138466System and Method for Search - A method for associating graphical information and text information includes providing the graphical information, the graphical information comprising at least one identifier in the graphical information for identifying at least one portion of the graphical information. The method further includes providing the text information and associating the portion with the text information through a commonality between the identifier and the text information.05-28-2009
20090164442Interactive hybrid recommender system - A hybrid recommender system, in which the initial stereotype is manually defined by an expert and an affinity vector of stereotypes relating to each specific user who registers onto the system is created to define a specific profile for each user. Recommendations for a specific user are generated according to the initial stereotype and the affinity vector of stereotypes. Binary feedback from the user regarding specific items picked by the user is received (e.g., while of the item), which can be either positive or negative. Then the affinity vector of stereotypes is updated.06-25-2009
20090319510Systems and methods for document searching - Systems and methods are provided for document searching. In one implementation, a computer-implemented method provides keyword searching. The method may receive a plurality of noisy keywords for a document collection. A server may generate tokens for a plurality of keywords in the document collection and merge the tokens to create an index. A search query may be received. The search query may include at least one search phrase. For the at least one search phrase, an indication may be received from a user specifying to perform one of a noisy phrase search or a noiseless phrase search. The method may search the index for the at least one search phrase based on the indication received from the user.12-24-2009
20090282019Sentiment Extraction from Consumer Reviews for Providing Product Recommendations - A system and method for recommending a product to a user in response to a query for a product with a feature wherein the recommendation is accompanied by a quotation expressing a sentiment about the feature or the product.11-12-2009
20090125513SYSTEM FOR REMOTELY SEARCHING A LOCAL USER INDEX - A system is provided for enabling a user to search for documents that the user has previously viewed on its local machine. The system may include three main components: the desktop integration module, the index module, and the graphical user interface module. The desktop integration module is an application which monitors documents with which the user interacts for predetermined events, and obtains content data and metadata from the monitored documents. The index module indexes the content data and metadata received from the desktop integration module. The graphical user interface module then permits a user to utilize the desktop integration module and index module by allowing a user to search for a document.05-14-2009
20080275870Method and apparatus for constructing a compact similarity structure and for using the same in analyzing document relevance - A computer-readable medium comprises a data structure for providing information about levels of similarity between pairs of N documents. The data structure comprises a plurality of entries of similarity values representing levels of similarity for a plurality of pairs of the documents. Each of the similarity values represents a level of similarity of one document of a given pair relative to the other document of the given pair. The similarity value of each entry is greater than a threshold similarity value that is greater than zero. The plurality of similarity-value entries are fewer than N11-06-2008
20090248661IDENTIFYING RELEVANT INFORMATION SOURCES FROM USER ACTIVITY - A relevant information source identification technique that exploits a combination of searching and browsing activity of many users to identify relevant resources for future queries. The technique relies on such data to identify relevant information sources for new queries. In one embodiment, the technique is term-based: past queries are decomposed into individual (possibly overlapping) terms and phrases, and the most relevant documents are identified for each phrase from the browsing patterns of users that follow the query. Then, for a new query that consists of several terms or phrases, the most relevant destinations for each term/phrase are combined to produce overall predictions of the best or most relevant sources for the new query. This allows for providing predictions for previously unseen queries, which comprise a large proportion of the overall query volume.10-01-2009
20090106239Document Review System and Method - A system and method for reviewing electronic documents. The method may include the step of using a computing device to rate a document's relevancy to a concept. Depending on the document's relevancy rating, the document could be routed to either substantive review personnel or relevancy review personnel. If the relevancy rating indicates that the document is likely relevant to the concept, the document is routed to substantive review personnel for substantive analysis. If the relevancy rating indicates that the document is likely irrelevant to the concept, the document is routed to relevancy review personnel to confirm whether the document is irrelevant to the concept. If the relevancy review personnel determine that the document is likely relevant to the concept, the document is rerouted to the substantive review personnel for substantive analysis.04-23-2009
20090198670METHOD AND SYSTEM FOR COLLECTING AND ORGANIZING DATA CORRESPONDING TO AN EVENT - A system and method for analyzing data from a plurality of computer environments. The computer environments are authenticated and data is imported to a memory location. The data is converted into a uniform format to enable expedited searching by one or more authenticated users. The data may be marked so that a user may determine which computer environment provided the data. The system may also create one or more indexes of the data to assist one or more users in searching the data.08-06-2009
20090019035SYSTEM AND METHOD FOR TRANS-FACTOR RANKING OF SEARCH RESULTS - A system and method for trans-factor ranking of search results. Any number of attributes of items in a database being searched may be synthesized into a uniquely suitable ordering that brings relevant and authoritative results to the top of the list whenever such results exist in the database. The manner in which the attributes are used to create this ordering may be varied by users to tailor the ranking to their needs, and by database providers to take advantage of unique database contents. Attributes that are not in the database may be created dynamically and used to synthesize the order, based on the intersection between attributes that do exist in the database and attributes that are associated with the user or database manager. Attenuation and amplification factors are applied to attributes to control rankings. A multi-ranking interleaver determines a final ordering when more than one ranking strategy is used.01-15-2009
20090089272System and method for adaptive text recommendation - A network system provides a real-time adaptive recommendation set of documents with a high statistical measure of relevancy to the requestor device. The recommendation set is optimized based on analyzing the text of documents of the interest set, categorizing these documents into clusters, extracting keywords representing the themes or concepts of documents in the clusters, and filtering a population of eligible documents accessible to the system utilizing site and/or Internet-wide search engines. The system is either automatically or manually invoked and it develops and presents the recommendation set in real-time; for example, upon logging onto a web site or as the client views additional documents or pages of a website. The recommendation set may be presented as a greeting, notification, alert, HTML fragment, fax, voicemail, or automatic classification or routing of customer e-mail, personal e-mail, job postings, and offers for sale or exchange.04-02-2009
20090150380SYSTEM AND METHOD FOR PROCESSING SOCIAL RELATION ORIENTED SERVICE - A method for social relationship oriented service processing, which comprises steps of: providing relationship data, searching for at least one first level social member from a first level social network according to the relationship data, forwarding a list of the first level social members to a server, and searching by the server for at least one second level social member from a second level social network according to the list and the relationship data.06-11-2009
20090070315TABLE ELIMINATION OPTIMIZATIONS - Methods for transforming a query to remove redundant tables and eliminate superfluous join operations are provided. The methods provided transform queries to remove redundant tables and anti-joins, semi-joins, and outer-joins. Whether a table is redundant is determined based on a set of criteria which, if fulfilled, indicates that the removal of the table and the anti-join, semi-join, or outer-join operation does not impact query results. The removal of a redundant table from a query also results in the elimination of the anti-join, semi-join, or outer-join operation that references the removed table.03-12-2009
20090307207CREATION OF A MULTI-MEDIA PRESENTATION - A computer implemented method, computer system, and program storage device can be used for displaying images or videos simultaneously with a composition text that is read or sung. The displayed images or videos have been identified as related to selected words or phrases of the composition text and are displayed only when those selected words or phrases are read or sung in the accompanying audio playback. A number of techniques can be used to identify the appropriate images or videos for the selected words or phrases.12-10-2009
20080288484Distributed User Profile - A reasoning apparatus (11-20-2008
20090300006TECHNIQUES FOR COMPUTING SIMILARITY MEASUREMENTS BETWEEN SEGMENTS REPRESENTATIVE OF DOCUMENTS - Keyword frequency data for a plurality of document-derived segments is represented in a matrix form in which each segment is represented as a vector of dimensionality equal to the number of keywords. The matrix may be subdivided into a plurality of sub-matrices, each preferably corresponding to a non-overlapping portion of the plurality of keywords. When determining a similarity measurement between any pair of segments, at least a portion of the keyword frequency data for each sub-matrix's non-overlapping keywords is used to determine a sub-matrix dot product for the pair of segments. The resulting plurality of sub-matrix dot products are then summed together in order to provide the similarity measurement. Keywords that are synonyms of each other may be accommodated through the modification of keyword frequency data. Where the keyword frequency data in the matrix representation is relatively sparse, compressed views of the matrix representation may be provided.12-03-2009
20090292691System and Method for Building Multi-Concept Network Based on User's Web Usage Data - A system and method for building a multi-concept network based on web usage data that collect keywords used in a search site utilized by a plurality of users and web page information and build the multi-concept network for the keywords are provided. The method includes (a) collecting the keywords input by the users for searches in the site and the information on web pages read according to keyword search results; (b) for each keyword, selecting read web pages for each user; (c) for each keyword, setting each selected web page as one node, grouping the web page nodes for each user, connecting the web page nodes in a row, and arranging the web page nodes around the keyword; and (d) obtaining a similarity between two groups of the web page nodes arranged around the keyword, and integrating the two groups to form one group connected in a row when the similarity is above a predetermined standard value.11-26-2009
20090177643Geocoding Multi-Feature Addresses - A system and method of parsing natural language descriptions of features to determine an approximate location. An embodiment includes splitting the natural language descriptions into components, geocoding each component, and returning the geocode with the highest confidence level. The geocode references a specific location, and this information may be determined by content from a variety of sources. The system may use an assortment of techniques for determining highest confidence level.07-09-2009
20090019040PROCESSING CROSS-TABLE NON-BOOLEAN TERM CONDITIONS IN DATABASE QUERIES - Processing non-Boolean term conditions in database queries. A query that is a request for data in a database is received and includes at least one uneven non-Boolean term condition that spans multiple tables in the database. The non-Boolean term condition is split into separate portions, each of the portions providing a Boolean term that can be satisfied by accessing one table in the database. The separate portions are executed independently to find at least one data result in the database that satisfies the Boolean term of each separate portion, and the data result from each separate portion are combined into a final result that satisfies the query.01-15-2009
20100030765AUTOMATIC GENERATION OF ATTRIBUTION INFORMATION FOR RESEARCH DOCUMENTS - Systems and methods for providing source attribution for a document are provided. A source attribution generator includes a source determiner and an attribution information generator. The source determiner is configured to determine a source for a section of content received in an electronic document by accessing a network-based search index. The attribution information generator is configured to generate attribution information that indicates the determined source in the electronic document, and to provide the generated attribution information to be included in the electronic document.02-04-2010
20090248669METHOD AND SYSTEM FOR ORGANIZING INFORMATION - A system and method to process data having a module stored on the server computer system for receiving a query over a network from a client computer system. A search engine utilizes the query to extract a search result from a data source. A query decomposition module decomposes the query into at least one n-gram which is a subset of the query. A processing module processes the at least one n-gram to determine at least one related search suggestion. A merging module merges the at least one related search suggestion into a ranked output data set. A transmission module transmits the search result and the at least one related search suggestion from the server computer system to the client computer system.10-01-2009
20090144267Searching for Virtual World Objects - Systems and methods for searching for objects located in a virtual world include having a virtual construct such as a bot crawl the virtual world by moving from place to place. Object information is collected about the objects associated with the place and the object information is stored in a searchable database. Users can search the database for objects in the virtual world. The information can be further filtered or classified to aid in searching.06-04-2009
20090282029METHOD, A SYSTEM AND A COMPUTER PROGRAM PRODUCT FOR DETECTING A LOCAL PHENOMENON - A system for detecting a local phenomenon, the system includes an interface for receiving queries information from a system for retrieving art related media, and a processor, configured to: (a) create a first local popularity chart, wherein the creating of the first local popularity chart includes enumerating, for each geographic area of a group of sampled geographic areas, identical query strings of queries that are included in a group of queries; (b) create a first global popularity chart, wherein the creating of the first global popularity chart includes enumerating identical query strings of the queries that are included in the group of queries; and (c) select at least one query string in response to a scoring of the query string at the first local popularity chart and to a scoring of the query string at the first global popularity chart; wherein the group of queries includes queries which were queried during a first period of time.11-12-2009
20090282018METHOD TO IDENTIFY EXACT, NON-EXACT AND FURTHER NON-EXACT MATCHES TO PART NUMBERS IN AN ENTERPRISE DATABASE - A method of searching for customer part numbers stored in an enterprise database includes creating a set of discrete search strings from a set of supplier part numbers by which a search of the customer part numbers is performed and identifying any exact, non-exact and further non-exact matches between the discrete search strings and the customer part numbers from an output of the search.11-12-2009
20090037409METHOD AND SYSTEM FOR INFORMATION RETRIEVAL - Retrieving information from information sources using links. A set of information sources is preprocessed to extract content from text and existing links in the information sources according to some predetermined criteria. A set of search results is generated from amongst the preprocessed information sources in response to a received search query.02-05-2009
20090070317Patent claim and specification analysis - This invention relates to providing automated support for the analysis of patents, patent applications and other texts, and more specifically, for supporting the analysis of claims in view of specifications, and for supporting claim analysis as compared to published texts that might constitute possible prior art or infringement of the patent.03-12-2009
20090282013ALGORITHMICALLY GENERATED TOPIC PAGES - A method and system for generating a topic page for a search query on a search webpage includes receiving a query at the search webpage on a client. The query is transmitted from the search webpage on the client to a search engine on a server. A topic page generator available to the search engine analyzes the query to identify a plurality of dimensions. One or more content modules that match one or more of the dimensions are selected from a plurality of sources based on a weight associated with each of the content modules. The weight defines the ranking of a content module. The content modules for the plurality of dimensions are glued together and presented on the topic page in the order of the corresponding weight of the content modules. The order of presentation identifies the relevancy of the content modules to the query. The presented topic page provides the most relevant content modules for the query, and for a user located in a specific geo location.11-12-2009
20090276423FILE MANAGEMENT APPARATUS AND METHOD, AND STORAGE SYSTEM - The present invention provides a file management system and method, and a storage system that can prevent file-multiplexing in a storage apparatus, and efficiently use the storage capacity of the storage apparatus. The storage apparatus stores first management information for managing two or more kinds of classification list, each classification list including one or more keywords, and second management information for managing the kinds of classification list set for each of one or more users with regard to each of one or more tiers of their respective virtual file trees; and sends, in response to a request from a client apparatus to search the classification lists for a classification list set for a directory, the classification list set for the relevant user with regard to the tier matching the request to the client apparatus with reference to the first and second management information.11-05-2009
20090299993Candidate Recruiting - Methods and systems for candidate recruiting are described. Bio/demographic information and behavioral data is collected from candidates and processed to provide score signals. The score signals are transduced to an observable form and made available along with the data to employers and organizations for use in identifying candidates of interest for employment and other purposes. The candidates may be offered incentives for providing information to the service.12-03-2009
20080294621Recommendation systems and methods using interest correlation - A search technology generates recommendations with minimal user data and participation, and provides better interpretation of user data, such as popularity, thus obtaining breadth and quality in recommendations. It is sensitive to the semantic content of natural language terms and lets users briefly describe the intended recipient (i.e., interests, eccentricities, previously successful gifts). Based on that input, the recommendation software system and method determines the meaning of the entered terms and creatively discovers connections to gift recommendations from the vast array of possibilities. The user may then make a selection from these recommendations. The search/recommendation engine allows the user to find gifts through connections that are not limited to previously available information on the Internet. Thus, interests can be connected to buying behavior by relating terms to respective items.11-27-2008
20090019032Method and a system for semantic relation extraction - The invention provides a method for semantic relation extraction, wherein, on the basis of an annotated training corpus having tokens with associated relational labels each indicating a relation between the respective token and a selectable key entity, semantic relations between said key entity and other entities are directly extracted from unstructured text using a probabilistic extraction model.01-15-2009
20090177644SYSTEMS AND METHODS OF MAPPING ATTENTION - The disclosure describes systems and methods of ranking user interest in physical entities based on the attention given to those entities as determined by an analysis of communications from devices over multiple communication channels. The attention ranking systems allow any “Who, What, When, Where” entity to be defined and ranked based, at least in part, on information obtained from communications between users and user proxy devices. An entity rank is generated for each entity known to the system, in which the entity rank is derived from the information in communications that are indicative of user actions related to the entity. The entity ranks are then used to modify the display of information or data associated with the entities. The system may also generate a personal rank for each entity based on the relation of the entity to a specified user.07-09-2009
20090070326SEARCH SYSTEMS AND METHODS USING IN-LINE CONTEXTUAL QUERIES - Systems and methods are provided for implementing searches using contextual information associated with a Web page (or other document) that a user is viewing when a query is entered. The page includes a contextual search interface that has an associated context vector representing content of the page. When the user submits a search query via the contextual search interface, the query and the context vector are both provided to the query processor and used in responding to the query.03-12-2009
20100036837INFORMATION SEARCH METHOD AND INFORMATION SEARCH APPARATUS - An information search apparatus includes a receiving unit which receives a search request from a searcher's terminal; a search unit which searches and retrieves, from a database, candidate information that includes a content satisfying a search criteria set in the search request; a constriction unit which constricts the retrieved candidate information based on similarities between an attribute of an authorized person who is authorized to determine disclosure or non-disclosure of the candidate information included in the candidate information, and an attribute of the searcher who transmits the search request from the searcher's terminal; and a transmission unit which transmits disclosure requests for the constricted candidate information to an authorized person's terminal and transmits candidate information in which a response for the disclosure request received by the receiving unit is set to permit disclosure to the searcher's terminal.02-11-2010
20090198676Indexing Documents for Information Retrieval - Information retrieval systems such as web search systems locate documents amongst millions and even billions of possible documents on the basis of query terms. In order to achieve this document indexes are created. We propose creating new fields in the documents to store feedback information. This information comprises query terms used in a particular search as well as information about whether a particular document retrieved is given positive or negative feedback for example. Indexes are created on the basis of this feedback information in addition to other available information. As a result relevance of search results is improved. Multiple fields of information are available for given documents (such as abstract fields, title fields, anchor text fields as well as our feedback fields). Any search algorithm which deals with multiple fields as well as multiple query terms and which provides for differential weighting of document fields is used.08-06-2009
20090282016Systems and Methods for Building a Prediction Model to Predict a Degree of Relevance Between Digital Ads and a Search Query or Webpage Content - Systems and methods for building a prediction model to predict a degree of relevance between digital ads and a search query or webpage content are disclosed. Generally, an indication of relevance is received between a plurality of digital ads and one of a webpage content or a search query. A set of features is extracted from the plurality of digital ads and one of the webpage content or the search query. A prediction model is then built to predict a degree of relevance between the set of candidate digital ads and one of a second webpage content or a second search query, where the prediction model is built based at least on the received indication of relevance and the extracted set of features.11-12-2009
20090248664APPARATUS, SYSTEM, AND METHOD FOR IDENTIFYING TIME-BASED INFORMATION WITH HISTORICAL EVENTS - An apparatus, system, and method are disclosed for identifying time-based information. A detection module detects time-based information. A selection module monitors events in an information stream. The information stream is of interest to a target user. In addition, the information stream is not related by content to the time-based information. The selection module further selects a first event with temporal relation to the time-based information. An association module associates the first event and the time-based information.10-01-2009
20090287697AGENT RANK - The present invention provides methods and apparatus, including computer program products, implementing techniques for searching and ranking linked information sources. The techniques include receiving multiple content items from a corpus of content items; receiving digital signatures each made by one of multiple agents, each digital signature associating one of the agents with one or more of the content items; and assigning a score to a first agent of the multiple agents, wherein the score is based upon the content items associated with the first agent by the digital signatures.11-19-2009
20090287683Network server employing client favorites information and profiling - An Internet infrastructure that supports searching of web links wherein a user profile is used to reorder search results in a search result list for improved searching. The Internet infrastructure consists of a plurality of client devices with web browsers that are incorporated with user-profiling modules and a search engine server. The process of searching and reordering includes the search engine server receiving a search string along with a user profile from the user-profiling module (or retrieving the user profile from a database). Then, the search engine server stores the user profile in a database that is associated with the search engine server and delivers search results based upon the search string, and reorders the search results based upon stored data in the database.11-19-2009
20090222441System, Method and Computer Program Product for Performing Unstructured Information Management and Automatic Text Analysis, Including a Search Operator Functioning as a Weighted And (WAND) - Disclosed is a system architecture, components and a searching technique for an Unstructured Information Management System (UIMS). The UIMS may be provided as middleware for the effective management and interchange of unstructured information over a wide array of information sources. The architecture generally includes a search engine, data storage, analysis engines containing pipelined document annotators and various adapters. The searching technique is a two-level technique. A search query includes a search operator containing a plurality of search sub-expressions each having an associated weight value. The search engine returns a document or documents having a weight value sum that exceeds a threshold weight value sum. The search operator is implemented as a Boolean predicate that functions as a Weighted AND (WAND).09-03-2009
20090193020INFORMATION RETRIEVAL METHOD, INFORMATION RETRIEVAL APPARATUS, AND COMPUTER PRODUCT - An information retrieval apparatus includes an acquiring unit that acquires a numerical value defining a boundary of a numerical range; a detecting unit that detects a number of places in and a head numeral of the numerical value; an extracting unit that extracts from a bit string group, a bit string indicating whether a numerical value in a numerical value group having the number of places and the head numeral is present in files subject to retrieval; a specifying unit that specifies a file corresponding to a bit in the extracted bit string, the bit indicating the presence of a numerical value of the numerical value group; a determining unit that determines whether a numerical value in the specified file meets the boundary condition; and a designating unit that, based on a determination by the determining unit, designates the specified file to have a numerical value within the numerical range.07-30-2009
20090299999SEMANTIC EVENT DETECTION USING CROSS-DOMAIN KNOWLEDGE - A method for facilitating semantic event classification of a group of image records related to an event. The method uses an event detector system for: extracting a plurality of visual features from each of the image records, wherein extracting the visual features includes segmenting an image record into a number of regions, in which the visual features are extracted; generating a plurality of concept scores for each of the image records using the visual features, wherein each concept score corresponds to a visual concept and each concept score is indicative of a probability that the image record includes the visual concept; generating a feature vector corresponding to the event based on the concept scores of the image records; and supplying the feature vector to an event classifier that identifies at least one semantic event classifier that corresponds to the event.12-03-2009
20090177645ADAPTING A CONTEXT-INDEPENDENT RELEVANCE FUNCTION FOR IDENTIFYING RELEVANT SEARCH RESULTS - Techniques for predicting user interests based on information known about a specific context are provided. A context-independent (CI) relevance function is generated from information gathered from many users and/or from many documents (or files). Information about a specific context (e.g., a particular user, a particular group of users, or type of content) is used to adapt the CI relevance function to the specific context. Based on a query submitted by a user, the adapted relevance function is used to identify results that the user would most likely be interested in. Results may include references to webpages and advertisements.07-09-2009
20090177650METHOD, COMMUNICATION SYSTEM AND COLLECTION CONTROLLER ALLOWING THIRD PARTY INFLUENCE ON THE PROVISION OF A SERVICE TO A USER STATION - A content provider provides a content provider and service identification to a collection controller. The collection controller retrieves content provider and service specific service provision characteristics from a user subscription database and sets these service provision characteristics as a filter in a service provision control device to be used in the provision of a service from the content provider to the user equipment. Thus, the content provider, through the retrieved content server & service related characteristics from the user subscriber database, can influence the charging and transmission policies used by service provision control device for providing the service.07-09-2009
20090222444QUERY DISAMBIGUATION - A search query is resolved prior to being submitted to one or more search engines. The query is resolved such that the query unambiguously corresponds to a category included in a query ontology that relates search queries to query categories. The query may be resolved by supplementing the query with additional information corresponding to the category. For example, the query may be formatted into a canonical form of the query for the category. Alternatively or additionally, the query may be supplemented with one or more keywords that are associated with the category and that represent words or phrases that appear in a high percentage of search results for queries from the category. Resolving the query yields search results that more closely reflect search results desired by a user submitting the query.09-03-2009
20090089277System and method for semantic search - Systems and methods for semantic search are provided. A corpus of information grouped into passages are indexed by semantic key terms generated from packed knowledge representations that document the semantic relationships of information within those passages. When a search is conducted, a query is similarly transformed into a packed knowledge representation that documents the semantic relationships from which semantic key terms are also generated. An inverted index relating the semantic key terms associated to the passages is searched using the semantic key terms generated from the query. A set of candidate passages is selected and refined by analysis of the semantic key terms and other information. The semantic representations associated with the set of candidate passages are then matched to the semantic representation of the query to determine a search result set.04-02-2009
20090083257Method and subsystem for information acquisition and aggregation to facilitate ontology and language-model generation within a content-search-service system - Various embodiments of the present invention include information-aggregation-and-classification components of content-search-service systems which acquire information from information sources, aggregate and normalize the acquired information, and classify the acquired information prior to storing the normalized and classified information for use by language-model-builder components and ontology-builder components of the content-search-service systems. Additional embodiments of the present invention include the ontology-builder components, which builds ontologies from the normalized and classified information for specific dates, date/times, date ranges, or date/time ranges and for specific categories.03-26-2009
20090204605Semantic Search Via Role Labeling - A method and system for searching for information contained in a database of documents each includes an offline part and an online part. The offline part includes predicting, in a first computer process, semantic data for sentences of the documents contained in the database and storing this data in a database. The online part includes querying the database for information with a semantically-sensitive query, predicting, in a real time computer process, semantic data for the query, and determining, in a second computer process, a matching score against all the documents in the database, which incorporates the semantic data for the sentences and the query.08-13-2009
20090187557ARRANGING SEARCH ENGINE RESULTS - Search engine results arranged according to one or more first criteria (e.g., relevancy) are obtained. The results are assigned groups within chosen or calculated relevancy ranges. The results are then resorted within each group according to one or more second criteria (e.g., payment). The groups maintain original placement relative to each other during resorting. A list of at least some of the resorted results is then created for various uses, including search or further manipulation.07-23-2009
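The grouping-then-resorting scheme in the entry above can be illustrated with a small sketch. This is only one possible reading of the abstract, not the patented method: the tuple layout `(doc_id, relevancy, payment)`, the function name `arrange`, and the descending cut-off list `boundaries` are all assumptions made for illustration.

```python
from typing import List, Tuple

# Hypothetical result record: (document id, relevancy score, payment amount).
Result = Tuple[str, float, float]


def arrange(results: List[Result], boundaries: List[float]) -> List[Result]:
    """Group results into relevancy ranges, then resort inside each group.

    `boundaries` holds descending relevancy cut-offs (an assumption);
    [0.8, 0.5] yields three groups: >= 0.8, [0.5, 0.8), and < 0.5.
    """
    groups: List[List[Result]] = [[] for _ in range(len(boundaries) + 1)]
    for result in results:
        relevancy = result[1]
        # First cut-off the score reaches decides the group; otherwise last group.
        index = next(
            (i for i, cutoff in enumerate(boundaries) if relevancy >= cutoff),
            len(boundaries),
        )
        groups[index].append(result)

    arranged: List[Result] = []
    for group in groups:
        # Second criterion (payment here) reorders results only within a group,
        # so the groups keep their original placement relative to each other.
        arranged.extend(sorted(group, key=lambda r: r[2], reverse=True))
    return arranged


results = [
    ("a", 0.95, 1.0),
    ("b", 0.90, 5.0),
    ("c", 0.60, 9.0),
    ("d", 0.55, 2.0),
    ("e", 0.10, 7.0),
]
ordered_ids = [doc_id for doc_id, _, _ in arrange(results, [0.8, 0.5])]
```

In this sketch a highly paid but weakly relevant result such as "e" can never overtake a result from a higher relevancy group, which is the property the abstract emphasizes.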
20090177655CATEGORY SEARCHING - Performing a category search to identify categories of web sites that relate to a search term includes receiving at least one search term that then is compared with a hierarchy of category identifiers, and with terms related to one or more categories, to determine whether matches exist. A category identifier is selected based on the matches that are determined to exist within the hierarchy and the terms, and at least the category identifier is displayed. Performing a search to identify web sites and categories of web sites that relate to a search term also may include receiving at least one search term that then is compared with a list of recommended web sites, previously performed searches, a hierarchy of category identifiers, and terms related to one or more categories to determine whether matches exist. Results based on matches that are determined to exist are displayed.07-09-2009
20090319517QUERY IDENTIFICATION AND ASSOCIATION - Apparatus, systems and methods for predictive query identification for advertisements are disclosed. Candidate queries are identified from queries stored in a query log. Relevancy scores for a plurality of web documents are generated, each relevancy score associated with a corresponding web document and being a measure of the relevance of the candidate query to the web document. A web document having an associated relevancy score that exceeds a relevancy threshold is selected. The selected web document is associated with the candidate query.12-24-2009
20090083250PROBABILISTIC SEARCH AND RETRIEVAL OF WORK ORDER EQUIPMENT PARTS LIST DATA BASED ON IDENTIFIED FAILURE TRACKING ATTRIBUTES - This disclosure describes, generally, methods and systems for creating dynamic subsets of larger equipment parts lists (EPLs). For example, a method may include receiving a search request that includes an associated failure code and a target asset. The method might further include providing an EPL for the asset type, and retrieving sub-lists of the EPL based on previous search requests which are associated with the failure code for the asset type. The method may further predict which one of the plurality of sub-lists has the highest probability of being associated with the failure code for the asset type and might present the predicted sub-list of the EPL to a user.03-26-2009
20090043751GRAPHICAL USER INTERFACE FOR DATA MANAGEMENT - The exemplary embodiments provide a computer implemented method, apparatus, and computer usable program code for managing data. A user interface is generated. A user makes a selection at the user interface of at least one data type of a set of data types to be measured for relevancy. The set of data types comprise an age of data, modification of data, and access of data. The user also selects, at the user interface a granularity of the at least one data type to be measured. Data is collected from multiple sources. The collected data is analyzed to determine a relevance for the data type selected by the user based on the granularity selected by the user, which forms a result. The result is displayed to the user, by the user interface. The result includes a visual representation of the relevance of the data type selected by the user based on the granularity selected by the user.02-12-2009
20090150386SYSTEMS AND METHODS FOR LINKING AND COMMUNICATIONS BETWEEN EMPLOYERS AND EMPLOYEES - Systems and methods for linking and communications between employers and employees are described. In one aspect, a system and method for setting up an interview with an employee through a web-based interface is described. Employee information is stored in a database that can be searched by employees who have registered to use the system. A web-based search engine provides search results to the employer based upon the specific criteria inputted by the employer. The employer is provided with virtual tokens through an account maintained by the system that are used as payment to the system if an employer contacts an employee for an interview. The employer requests an interview with the employee by sending a request for interview through a communication system. The employer accepts the request and is provided with the employee's contact information so that an interview may be initiated. After an interview, the employer provides an offer to hire to the employee through a communication system. The invention further comprises methods and systems of tracking advertising information of a business by recording the number of persons who view website-based advertising materials or information contained in electronic newsletters and relaying this information back to the business.06-11-2009
20090150381METHODS AND APPARATUS FOR COMPUTING GRAPH SIMILARITY VIA SIGNATURE SIMILARITY - This disclosure describes systems and methods for identifying and correcting anomalies in web graphs. A web graph is transformed into a set of weighted features. The set of weighted features are then transformed into a signature via a SimHash algorithm. The signature is compared to the signature of one or more other web graphs in order to determine similarity between web graphs. Actions are then carried out to remove anomalous web graphs and modify parameters governing web mapping in order to decrease the likelihood of future anomalous web graphs being built.06-11-2009
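The signature step in the entry above names the SimHash algorithm. A minimal sketch of SimHash over a set of weighted features follows; the choice of BLAKE2b as the per-feature hash, the 64-bit signature width, the feature naming, and the `similarity` helper based on Hamming distance are assumptions for illustration and are not taken from the abstract.

```python
import hashlib
from typing import Dict


def simhash(weighted_features: Dict[str, float], bits: int = 64) -> int:
    """Fold weighted features into a single `bits`-wide signature."""
    totals = [0.0] * bits
    for feature, weight in weighted_features.items():
        digest = hashlib.blake2b(
            feature.encode("utf-8"), digest_size=bits // 8
        ).digest()
        feature_hash = int.from_bytes(digest, "big")
        for i in range(bits):
            # A set bit votes +weight for that position, a clear bit votes -weight.
            totals[i] += weight if (feature_hash >> i) & 1 else -weight

    signature = 0
    for i, total in enumerate(totals):
        if total > 0:
            signature |= 1 << i
    return signature


def similarity(sig_a: int, sig_b: int, bits: int = 64) -> float:
    """Fraction of signature bits that agree (1.0 means identical signatures)."""
    differing = bin(sig_a ^ sig_b).count("1")
    return 1.0 - differing / bits


# Hypothetical features: a host node and a host-to-host link, with weights.
graph_monday = {"node:example.org": 3.0, "edge:example.org->example.com": 1.0}
graph_tuesday = dict(graph_monday)
score = similarity(simhash(graph_monday), simhash(graph_tuesday))
```

Two web graphs with the same weighted features produce the same signature, while graphs that differ in many heavily weighted features tend to differ in many bits; a threshold on `similarity` could then flag an anomalous graph.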
20090150377Method and system for merging extensible data into a database using globally unique identifiers - A method of merging data from one database into another database uses metadata identifiers to indicate the type of data. One of the databases can be stored on a medical device, and the other of the databases can be stored on a computer. When transferring data from the first database to the second database, the metadata identifiers are used to identify and merge common data types.06-11-2009
20090100048Mixed Media Reality Retrieval of Differentially-weighted Links - An MMR system for publishing comprises a plurality of mobile devices, an MMR gateway, an MMR matching unit and an MMR publisher integrated into a network with an advertiser, an ad broker, and an MMR service bureau. The MMR matching unit receives an image query from the MMR gateway and sends it to one or more of the recognition units to identify a result including a document, the page and the location on the page. The MMR service bureau uses the result to retrieve advertising or other links associated with the location on the page. The list of results and links is sent back to the MMR gateway for presentation on the mobile device. The present invention also includes a number of novel methods including differentially weighting links associated with an MMR document.04-16-2009
20090049033METHOD OF USER-GENERATED, CONTENT-BASED WEB-DOCUMENT RANKING USING CLIENT-BASED RANKING MODULE AND SYSTEMATIC SCORE CALCULATION - The present invention allows a user to rank web-documents that he or she accesses with a web-browser. Ranking of web-documents is facilitated by a client-based ranking module, a software program functionally compatible with a user's web-browser. A user sends voting information, together with an identification number unique to each version of the ranking module and the URL of the currently active web-document, to the Modelane ranker system for processing. While voting for the content quality of a web-document, a user is limited to only three options: positive, negative, and zero. Scores for web-documents are calculated in such a way as to give each vote an equal opportunity to affect a score. The method of score calculation is designed to separate the scores for each individual web-document as much as possible. The method allows systematic comparison of web-documents based on popular opinion of their contents and precise ordering of web-documents on a linear scale.02-19-2009
20090144272RATING RATERS - A computer-implemented method includes identifying a plurality of ratings on a plurality of items, wherein the plurality of ratings are made by a first user, determining one or more differences between the plurality of ratings, and ratings by other users associated with the items, and generating a quality score for the first user using the one or more differences.06-04-2009
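As a rough illustration of the entry above, the sketch below scores a rater by how far that rater's ratings sit from the average of other users' ratings on the same items. The abstract does not say how the differences are combined, so the mean absolute difference and the `1 / (1 + mean difference)` mapping are assumptions, as are the function name `rater_quality` and the nested-dictionary layout.

```python
from typing import Dict, Optional

# Hypothetical layout: item -> {user -> rating}.
Ratings = Dict[str, Dict[str, float]]


def rater_quality(user: str, ratings: Ratings) -> Optional[float]:
    """Return a score in (0, 1]; higher means closer agreement with other raters."""
    differences = []
    for by_user in ratings.values():
        if user not in by_user:
            continue
        others = [rating for name, rating in by_user.items() if name != user]
        if not others:
            continue  # nobody else rated this item, so there is nothing to compare
        differences.append(abs(by_user[user] - sum(others) / len(others)))
    if not differences:
        return None  # no comparable items for this user
    mean_difference = sum(differences) / len(differences)
    return 1.0 / (1.0 + mean_difference)


ratings = {
    "item1": {"a": 4, "b": 4, "c": 4},
    "item2": {"a": 2, "b": 5, "c": 5},
}
quality_a = rater_quality("a", ratings)
quality_b = rater_quality("b", ratings)
```

Here user "a" disagrees with the others by 3 points on one of two items, so "a" receives a lower score than "b".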
20100057731INFORMATION PROCESSING APPARATUS, INFORMATION PROCESSING METHOD, INFORMATION PROCESSING PROGRAM, REPRODUCTION DEVICE, AND INFORMATION PROCESSING SYSTEM - An information processing apparatus includes a communication unit configured to receive, from a reproduction device that reproduces content recorded on a loaded recording medium, content forming information regarding the structure of the content; a degree-of-similarity calculation unit configured to compare the content forming information received by the communication unit with content forming information registered in advance for each package of an information recording medium in a predetermined database and calculate a degree of similarity between the content forming information received by the communication unit and the content forming information registered in advance in the database; and a package determination unit configured to determine the package of the information recording medium loaded into the reproduction device by using the degree of similarity calculated by the degree-of-similarity calculation unit.03-04-2010
20090150385WEB PAGE PERFORMANCE SCORING - A browser-based tool is provided that loads a Webpage, accesses the document object model (DOM) of the page, collects information about the page structure and parses the page, determines through the use of heuristics such factors as how much text is found on the page and the like, produces a statistical breakdown of the page, and calculates a score based on performance of the page. Key to the operation of the invention is the ability to observe operation of the Webpage as it actually loads in real time, scoring the page for several of various performance factors, and producing a combined score for the various factors.06-11-2009
20090150392SYSTEM AND METHOD FOR PROVIDING A RESPONSE TO A SEARCH QUERY - A system receives a request to search an electronic catalog of a vendor which specifies a query term. The query term is used to search an electronic catalog of a third party. The results obtained by searching the electronic catalog of the third party are parsed to uncover a keyword recognized by a search engine associated with the electronic catalog of the vendor. The uncovered keyword is then used in the search engine associated with the electronic catalog of the vendor to locate one or more items in the electronic catalog of the vendor. Items located in this manner are the search results responsive to the query term.06-11-2009
20090150391RARE PATTERN EXTRACTING DEVICE AND RARE PATTERN EXTRACTING METHOD - A rare pattern that may be difficult to extract is extracted by extracting data where a rare pattern is likely to exist, and then generating the rare pattern using a degree of influence.06-11-2009
20090150389On-line geographical directory - A method is provided for categorising businesses, organizations and individuals in order to facilitate geographically-based searching over the Internet. The method includes entering in a database the names of businesses, organizations and/or individuals, for each name entry registering a geographical location identifier, the geographical location identifier indicating the precise geographical location at which the corresponding business, organization or individual is located, for each name entry registering further information such as contact details and a description of the goods or services offered by the business, organization or individual, and for at least some of the name entries adding credential information in respect of the business, organization or individual, or the goods or services offered by the business, organization or individual. A user interrogates the database by nominating a catchment area by reference to one or more geographical points to identify name entries within the catchment area nominated by the user.06-11-2009
20090150387GUIDED RESEARCH TOOL - A guided research tool and related method of researching for resources related to a predetermined research topic. The invention provides the benefits of an always available, automated research librarian combined with an expert in the research topic, wherein the tool will interactively provide expert assistance and advice regarding selection of an internet search string that is optimized to conduct a real time search of the latest available resources on the internet, and then to provide a limited number of search result links to current resources that are most relevant to the research topic, or to an important subtopic thereof.06-11-2009
20090150384SEARCHING A DATABASE - A method of searching a database is disclosed, in which the database comprises a plurality of components and respective descriptions, such as a UDDI database of web services and associated descriptions. The method includes transmitting a query to the database, receiving a response from the database, the response comprising a plurality of components, accessing one or more service requirements relating to the transmitted query, matching the service requirements to the respective descriptions of the plurality of components of the response, and ranking the components in the response according to an output associated with the matching.06-11-2009
20090150383Inquiry-Oriented User Input Apparatus And Method - User input from a reduced keypad is disambiguated and compared with a first dynamic lexicon, and predicted matches (e.g. either a single word or phrase) are offered. If a user continues to type beyond a boundary condition, then input is no longer predicted from the first lexicon, but instead is interpreted as a request for matches from a second, quasi-static lexicon allowing words or phrases to be entered. When the entry is accepted, data is transmitted to a remote receiver and may be parsed as an inquiry for subsequent operation. Following acceptance, the apparatus invokes a program suitable for interacting with the response generated to the inquiry.06-11-2009
20090150382Tailored intergenerational historic snapshots - A tailored intergenerational historic snapshot message informs a younger person about the world an older person lived in when they were young. The older person's age and the younger person's age are used to identify a historic time period in which the older person was the same age as the younger person. A circumstance which occurred in the historic time period is selected from a database or web search result. The message is tailored to the ages of the people involved. The message may also be tailored to recite circumstances specific to a topic area or a geographic location.06-11-2009
20090150378Method for estimating a prestige of an entity - A method and an apparatus for estimating a prestige of an entity, as for example a firm, company or name, is disclosed wherein a score value is assigned to an entity as a function of an occurrence of terms associated with said entity in search results. The search results are obtained by searching an information space such as the internet. This enables, for example, companies or divisions, to infer their public standing from an analysis of search results obtained through internet search engines. It is possible to compare a plurality of entities with respect to each other in an automated fashion.06-11-2009
20090150375Detecting zero-result search queries - A processing device and method may be provided for determining whether a zero search result may be produced with respect to a search for a document including all words of a word group. An index, with respect to words included in a group of documents, may be searched for documents including all words of the word group when a zero search result is determined not likely to occur with respect to the search for the document including all of the words of the word group. A method for creating multiple types of data structures corresponding to word grouping collections may further be provided to store occurrence information indicating a likelihood of a presence of a document including all words of a word group.06-11-2009
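One way to picture the pre-check in the entry above is a table of word pairs that co-occur in at least one document: if any pair of query words never co-occurs, no document can contain all of the words, and the full index search can be skipped. The abstract does not specify the data structures, so the exact pair table, the function names, and the toy corpus below are assumptions for illustration only.

```python
from itertools import combinations
from typing import Dict, Iterable, List, Set, Tuple


def build_pair_table(docs: Dict[int, List[str]]) -> Set[Tuple[str, str]]:
    """Record every (sorted) pair of distinct words that shares a document."""
    pairs: Set[Tuple[str, str]] = set()
    for words in docs.values():
        pairs.update(combinations(sorted(set(words)), 2))
    return pairs


def zero_result_expected(
    query_words: Iterable[str],
    pairs: Set[Tuple[str, str]],
    vocabulary: Set[str],
) -> bool:
    """True when no document can contain all query words.

    False only means a match is still possible; the real index must then
    be searched, because pairwise co-occurrence does not guarantee that a
    single document holds every word.
    """
    words = sorted(set(query_words))
    if any(word not in vocabulary for word in words):
        return True
    return any(pair not in pairs for pair in combinations(words, 2))


docs = {1: ["red", "apple"], 2: ["green", "pear"], 3: ["red", "pear"]}
vocabulary = {word for words in docs.values() for word in words}
pair_table = build_pair_table(docs)
skip_search = zero_result_expected(["red", "green"], pair_table, vocabulary)
```

A production system would more likely use a compact probabilistic structure than an exact set of pairs; the exact set is used here only to keep the sketch short.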
20090150373SYSTEM AND METHOD FOR SYNCHRONIZING DATA ON A NETWORK - The disclosure describes systems and methods for synchronizing data on a network based on temporal, spatial, social and logical data available to the network. The method includes receiving a first information object (IO) containing attributes for a first real-world entity (RWE), the first IO associated with a second RWE; identifying one or more second IOs, each second IO containing one or more attributes for the first RWE and each second IO independently associated with a third RWE; generating a different probability for each IO based on a comparison of contents of the first and second IOs and their associated RWEs; and replacing one or more of the attributes in at least one IO with at least one attribute from a different IO based on the probabilities for each IO.06-11-2009
20090150370System and Method For Restricted Party Screening and Resolution Services - A system and method for screening data for restricted party screening. The system comprises an input for entering data, a screening system for screening the data against a database comprising restricted entities information, generating a match score based on the screening of the data, providing a data match based on the match score, and outputting the data match, a work queue for reviewing the data match, and a report generated based on the review of the data match.06-11-2009
20100023503SYSTEM AND METHOD FOR AUTOMATICALLY SELECTING A DATA SOURCE FOR PROVIDING DATA RELATED TO A QUERY - A computer-implemented method of prioritizing a predefined set of electronic data sources includes a step of identifying one or more second data sources corresponding to one or more first data sources if it is determined that the first data sources do not have the ability to provide data related to one or more query dimensions of a query statement. The identified one or more second data sources meet the following criteria: (1) one or more source fields of the one or more second data sources are equivalent to the one or more query dimensions not contained in the first data source; and (2) each source dimension field of the one or more second data sources either: (A) is equivalent to a source field of the first source or (B) has values that are capable of being obtained from the query statement. The one or more first data sources are linked with the corresponding one or more second data sources to generate one or more composite data sources. Scores are electronically assigned to each of the composite data sources based on certain criteria, and the composite data sources are electronically and dynamically ranked based on the assigned scores. One or more of the composite data sources electronically identified as having the highest rank are selected as preferred data sources for locating the data value in response to the query statement.01-28-2010
20090037411Membership selection assistant - A method and apparatus for a membership selection assistant that assists users in selecting a set of memberships to carry out a set of actions. This assistance is provided by determining what actions the user wishes to carry out and what memberships the user wishes to consider for those actions; then determining what benefits the user would derive by use of the memberships they have identified; then consolidating and rank ordering the benefit information; then either presenting the user with consolidated and rank ordered information, or automatically making optimal membership selection.02-05-2009
20080250013System, Method And Computer Program Product For Electronically Responding To Requests For Product Related Data - A method for electronically responding to requests for product related data, the method includes: collecting product related data from feeder systems; organizing the collected product related data into digital libraries within a document management system; receiving a discovery request from legal counsel to identify related documents; searching the product related data for documents; tagging documents identified in the search and placing copies of the documents in a holding queue; and importing the documents in the holding queue to a litigation support system.10-09-2008
20090006389NAMED URL ENTRY - Methods and systems allow users to enter natural language terms that describe a particular web site into an address field of a browser instead of a formal URL. The terms are evaluated to determine whether they correspond, with a high likelihood, to a particular web site. If so, this web site may be immediately accessed. If not, a list of search results based on the terms may be displayed by the browser.01-01-2009
20090006378COMPUTER SYSTEM METHOD AND PROGRAM PRODUCT FOR GENERATING A DATA STRUCTURE FOR INFORMATION RETRIEVAL AND AN ASSOCIATED GRAPHICAL USER INTERFACE - A computer system for generating data structures for information retrieval of documents stored in a database. The computer system includes a neighborhood patch generation subsystem for defining a patch of nodes having predetermined similarities in a hierarchy structure. The neighborhood patch generation subsystem includes a hierarchy generation subsystem for generating a hierarchy structure upon the document-keyword vectors and a patch definition subsystem. The computer system also comprises a cluster estimation subsystem for generating cluster data of the document-keyword vectors using the similarities of the patches.01-01-2009
20090144255AUGMENTING PRIVACY POLICIES WITH INFERENCE DETECTION - A system is provided for augmenting a privacy policy. During operation, the system obtains a set of training documents and at least one seed keyword associated with the privacy policy. The system extracts a number of candidate keywords from the training documents and formulates at least one query based on the candidate keywords. The system then issues the query to a corpus. In response to the query, the system receives a set of result documents. The system further determines whether a respective keyword extracted from the result documents matches at least one seed keyword. The system then augments the privacy policy by associating the candidate keyword corresponding to the respective keyword with the privacy policy based on the determination. In addition, the system applies the augmented privacy policy to a subject document and produces a result to indicate whether the subject document is in violation of the privacy policy.06-04-2009
20100030774MODEL ENTITY OPERATIONS IN QUERY RESULTS - The present invention provides systems and articles of manufacture that enhance the capability of a database abstraction model and query application constructed for an underlying physical database. Typically, the query application is used to compose and execute an abstract query. Once an initial query result is presented to a user, a user may select to execute a model entity operation by interacting with a query interface of the query application. A model entity operation allows the user to retrieve additional information from the underlying database, based on information included in the initial query result, without having to create a new query or having to correlate the results of multiple queries.02-04-2010
20080270388Method for providing keyword based on keyword providing range and system thereof - A method of providing a keyword includes: receiving a query from a user; setting, according to user's selection, a keyword providing range with respect to the query; and providing a representative keyword or a tail keyword with respect to the query based on the keyword providing range.10-30-2008
20080270386DOCUMENT RETRIEVAL SYSTEM AND DOCUMENT RETRIEVAL METHOD - A document retrieval is performed with similarities between documents in numeric data taken into consideration. To this end, generated is a set E of intervals in which each element of a set D of numeric values representing a feature A is included in any one of the intervals. Each numeric value in each document is indexed by assigning, with 1, an interval including an element x of the set D, and with 0, an interval without the element x. Each document data including numeric values is indexed by indexing its text part with term frequencies, and by indexing its numeric-value part with the above-described numeric value indexing scheme. By use of indices thus created for each of the document data, similarities between the document data are calculated using a vector space model or a probability model, and the document data are presented in order of similarity.10-30-2008
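The numeric-value indexing scheme in the entry above can be sketched as follows: each number becomes a 0/1 vector over a set of intervals, and vectors are compared with cosine similarity as in a vector space model. The overlapping half-open intervals, the function names, and the sample values are assumptions chosen for illustration; the abstract's text part (term frequencies) is omitted here.

```python
import math
from typing import List, Sequence, Tuple

Interval = Tuple[float, float]  # half-open interval [low, high), an assumption


def index_value(x: float, intervals: Sequence[Interval]) -> List[int]:
    """Assign 1 to every interval that contains x and 0 to the others."""
    return [1 if low <= x < high else 0 for low, high in intervals]


def cosine(u: Sequence[float], v: Sequence[float]) -> float:
    """Cosine similarity between two index vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


# Overlapping intervals let nearby values share at least one index position.
intervals = [(0, 10), (5, 15), (10, 20), (15, 25)]
near = cosine(index_value(7, intervals), index_value(12, intervals))
far = cosine(index_value(7, intervals), index_value(22, intervals))
```

With these intervals the values 7 and 12 share the interval [5, 15) and score 0.5, while 7 and 22 share none and score 0, so numerically close documents rank as more similar.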
20080270376WEB SPAM PAGE CLASSIFICATION USING QUERY-DEPENDENT DATA - A web spam page classifier is described that identifies web spam pages based on features of a search query and web page pair. The features can be extracted from training instances and a training algorithm can be employed to develop the classifier. Pages identified as web spam pages can be demoted and/or removed from a relevancy ranked list.10-30-2008
20080270383System for Generating and Displaying Community Awareness Data - A system and method links one or more disparate community awareness management (CAM) datasets for a community awareness program (CAP) with one or more spatial layers to create linked CAM datasets. One or more data attributes common to a CAM dataset and a spatial layer are identified, and the link is defined between the CAM dataset and the spatial layer. The spatial layer and the linked CAM dataset then may be queried using a single input query. Features from the spatial layer and features from the linked CAM dataset that match the query are generated for display. In one embodiment, a system and method manage CAP assets, transactions, interest areas for the CAP, and buffer areas for the CAP. An audience utility enables entering and maintaining audience data for the CAP. A journal utility enables making journal entries for one or more audience members, CAP assets, transactions, and/or other CAM data. A link document utility enables linking one or more documents to CAM data.10-30-2008
20080270384SYSTEM AND METHOD FOR INTELLIGENT ONTOLOGY BASED KNOWLEDGE SEARCH ENGINE - The present invention relates to a system and method for an intelligent ontology based knowledge search engine (IATOPIA KnowledgeSeeker). IATOPIA KnowledgeSeeker is an intelligent ontology-based system that is designed to help Web users find, retrieve, and analyze any Web information, such as news articles from the Internet, and then present the content in a semantic web. We present the benefits of using ontologies to analyze the semantics of Chinese text, and also the advantages of using a semantic web to organize information semantically. IATOPIA KnowledgeSeeker also demonstrates the advantages of using ontologies to identify topics. We use a Chinese document corpus to evaluate IATOPIA KnowledgeSeeker, and the testing result was compared to other approaches. It was found that the accuracy of identifying the topics of Chinese web articles is over 87%. It demonstrated a fast processing speed of less than one second per article. It also organizes content flexibly and understands knowledge accurately, unlike traditional text classification systems used in popular search engines today such as Google and Yahoo.10-30-2008
20080270380Method for Determining Contextual Summary Information Across Documents - In a method for determining contextual summary information across documents retrieved in response to a user query applied to a collection of documents the documents matching the query are identified. A query-dependent subsection of each of the matching documents is selected. Document properties associated with the document subsection are selected and associated with localized structures within the document. Relationships between localized document properties and user queries are determined and used to compute contextual summary information, whereby localized document properties are profiled across the retrieved documents in a contextual manner. The method allows a user query to select localized structures within a matching document and is generally applicable in information retrieval and the analysis of retrieved information.10-30-2008
20080270382System and Method of Personalizing Information Object Searches - Described are a system and method of performing an electronic search for information objects in data stores. A user submitting a text string is identified. A metadata model comprises interconnected nodes. At least one node corresponds to a metadata instance. A catalog of catalog items is provided. Each catalog item is linked to a metadata instance in the metadata model and uniquely associated with an information object. A user-access right is assigned to each metadata instance. The catalog is searched to find catalog items linked to a metadata instance satisfying the user-access right and criteria associated with the submitted text string. Links to metadata instances are extracted in real time from each catalog item found in the search of the catalog. Each metadata instance corresponding to an extracted link is displayed, if permitted for the identified user by the user-access right assigned to that metadata instance.10-30-2008
20080270379Online Search System, Method and Computer Program - A search system, method and computer program are disclosed in which characters of a search term are captured as they are entered into a client system (10-30-2008
20090157656AUTOMATIC, COMPUTER-BASED SIMILARITY CALCULATION SYSTEM FOR QUANTIFYING THE SIMILARITY OF TEXT EXPRESSIONS - A device and a method for automatic, computer-based similarity weighting of text expressions. The system and method contemplate a document data bank unit, a candidate expression memory unit and a similarity weight value calculation unit. The similarity weight values agw(t06-18-2009
20090157655Process For Computer Supported Processing of Course Data Elements, System and Computer Program Product - In summary, the present invention concerns processes for computer supported processing of source data elements (06-18-2009
20090157653Methods for enhancing digital search results based on task-oriented user activity - Methods for using task-related information to enhance digital searching are provided. A task-oriented user activity system maintains task-related information about resources accessed by a user and current user task. This task-related information is used to enhance search results by filtering and ranking results to increase relevance with respect to a user's current task. The task-related information can also be used to include task-related metadata in search engine index, e.g., by storing the metadata in the index or by storing it in resources which are subsequently indexed. Task-related information can also be used to enhance search results by enhancing search queries to include task-related search criteria.06-18-2009
20090157652METHOD AND SYSTEM FOR QUANTIFYING THE QUALITY OF SEARCH RESULTS BASED ON COHESION - A method and system for quantifying the quality of search results from a search engine based on cohesion. The method and system include modeling a set of search engine search results as a cluster and measuring the cohesion of the cluster. In an embodiment, the cohesion of the cluster is the average similarity between the cluster elements to a centroid vector. The centroid vector is the average of the weights of the vectors of the cluster. The similarity between the centroid vector and the cluster's elements is the cosine similarity measure. Each document in the set of search results is represented by a vector where each cell of the vector represents a stemmed word. Each cell has a cell value which is the frequency of the corresponding stemmed word in a document multiplied by a weight that takes into account the location of the stemmed word within the document.06-18-2009
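The cohesion measure in the entry above (average cosine similarity between each result vector and the cluster centroid) can be sketched as follows. This is a minimal illustration, not the patented implementation: it assumes each document has already been reduced to a dict mapping stemmed words to location-weighted frequencies, and the function names `centroid`, `cosine`, and `cohesion` are hypothetical.

```python
import math

def centroid(vectors):
    """Average the document vectors cell by cell (a missing cell counts as 0)."""
    total = {}
    for vec in vectors:
        for term, weight in vec.items():
            total[term] = total.get(term, 0.0) + weight
    return {term: weight / len(vectors) for term, weight in total.items()}

def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

def cohesion(vectors):
    """Average similarity of the cluster's elements to its centroid."""
    center = centroid(vectors)
    return sum(cosine(vec, center) for vec in vectors) / len(vectors)
```

Under these assumptions, a result set whose documents share the same vocabulary scores close to 1, while a set with no shared terms scores lower.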
20090157651Method and Apparatus for Detecting and Explaining Bursty Stream Events in Targeted Groups - A method and apparatus are provided for detecting and explaining bursty stream events in targeted groups. In one example, the method includes receiving validated bursty events, finding explanatory data sources having relevant bursty events that are relevant to the validated bursty events, wherein the explanatory sources explain the presence of the validated bursty events, correlating the validated bursty events to the relevant bursty events of the explanatory data sources to obtain burst results, and sending the burst results to a burst database that is accessible to an end user.06-18-2009
20090094223SYSTEM AND METHOD FOR CLASSIFYING SEARCH QUERIES - A search facility for classifying search queries prior to executing the search queries. The facility can receive a search query from a user and perform one or more of a set of evaluations of the search query to determine likely query classifications. The facility can also decompose the search query into constituent parts and perform one or more of a set of evaluations of the individual constituent parts to determine likely classifications. The facility can then arbitrate amongst the likely query classifications and rank the arbitrated likely query classifications. The ranked arbitrated query classifications can be mapped to data sources and services. The facility can retrieve content from the mapped data sources and services using the user's original search query. Each of the ranked arbitrated query classifications can correspond to a display region that can display content from the mapped one or more data sources and services to the user.04-09-2009
20090094220ASSOCIATIVE TEMPORAL SEARCH OF ELECTRONIC FILES - A method, system, and computer program product are provided for identifying data objects related to temporal characteristics. A first data object that has been previously stored is identified. The first data object has one or more associated temporal characteristics. At least one associated temporal characteristic is extracted from the first data object, thus, forming at least one extracted temporal characteristic. The at least one associated temporal characteristic is extracted in order to perform a search for at least one second data object. A search is performed for at least one second data object based on the at least one extracted temporal characteristic. The results of the search are presented in a graphical user interface.04-09-2009
20090094235Ordering directory assistance search results by local popularity of search results - A platform for ordering search results according to result popularity uses a directory assistance service, itself, to determine the popularity of the listings associated with a particular category and, therefore, the order in which those listings should be delivered resulting from a category search. Priority may be determined by the popularity of each search result. For example, directory assistance users in a particular location will likely know which businesses have a reputation for providing the best service or highest-quality products. Such factors will determine the popularity of these businesses. Businesses that are more popular are likely to be selected more frequently from the category search results than those that are less popular. The system may examine the history of search results for each listing and order the results of a particular category search according to the number of historical requests for each returned listing.04-09-2009
20090094221QUERY SUGGESTIONS FOR NO RESULT WEB SEARCHES - Presenting one or more suggested search-engine queries based on an initial search-engine query is described herein. Once the initial query is received, a search engine determines whether any web content is relevant thereto. If not, a query-suggestion service determines whether any suggested queries can be substituted for the initial query. If not, the query is spell-corrected, if necessary, and parsed into individual terms. Each parsed term is then checked to see whether it can be associated with alternative search terms. Terms that can are combined and their combination is also checked for alternative search terms. All of the alternative search terms are scored and then assembled into a list of suggested search terms that is presented to the user.04-09-2009
20090094219METHOD AND SYSTEM FOR IDENTIFYING A CANDIDATE FOR AN OPPORTUNITY - A database stores information about persons and their qualifications. In response to information about a current opportunity that specifies at least one qualification, the database is searched to identify at least one of the following: first persons having qualifications that substantially match the current opportunity's specified qualification; and preexisting opportunities that specify one or more qualifications that substantially match the current opportunity's specified qualification. Also, the database is searched to identify target companies that satisfy at least one of the following conditions: the target company is where at least one of the first persons exists; the target company is where at least one of the first persons previously existed; the target company is where at least one of the preexisting opportunities exists; and the target company is where at least one of the preexisting opportunities previously existed. Further the database is searched to identify second persons that satisfy at least one of the following conditions: the second person exists in at least one of the target companies; and the second person previously existed in at least one of the target companies. A list of the second persons is output to a human user, so that the human user is equipped to contact the second persons.04-09-2009
20080222140COMPARATIVE WEB SEARCH SYSTEM AND METHOD - A system and method for comparative web search engines, search result summarization, web snippet processing, comparison analysis, information visualization, meta-clustering, and quantitative evaluation of web snippet quality are disclosed. The present invention extends the capabilities of web searching and informational retrieval by providing a succinct comparative summary of search results at either the object or thematic levels.09-11-2008
20090216755Indexing Method For Multimedia Feature Vectors Using Locality Sensitive Hashing - A computer implemented method for indexing multimedia vectors and for searching and retrieving a query vector using a locality sensitive hashing. Indexing is performed by calculating hash codes from the multimedia vectors using several hash functions. Each hash code is a different subset of the entries in the hash vector. The method utilizes the structure of the hash vector space in order to define the hash codes in a way that improves the retrieval efficiency. Retrieval is performed by applying the hash functions to a query vector and measuring the distances between the query vector and multimedia vectors with hash codes identical to the hash codes of the query vector.08-27-2009
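The indexing and retrieval steps in the entry above can be sketched as follows. This is an illustration under stated assumptions, not the patented method: the hash vector is produced by random-hyperplane sign bits (a standard locality sensitive hashing choice that the abstract does not specify), each hash code is a disjoint slice of that hash vector, and the class name `SubsetLSHIndex` and its parameters are hypothetical.

```python
import math
import random
from collections import defaultdict

class SubsetLSHIndex:
    def __init__(self, dim, num_codes=4, bits_per_code=4, seed=0):
        rng = random.Random(seed)
        total_bits = num_codes * bits_per_code
        # Assumption: one random hyperplane per entry of the hash vector.
        self.planes = [[rng.gauss(0.0, 1.0) for _ in range(dim)]
                       for _ in range(total_bits)]
        # Each hash code uses a different subset (here: a disjoint slice) of entries.
        self.slices = [range(k * bits_per_code, (k + 1) * bits_per_code)
                       for k in range(num_codes)]
        self.tables = [defaultdict(list) for _ in range(num_codes)]
        self.vectors = []

    def _codes(self, vec):
        # Hash vector: sign bit of the projection onto each hyperplane.
        bits = [1 if sum(p * x for p, x in zip(plane, vec)) >= 0 else 0
                for plane in self.planes]
        return [tuple(bits[i] for i in s) for s in self.slices]

    def add(self, vec):
        """Index a vector under every one of its hash codes; return its id."""
        idx = len(self.vectors)
        self.vectors.append(vec)
        for table, code in zip(self.tables, self._codes(vec)):
            table[code].append(idx)
        return idx

    def query(self, vec):
        """Return ids of vectors sharing at least one hash code, nearest first."""
        candidates = set()
        for table, code in zip(self.tables, self._codes(vec)):
            candidates.update(table.get(code, []))
        return sorted(candidates, key=lambda i: math.dist(vec, self.vectors[i]))
```

Only vectors that collide with the query on at least one code are compared by distance, which is where the retrieval saving comes from.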
20090287674Method for Enhancing Search and Browsing in Collaborative Tagging Systems Through Learned Tag Hierachies - A number of Web 2.0 sites support collaborative tagging systems, which allow users to tag resources with keywords. The tags enable search and retrieval of resources both for the user and for other users, using interfaces like a conventional search form or a tag cloud. A tag hierarchy-based search and retrieval system is provided that enhances the existing interfaces by improving search recall and allowing the discovery of even poorly annotated resources. The system uses tag co-occurrence information to automatically learn tag hierarchies. The learned hierarchies are used for automatically inferring additional tags to resources. These inferences are used to improve the recall of queries issued from a search form or via a tag cloud. The learned hierarchies can be viewed as an emergent ontology that is built up through the collaborative wisdom of a large number of users.11-19-2009
20090210404DATABASE SEARCH CONTROL - Identifying a search engine user's preference for handling quotations, using easily remembered variations for enclosing a quote, simplifies the user interface. An example is enclosing the quote in either single or double quotation marks to indicate search options for the quote. A method of controlling a database search comprises receiving a search string; identifying, within the search string, a pair of phrase indicia such as quotation marks; identifying, between the pair of indicia, a quote string; matching the pair of phrase indicia to one of a plurality of pairs of indicia, wherein first and second ones of the plurality indicate an exact quote search and a modified quote search, respectively; and identifying, responsive to the matching, a request for an exact quote search or a modified quote search. The modified quote search may be a spell corrected search, a word stemmed search, an alternate spelling search, or a translated search.08-20-2009
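The matching of phrase indicia to a search mode described in the entry above can be sketched as follows. The mapping of double quotation marks to an exact quote search and single quotation marks to a modified quote search is an assumption made for illustration (the abstract does not say which mark selects which mode), and `PHRASE_INDICIA` and `parse_quote_search` are hypothetical names.

```python
import re

# Assumption: which pair of indicia selects which search mode.
PHRASE_INDICIA = {'"': "exact", "'": "modified"}

def parse_quote_search(search_string):
    """Return (mode, quote string) for the first quoted phrase, or None."""
    # Find an opening mark, the shortest non-empty text, and the same closing mark.
    match = re.search(r"""(["'])(.+?)\1""", search_string)
    if match is None:
        return None
    mark, quote = match.group(1), match.group(2)
    return PHRASE_INDICIA[mark], quote
```

A production parser would also need to distinguish apostrophes inside words from single quotation marks; this sketch does not attempt that.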
20080270378Method, Apparatus and Computer Program Product for Determining Relevance and/or Ambiguity in a Search System - An apparatus for determining relevance and/or ambiguity in a search system may include a processing element configured for receiving visual media comprising a query, determining search results including a matching score for at least one candidate visual media with respect to the query based on ambiguity and relevance, utilizing a mapping function to provide a confidence level associated with the search results, and providing a visualization of the search results based on the confidence level.10-30-2008
20080270374METHOD AND SYSTEM FOR COMBINING RANKING AND CLUSTERING IN A DATABASE MANAGEMENT SYSTEM - A system for combining ranking and clustering in a query. Bit vectors are intersected on Boolean attributes resulting in a vector. Two summary grids are constructed by intersecting bit vectors on clustering and ranking attributes. The vector is intersected with each summary grid to obtain a filtered clustering and ranking grid. An algorithm is applied on the clustering grid to obtain clusters. Vectors associated with buckets in the clusters are intersected resulting in one vector for each cluster. The vector corresponding to each cluster is intersected with the ranking grid to obtain a modified grid. Buckets are pruned according to bounds of each bucket in the modified grid and a predetermined number to obtain candidate buckets containing the predetermined number of data. The data are retrieved and a ranking score is calculated. The top predetermined number of data are sorted according to ranking scores and a result is returned.10-30-2008
20080270377CALCULATING GLOBAL IMPORTANCE OF DOCUMENTS BASED ON GLOBAL HITTING TIMES - A calculate importance system calculates the global importance of a web page based on a “mean hitting time.” Hitting time of a target web page is a measure of the minimum number of transitions needed to land on the target web page. Mean hitting time of a target web page is an average number of such transitions for all possible starting web pages. The calculate importance system calculates a global importance score for a web page based on the reciprocal of a mean hitting time. A search engine may rank web pages of a search result based on a combination of relevance of the web pages to the search request and global importance of the web pages based on a global hitting time.10-30-2008
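The scoring in the entry above can be sketched as follows, taking the abstract's wording literally: the hitting time from a starting page is the minimum number of link transitions needed to reach the target, the mean is taken over starting pages, and importance is the reciprocal of that mean. Excluding pages that cannot reach the target from the average is an assumption of this sketch, and the function names are hypothetical.

```python
from collections import deque

def mean_hitting_time(links, target):
    """Average minimum number of transitions from other pages to `target`.

    `links` maps each page to the pages it links to.
    """
    # Build the reversed graph so one breadth-first search from the target
    # yields the shortest distance from every page that can reach it.
    incoming = {}
    for src, dsts in links.items():
        for dst in dsts:
            incoming.setdefault(dst, set()).add(src)
    dist = {target: 0}
    queue = deque([target])
    while queue:
        page = queue.popleft()
        for prev in incoming.get(page, ()):
            if prev not in dist:
                dist[prev] = dist[page] + 1
                queue.append(prev)
    # Assumption: pages that cannot reach the target are left out of the mean.
    others = [d for page, d in dist.items() if page != target]
    return sum(others) / len(others) if others else float("inf")

def global_importance(links, target):
    """Reciprocal of the mean hitting time (0.0 if no page reaches the target)."""
    mean = mean_hitting_time(links, target)
    return 0.0 if mean == float("inf") else 1.0 / mean
```

A search engine could then combine this score with a query relevance score, as the abstract suggests; how the two are combined is not specified there.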
20090292689SYSTEM AND METHOD OF PROVIDING ELECTRONIC DICTIONARY SERVICES - A database and techniques for managing and updating the database are described. The database includes defined terms and undefined terms stored therein. While each of the defined terms is stored in the database in association with a definition thereof, each of the undefined terms is stored in the database in association with a number of times search requests have been received for the associated undefined term. Further, a ranking list for the undefined terms is maintained. In the ranking list, the undefined terms are ranked with reference to the number associated with each. Search requests for particular terms are received. In response to the search requests, the search for the particular term is performed in the database. If a match is found between a first one of the particular term and a first one of the defined terms, the definition thereof is retrieved from the database. Also, if a match is not found between a second one of the particular term and any of the defined terms, the ranking list is updated with reference to the second particular term.11-26-2009
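The bookkeeping in the entry above (definitions for defined terms, request counts and a ranking list for undefined terms) can be sketched as follows. The class name `TermDatabase` and its methods are hypothetical, and breaking ties alphabetically in the ranking list is an assumption.

```python
class TermDatabase:
    def __init__(self, definitions):
        self.defined = dict(definitions)   # term -> definition
        self.undefined_counts = {}         # term -> number of search requests

    def search(self, term):
        """Return the definition, or record the miss and return None."""
        if term in self.defined:
            return self.defined[term]
        self.undefined_counts[term] = self.undefined_counts.get(term, 0) + 1
        return None

    def undefined_ranking(self):
        """Undefined terms, most requested first (ties broken alphabetically)."""
        return sorted(self.undefined_counts,
                      key=lambda t: (-self.undefined_counts[t], t))
```

The ranking list gives maintainers a simple priority order for which missing terms to define next.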
20090063476Method and Apparatus for Restricting a Fan-Out Search in a Peer-to-Peer Network Based on Accessibility of Nodes - A method, apparatus, and computer implemented instructions for restricting a fan-out type search of a distributed database. A search request is received indicating that a requesting node originating the search request desires to avoid receiving search results including inaccessible nodes. Responsive to receiving the search results from other nodes, the search results are filtered to remove search results from inaccessible nodes to form filtered search results. The filtered search results are passed to the requesting node.03-05-2009
20090299988APPLICATION OF USER CONTEXT TO SEARCHES IN A VIRTUAL UNIVERSE - An approach that applies user context to searches in a virtual universe is described. In one embodiment, there is an enhanced virtual universe search tool that includes a receiving component configured to receive a query from an avatar that is online in the virtual universe. A scanning component is configured to scan a collection of avatar data describing attributes that are relevant to behavioral, search and informational needs of the avatar. A resource search component is configured to return search results for the query that are in accordance with the scanned collection of avatar data.12-03-2009
20090287685METHOD AND APPARATUS FOR SOCIOLOGICAL DATA ANALYSIS - A method to enable improved analysis and use of sociological data, the method comprising identifying causal relationships between a plurality of documents, identifying a plurality of characteristics of a communication, including a modality used, actors involved, proximate events of relevance, and enabling a user to query based on available characteristics.11-19-2009
20090077060TECHNIQUES FOR SECURE NETWORK SEARCHING - Techniques for network searching are provided. A search is defined and the search is encrypted in a format known to a search service. Return instructions are defined for delivering search results of the search to a principal that defined the search and the return instructions. The return instructions are encrypted in a different format known to a return search process. The encrypted search is delivered to the search service for processing the search and the encrypted return instructions are delivered to the return search process for handling search results provided by the search service and for conforming delivery of the search results to the return instructions.03-19-2009
20090204611INFORMATION DISPLAY APPARATUS, INFORMATION DISPLAY PROGRAM AND INFORMATION DISPLAY SYSTEM - The interest of a user of an information terminal is extracted by a file operation and information suited to the interest is provided. When a system call for file access is issued from an application program, an access processor 08-13-2009
20090187566ASSOCIATING DOCUMENTS WITH CLASSIFICATIONS AND RANKING DOCUMENTS BASED ON CLASSIFICATION WEIGHTS - A method and apparatus for associating documents with classification values and ranking documents based on classification weights is provided. It is determined if a document is associated with a classification. If the document is associated with a classification, then it is determined if a classification value, which is associated with the document, is associated with a weight. If the classification value is associated with a weight, then a rank of the document is adjusted based on the weight that is associated with the classification value.07-23-2009
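The two checks in the entry above (is the document classified, and does its classification value carry a weight) can be sketched as follows. Multiplying the rank by the weight is an assumption made for illustration, since the abstract only says the rank is adjusted based on the weight, and `adjusted_rank` is a hypothetical name.

```python
def adjusted_rank(base_rank, classification_value, weights):
    """Adjust a document's rank by the weight of its classification value."""
    if classification_value is None:
        return base_rank              # document has no classification
    weight = weights.get(classification_value)
    if weight is None:
        return base_rank              # classification value has no weight
    return base_rank * weight         # assumption: multiplicative adjustment
```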
20090177647Method and Apparatus for Conducting Data Queries Using Consolidation Strings and Inter-Node Consolidation - Query inefficiencies are improved and entity-interrelational blindness is overcome by employing two ideas: Consolidation Strings and Inter-Node Consolidation. These ideas can be typically employed in law-enforcement records systems (such as COPLINK systems), but are certainly not limited to such an application. Consolidation Strings represent key pieces of information that are in a text/character format, and may be encrypted/hashed. A system's hierarchy of consolidation rules automatically determines if two different rows in a database actually refer to the same real-world object. These rules are NOT statistical or probabilistic in nature, thus enhancing the confidence and reliability in the results. Three general classifications of Consolidation Strings are encompassed: Those based on positive identifiers, those based on demographic information, and those based on associative information that spans multiple-entity types. Inter-Node Consolidation provides a means to facilitate the communication of updated consolidation information between data-source nodes in order to leverage the advantages of Consolidation Strings.07-09-2009
20090171936System, Method, and Computer Program Product for Accelerating Like Conditions - A system, method, and computer program product are provided for optimizing LIKE-condition based queries on a table in a database system.07-02-2009
20090063443System and Method for Dynamically Supporting Indirect Routing Within a Multi-Tiered Full-Graph Interconnect Architecture - A method, computer program product, and system are provided for dynamically routing data through the data processing system. Data is received at a first processor that is to be transmitted to a destination processor. The data that is received includes address information. A lookup is performed in routing table data structures based on the address information to identify candidate paths through which the data is routed to the destination processor. A determination is made as to whether any of the candidate paths are not able to be used to route the data to the destination processor based on a setting of at least one identifier. A path is selected from the identified candidate paths for routing of the data based on a setting of the at least one identifier. Then, the data is transmitted from the first processor along the selected path toward the destination processor.03-05-2009
20090055386System and Method for Enhanced In-Document Searching for Text Applications in a Data Processing System - A system and method for implementing enhanced searching within a document in a data processing system. A search manager receives an original search term, wherein the original search term includes at least two words. The search manager creates a set of alternate search terms by: retrieving from a predetermined thesaurus database at least one synonym for at least one word in the original search term; and inserting at least one wildcard between the at least two words within the original search term. The search manager performs at least one search utilizing the set of alternate search terms and the original search term. The search manager ranks the search results from the at least one search according to a predetermined priority order. The search manager outputs the ranked search results.02-26-2009
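The creation of alternate search terms in the entry above (synonym substitution plus a wildcard between words) can be sketched as follows. The thesaurus is modeled as a plain dict, the `*` wildcard symbol is an assumption, and `alternate_search_terms` is a hypothetical name; the searching and ranking steps are not shown.

```python
def alternate_search_terms(original, thesaurus, wildcard="*"):
    """Build alternate terms for a search term of at least two words."""
    words = original.split()
    if len(words) < 2:
        raise ValueError("the original search term needs at least two words")
    alternates = []
    # Replace one word at a time with each of its synonyms.
    for i, word in enumerate(words):
        for synonym in thesaurus.get(word, []):
            alternates.append(" ".join(words[:i] + [synonym] + words[i + 1:]))
    # Insert a wildcard between the words of the original term.
    alternates.append(f" {wildcard} ".join(words))
    return alternates
```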
20090204599USING RELATED USERS DATA TO ENHANCE WEB SEARCH - The claimed subject matter provides a system and/or a method that facilitates generating a personalized query result for a specific user. An interface can receive at least one of a portion of a text query to be searched or a portion of personalized content related to a user that submits the portion of the text query. A personalization component can combine the portion of personalized content related to the user with a portion of personalized content related to one or more disparate users to create group personalized content, wherein the group personalized content is compared with the portion of the text query to identify a relationship there between to generate a personalized query result in accordance with the relationship.08-13-2009
20090063466Resource selector, including for use in handheld devices - Described is a technology by which a resource selector traverses a hierarchical storage structure to enumerate its resources and provide a flat list of corresponding items. The user interacts with the flat list to select an item. The resource selector is particularly beneficial when incorporated into a handheld computing device. The resource selector may use a filtering criterion associated with an application program, e.g., the hierarchical storage may correspond to a file system, with the file extension (type) being the filtering criterion. A trigger coupled to the resource selector triggers the resource selector, in which the trigger may be incorporated into the application program, or may comprise an application-independent (e.g., operating system) component that knows which application program currently has focus and triggers the resource selector for that application.03-05-2009
20090055389Ranking similar passages - Passages in a digital corpus are scored and ranked based at least in part on characteristics of instances of the passages occurring in the corpus. Such characteristics include the popularity of the author, the characteristics of the words introducing and following the similar passage, frequency of appearance of the passage in the digital corpus, the length of the similar passage, the words of the similar passage, the usage of punctuation with the similar passage, and the diffusion of the similar passage within the digital corpus. The characteristics are scored and weighted to produce ranking scores for the associated passages. The ranking scores are used for purposes including selecting passages to display in association with a document and ranking passages displayed in response to a search.02-26-2009
20090006385SYSTEM AND METHOD FOR MEASURING THE QUALITY OF DOCUMENT SETS - Systems and methods are described that calculate the interestingness of a set of one or more records in a database, either absolutely (i.e., compared to an overall collection of records) or relative to some other set of records. In one embodiment, the measure is a relative entropy value that has been normalized. Various applications of the measure are described in the context of an information retrieval system. These applications include, for example, guiding query interpretation, guiding view selection and summarization, intelligent ranges, event detection, concept triggers and interpreting user actions, hierarchy discovery, and adaptive data mining.01-01-2009
20090006388SEARCH RESULT RANKING - A search engine (01-01-2009
20090006379FILTERING METHOD AND SYSTEM FOR THE CORRELATION BETWEEN TESTING OBJECTS AND PATENTS - In a filtering system for a correlation related to a composing element portfolio of a patent, a plurality of elements are defined by standard element codes and stored. Then composing elements of the patent are defined by the corresponding standard element codes in a standard element depository so as to form the composing element portfolio of the patent to be stored. Then an input module defines a composing element portfolio of the testing object according to the standard element codes. After that, the composing element portfolio input by the input module is matched with the composing element portfolio of each said patent. Afterward, matching results are sorted according to correlation and a sorted result based on the correlation is displayed.01-01-2009
20090006361METHOD AND SYSTEM FOR CONTAINING AND ACCESSING MULITPLE WEB BROWSERS - A container browser or a super browser stores browsers and enables a user to launch any stored web browser and display a designated web page from a preferred browser. This container browser can track and bookmark the browsers such that the browsers could be easily selected and initiated. The container browser could present the content in tabbed form, meaning that a tab that the user can click on will represent every open window. Once the user has designated a particular browser to display a particular web page, then this information is stored in the container browser. Further accesses to that web site would be displayed using the designated web browser. Other designated web pages would be displayed on other designated web browsers.01-01-2009
20090259645Block compression algorithm - A method for compressing a data stream based on a 3 byte sequence is used. Each three byte sequence is assigned a code word including a location and a length of the data associated with the code word. When a 6 byte sequence is located, a binary tree of 6 byte sequences sharing the same first three bytes is built, associating each 6 byte sequence with a position in the stream where the 6 byte sequence is found. When the length of a code word is changed, a byte sequence is emitted that identifies the code word to be changed and updating the length of the code word, so that when a match is found, a byte sequence is emitted that identifies the code word associated with the matched data. The method finds particular application in data streams that are sent to printers, and which contain large blocks of identical data.10-15-2009
20090210406METHOD AND SYSTEM FOR CLUSTERING IDENTIFIED FORMS - A method is provided for organizing a plurality of documents that include forms. An initial set of clusters is defined for the plurality of documents. The initial set of clusters is reclustered based on similarity values calculated in multiple feature spaces. For example, a first feature space may be associated with a content of a document while a second feature space may be associated with a content of a form associated with the document. Each cluster has an associated centroid vector in each feature space that is used to represent the cluster. The similarity between the document and each cluster is calculated in both feature spaces. Each document is assigned to the cluster whose centroid is most similar. The cluster centroids may be recalculated and the process repeated until the cluster assignments become stable.08-20-2009
20090089280REAL-TIME SEARCH TERM POPULARITY DETERMINATION, BY SEARCH ORIGIN GEOGRAPHIC LOCATION - Information is generated indicative of frequency of search terms presented to at least one online search service. As event indications, indicative of user interaction generally with front end servers, are being provided for persistent storage, ones of the event indications that are indicative of search events are detected. The detected ones of the search event indications are processed and it is determined, based at least in part thereon, by location, frequency data indicative of a frequency of each of a plurality of search terms presented to the at least one online search service. An indication of at least some of the frequency data is caused to be associated with indications of locations to which the frequency data corresponds. For example, the frequency data may be displayed superimposed on a map.04-02-2009
20090187561CONTENT SEARCH METHOD AND SYSTEM - Provided are a content search method and system, and more particularly, a content search method and system which enable a user to easily access desired search results by reducing the need to input a keyword again or navigate through pages of search results. The content search method includes: receiving a keyword and extracting a list of contents which correspond to the received keyword; adjusting relative weights of one or more attribute information of the contents by using a user interface which can adjust the relative weights; and sequentially listing the contents based on the adjusted weights and providing the contents accordingly.07-23-2009
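The weight-adjusted listing described in entry 20090187561 can be illustrated with a minimal sketch. The attribute names (`recency`, `popularity`), the weighted-sum scoring, and the function name `rank_contents` are assumptions made for illustration; the abstract does not specify how the adjusted weights are combined.

```python
from typing import Dict, List


def rank_contents(contents: List[Dict], weights: Dict[str, float]) -> List[Dict]:
    """Order contents by the weighted sum of their attribute values, highest first.

    The weighted sum is an assumed scoring rule, not one taken from the abstract.
    """
    def score(item: Dict) -> float:
        return sum(weights.get(attr, 0.0) * value
                   for attr, value in item["attributes"].items())

    return sorted(contents, key=score, reverse=True)


# Hypothetical contents already extracted for a keyword.
contents = [
    {"title": "A", "attributes": {"recency": 0.9, "popularity": 0.2}},
    {"title": "B", "attributes": {"recency": 0.3, "popularity": 0.8}},
]

# The user moves the weight controls; the same contents are listed again
# without re-entering the keyword.
by_recency = rank_contents(contents, {"recency": 1.0, "popularity": 0.1})
by_popularity = rank_contents(contents, {"recency": 0.1, "popularity": 1.0})

print([c["title"] for c in by_recency])
print([c["title"] for c in by_popularity])
```

Changing only the weights reorders the existing result list, which is the effect the abstract attributes to the user interface.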
20090187550SPECIFYING RELEVANCE RANKING PREFERENCES UTILIZING SEARCH SCOPES - A mechanism for expressing a user preference to a set of documents based on user knowledge about the document corpora. The user preference input to the system can be positive, negative, or both. A set of documents that can be identified with a query can define a search scope definition. The search scope is mapped into an input ranking feature for a ranking function. The search scope definition is employed as a soft preference ranking feature, and thus, used to bias ranking via relevance feedback. The mechanism facilitates increasing or decreasing the final ranking score of a document based on whether the document falls into the user scope. The ranking weight can be configured by the user ad-hoc, or when relevance judgments are available, using machine learning techniques to find the optimal weights to optimize ranking.07-23-2009
20090276412Method, apparatus, and computer program product for providing usage analysis - An apparatus for providing usage analysis may include a processor. The processor may be configured to receive a plurality of usage attributes from a plurality of platforms, where the plurality of platforms may be associated with a user. In this regard, the plurality of usage attributes may have associations with a plurality of objects, and each usage attribute within the plurality of usage attributes may be indicative of an action taken by the user with respect to one object within the plurality of objects. The processor may be further configured to arrange indications of the objects within the plurality of objects based on the plurality of usage attributes for presentation. Associated methods and computer program products may also be provided.11-05-2009
20090276418INFORMATION PROCESSING APPARATUS, INFORMATION PROCESSING METHOD, INFORMATION PROCESSING PROGRAM AND RECORDING MEDIUM - An information processing apparatus is disclosed which determines, based on fitness to a specified condition, an order of displaying multiple to-be-searched information items which are pre-stored. The apparatus includes a specifying-condition information obtaining unit, an index-information obtaining unit, a population-limiting information obtaining unit, an index-information modifying unit, and a fitness calculating unit.11-05-2009
20090276413MANAGING ELECTRONIC DATA WITH INDEX DATA CORRESPONDING TO SAID ELECTRONIC DATA - An improved approach for managing and sending electronic data which allows one to access electronic data corresponding to a hardcopy document is provided. For example, when the hardcopy bearing a visible image is output, an identification image corresponding to identification data identifying the document is added to the visible image. The identification data can be recognized from the identification image, and used to retrieve various information in a database corresponding to the document.11-05-2009
20090144253METHOD OF PROCESSING A SET OF CONTENT ITEMS, AND DATA- PROCESSING DEVICE - A method of processing a set of content items, the method comprising the steps of: (06-04-2009
20090282024CASE SEARCH SYSTEM, CASE DATABASE, CASE SEARCH APPARATUS, CASE SEARCH METHOD, AND PROGRAM - A case search system for searching for a case that serves as a reference in a design or operation of a wireless network.11-12-2009
20090282028User Interface and Method for Web Browsing based on Topical Relatedness of Domain Names - Systems, computer software and methods for searching plural domain names based on domain name system queries are described. The method includes receiving as input a domain name, searching a database for identifying scores measuring relatedness of the input domain name and other domain names of the plural domain names, retrieving related domain names with the highest relatedness scores, and associating the input domain name and the related domain names. The relatedness scores are calculated based on the domain name system queries of users.11-12-2009
20090282026SYSTEM TO GENERATE AN AGGREGATE INTEREST INDICATION WITH RESPECT TO AN INFORMATION ITEM - A method is provided to publish a list of top ranked listings. The method may include configuring a database to store a plurality of listings published over a network. An interest indication may be received from a user for a listing in the plurality of listings. An interest indication data structure may be created and stored that associates the user with the listing. Also, the top ranked listings may be identified from the plurality of listings based on the number of stored interest indication data structures for each listing. Further, the list of the top ranked listings may be published.11-12-2009
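As a rough illustration of entry 20090282026, the sketch below counts stored interest indications per listing and returns the top ranked listings. Representing each interest indication data structure as a `(user, listing)` tuple, and breaking ties by listing identifier, are assumptions made for this example.

```python
from collections import Counter
from typing import List, Set, Tuple


def top_ranked_listings(interest_indications: Set[Tuple[str, str]], n: int) -> List[str]:
    """Return the n listings with the most stored interest indications."""
    counts = Counter(listing for _user, listing in interest_indications)
    # Sort by descending count; ties are broken by listing id (an assumed rule).
    ordered = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return [listing for listing, _count in ordered[:n]]


# Hypothetical stored structures, each associating a user with a listing.
indications = {
    ("alice", "L1"), ("bob", "L1"),
    ("alice", "L2"),
    ("carol", "L3"), ("bob", "L3"), ("dave", "L3"),
}

top_two = top_ranked_listings(indications, 2)
print(top_two)
```

Using a set means a repeated indication from the same user for the same listing is stored once, which is one possible reading of "associates the user with the listing".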
20090282023SEARCH ENGINE USING PRIOR SEARCH TERMS, RESULTS AND PRIOR INTERACTION TO CONSTRUCT CURRENT SEARCH TERM RESULTS - An Internet infrastructure contains a search server that delivers search result pages of search results or web sites to client devices based upon a search string. The search results provided to the user take into account prior search terms entered by the user, and may take into account user interaction (or lack thereof) with prior search results as well as additional information other than just the search string and popularity ranking of web pages on the Internet. Specifically, a web browser contained in the client devices displays a first set of search result pages of web sites delivered by the search server in response to a search string. Then, in response to a modified search string and/or monitored and processed user interaction with prior search results, the search server delivers a second set of search result pages, comprising more relevant search information.11-12-2009
20090282021WEB BROWSER ACCESSIBLE SEARCH ENGINE WHICH ADAPTS BASED ON USER INTERACTION - A search engine (SE) is capable of adapting based on the user's interaction with search results/WebPages. Information, based on user interaction, is subsequently used to modify the priority of search results to create a more relevant search list that provides the user more relevant search information in a shorter period of time. The search engine adaptation takes place by calculating evaluation inputs based on user interaction with search results to compute a metric herein called the desirability number for one or more search results. The desirability number (DN) is tagged as a search result or page attribute and is stored in the search engine server database in association with each search result or page. Based on the DN and other possible indicators, the resultant search list is modified to better prioritize search results that appear to be more meaningful to the user before continuing to present results to the user.11-12-2009
20090282015Systems and Methods for Predicting a Degree of Relevance Between Digital Ads and Webpage Content - Systems and methods for predicting a degree of relevance between a set of candidate digital ads and webpage content are disclosed. Generally, an ad provider receives a digital ad request associated with webpage content. The ad provider identifies a set of candidate digital ads that may be served in response to the digital ad request. A relevance module extracts a set of features from the set of candidate digital ads and the webpage content, and determines a degree of relevance between the set of candidate digital ads and the webpage content based on a prediction model and the extracted set of features. If the relevance module determines the set of candidate digital ads is relevant to the webpage content, the ad provider may serve one or more digital ads from the set of candidate digital ads in response to the received digital ad request.11-12-2009
20090100049Methods and Apparatus for Entity Search - Methods and apparatus that deliver a searching experience that is substantially akin to consultation with a human expert, and that satisfies a user's information need in fulfilling projects such as purchasing, shopping, procurement, bartering, requesting for quotes, in online retail, traditional retail, wholesale, health care, travel, real estate, restaurant-going, entertainment, logistics, and sourcing are disclosed. Search results often contain entities that provide services and products. Records being searched are associated with industry sectors in a broad sense. Industry sector information is first derived from a user query; and is used in determining relevant and adequate additional questions for a searcher, and in matching, ranking, and presenting search results.04-16-2009
20090100041Public Electronic Document Dating List - Systems and methods are disclosed which enable the establishment of file dates and the absence of tampering, even for documents held in secrecy and those stored in uncontrolled environments, but which does not require trusting a timestamping authority or document archival service. A trusted timestamping authority (TTSA) may be used, but even if the TTSA loses credibility or a challenger refuses to acknowledge the validity of a timestamp, a date for an electronic document may still be established. Systems and methods are disclosed which enable detection of file duplication in large collections of documents, which can improve searching for documents within the large collection.04-16-2009
20100017399TECHNIQUE FOR RECYCLING MATCH WEIGHT CALCULATIONS - Disclosed is a system for, and method of, recycling field value weights as computed for database linking purposes. Such field value weights may be used for a search operation. In some embodiments, such weights may be used for a search operation prior to their values stabilizing during an iterative linking operation.01-21-2010
20080313166RESEARCH PROGRESSION SUMMARY - Systems, methods, and computer-readable media for generating a research progression summary are provided. A research progression summary provides a snapshot of documents (e.g., articles) that have had a significant impact on a particular field of research, or at least a portion thereof, over time. Research progression sorts through all accessible relevant documents, analyzes the importance of each, and summarizes for presentation only those documents determined to be of particular importance with respect to a topic of interest (i.e., the particular field of research or some portion thereof). In this manner, a researcher can readily determine how the thinking with respect to a particular topic has progressed over time. By way of example only, the research progression summary may focus on one or more of historical developments in a particular field, current developments with respect to a topic of interest, or an overall summary of a particular field/topic.12-18-2008
20090282014Systems and Methods for Predicting a Degree of Relevance Between Digital Ads and a Search Query - Systems and methods for predicting a degree of relevance between a set of candidate digital ads and a search query are disclosed. Generally, an ad provider receives a digital ad request associated with a search query. The ad provider identifies a set of candidate digital ads that may be served in response to the digital ad request. A relevance module extracts a set of features from the set of candidate digital ads and the search query associated with the digital ad request, and determines a degree of relevance between the set of candidate digital ads and the search query based on a prediction model and the extracted set of features. If the relevance module determines the set of candidate digital ads is relevant to the search query, the ad provider may serve one or more digital ads from the set of candidate digital ads in response to the received digital ad request.11-12-2009
20090287693METHOD FOR BUILDING A SEARCH ALGORITHM AND METHOD FOR LINKING DOCUMENTS WITH AN OBJECT - A computer-readable medium including computer-readable information thereon including instructions providing a method for refining a search algorithm is provided, the method comprising displaying a document, displaying at least one metadata about the search result, receiving instructions about a selection of at least one of the metadata; and modifying a search algorithm by including the selected metadata in the search algorithm. The method can be applied to internet pages based on meta tags. A method for linking documents to an object is also provided. A system and interface for carrying out the same is also provided herein.11-19-2009
20090287689AUTOMATED CALIBRATION OF NEGATIVE FIELD WEIGHTING WITHOUT THE NEED FOR HUMAN INTERACTION - Disclosed is a system for, and method of, calculating parameters used to determine whether records and entity representations should be linked. Such parameters may be set as negative to account for fields that do not match. The system and method apply iterative techniques such that parameters from each linking iteration are used in the next linking iteration. The system and method need no human interaction in order to calibrate and utilize record matching formulas used for the linking decisions.11-19-2009
20090287688Method for Searching for Class and Function Based on .NET Card and .NET Card Thereof - The present invention relates to the information security field and presents a method for searching for a class and a function based on a .NET card and a .NET card thereof. The method includes: building a first character string according to information of a class currently executed by the .NET card, or information of a function currently executed by the .NET card and a class that the function belongs to; computing a first index value from the first character string; searching for a first locator value corresponding to the first index value in an index table pre-stored in the .NET card, wherein index values in the index table are generated in the same way as the first index value is generated; finding, in a runtime library of the .NET card, the class or the function currently executed according to the first locator value. The .NET card includes a storage module, a building module, a computing module and a searching module. The invention improves the speed of searching for a class or a function when a program is executed in the .NET card. The index table consumes only a small part of the memory of the .NET card, so the method is convenient and easy to implement.11-19-2009
20090287672Method and Apparatus for Better Web Ad Matching by Combining Relevance with Consumer Click Feedback - A method and apparatus are provided for better web ad matching by combining relevance with consumer click feedback. In one example, the method includes receiving a query page, extracting features from the query page, re-weighting the query page, evaluating the query page in light of each ad in order to score each ad and pick substantially best ad matches of the indexed ads, and returning the substantially best ad matches to the consumer computer.11-19-2009
20080294635System for Conducting Searches on the World Wide Web Enabling the Search Requester to Modifying the Domain Context of a Search Responsive to an Excessive Number of Hits on Combinations of Keywords - The user requesting the search is enabled to analyze the list of excessive hits in a manner organized through a Web content manager on the user's display screen, and reduce the excessive hits through the elimination of extraneous domains or subdomains captured by the search.11-27-2008
20090089281COMMUNITY INFORMATION FILTER - A method and system to filter community information by rating members, rating their contributions, and evaluating the accuracy of predictions extracted from their contributions. These metrics can be a combination of subjective and objective factors. Subjective ratings can be weighted according to the ratings of those doing the rating.04-02-2009
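The last sentence of entry 20090089281, weighting subjective ratings by the ratings of the raters themselves, can be sketched as a weighted average. The weighted-average form and all names below are assumptions for illustration; the abstract does not give a formula.

```python
from typing import Dict, Optional


def weighted_rating(ratings: Dict[str, float],
                    rater_scores: Dict[str, float]) -> Optional[float]:
    """Average the ratings of one contribution, weighting each by its rater's own rating.

    ratings:      rater -> rating given to the contribution
    rater_scores: rater -> rating of that rater (used as the weight)
    Returns None when no rater carries any weight.
    """
    total_weight = sum(rater_scores.get(rater, 0.0) for rater in ratings)
    if total_weight == 0:
        return None
    weighted_sum = sum(rater_scores.get(rater, 0.0) * value
                       for rater, value in ratings.items())
    return weighted_sum / total_weight


# Hypothetical data: "ann" is a highly rated member, "ben" is not.
result = weighted_rating({"ann": 5.0, "ben": 1.0}, {"ann": 3.0, "ben": 1.0})
print(result)
```

Here the highly rated member's opinion pulls the result toward 5 rather than toward the unweighted mean of 3.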
20090089276SYSTEMS, METHODS AND COMPUTER PRODUCTS FOR A MONITORING CONTEXT GENERATOR - Systems, methods and computer products for generating calculation context classes from a relationship between structured data and a calculation procedure, the context classes having parent-child relationships. Exemplary embodiments include a method including searching the calculation procedure for a first data definition, generating a first context from a first scope applied to the first data definition, tracing back the calculation procedure to obtain a second data definition for calculating the first data definition and to which the first scope is applied, copying the calculation procedure into the first context until the second data definition is obtained, obtaining a second scope applied to the second data definition, obtaining a second context generated from the second scope, determining an existence of an order comparison of the first scope with the second scope and obtaining order from the structured data.04-02-2009
20090089274GRADIENT BASED OPTIMIZATION OF A RANKING MEASURE - Methods, systems, and apparatuses for generating relevance functions for ranking documents obtained in searches are provided. One or more features to be used as predictor variables in the construction of a relevance function are determined. The relevance function is parameterized by one or more coefficients. A query error is defined that measures a difference between a relevance ranking generated by the relevance function and a training set relevance ranking based on a query and a set of scored documents associated with the query. The query error is a continuous function of the coefficients and aims at approximating error measures commonly used in Information Retrieval. Values for the coefficients of the relevance function are determined that substantially minimize an objective function that depends on the defined query error.04-02-2009
20090089273SYSTEM FOR DETECTING ASSOCIATIONS BETWEEN ITEMS - A method of detecting associations between items can include identifying a plurality of items represented in a data repository from which to select items to recommend to a target user, each item including one or more attributes. A degree of fit between an item's attributes and other items is calculated. The degree of fit can indicate the relevance of the attributes of one item to the other item. A degree of association between the two items is calculated based at least in part on the calculated degree of fit. The degree of association between the two items can indicate the relatedness of the two items. Based on this degree of association, an association between the items can be stored in a data repository.04-02-2009
20090144269RESOLVING UNKNOWN MAILBOXES - The present invention relates to a method and system for processing electronic mails in case of an address change of the addressee. It provides an ECOA resolving process of searching for a new or alternative address for this addressee, optionally triggered in case of unknown address or non-delivery notification message. This process comprises forwarding the email under ECOA resolving from one MTA to another MTA in the network, for trying to reach an MTA connected with a database where old and new addresses are memorized in association. Such resolving forwarding is done according to specific routing tables, possibly independent from DNS routing server, which may include specific or local analysis rules based on the invalid address.06-04-2009
20080208838SYSTEM AND METHOD FOR DERIVING A HIERARCHICAL EVENT BASED DATABASE HAVING ACTION TRIGGERS BASED ON INFERRED PROBABILITIES - Inferring a probability of a first inference absent from a database at which a query regarding the inference is received. The query is used as a frame of reference for the search. The database returns a probability of the correctness of the first inference based on the query and on the data. An action trigger is executed responsive to at least one of a) the probability of the first inference exceeding a first pre-selected value, b) a significance of the inference exceeding a second pre-selected value, c) a rate of change in the probability of the first inference exceeding a third pre-selected value, d) an amount of change in the probability of the first inference exceeding a fourth pre-selected value, and e) combinations thereof. At least one of the probability of the first inference and the action trigger is stored in the database.08-28-2008
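The action-trigger conditions listed in entry 20080208838 can be read as a disjunction of four threshold checks. The sketch below assumes that reading; the `Thresholds` class, the way rate of change is computed (change divided by elapsed time), and the sample values are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Thresholds:
    """The four pre-selected values named in the abstract (field names assumed)."""
    probability: float
    significance: float
    rate_of_change: float
    amount_of_change: float


def should_trigger(probability: float, previous_probability: float,
                   elapsed: float, significance: float,
                   limits: Thresholds) -> bool:
    """Return True when at least one of conditions (a) to (d) is met."""
    amount = probability - previous_probability  # (d) amount of change
    rate = amount / elapsed                      # (c) rate of change, assumed per unit time
    return (probability > limits.probability
            or significance > limits.significance
            or rate > limits.rate_of_change
            or amount > limits.amount_of_change)


limits = Thresholds(probability=0.9, significance=0.8,
                    rate_of_change=0.05, amount_of_change=0.3)

# Probability rose from 0.4 to 0.6 over 2 time units: only the rate check fires.
fires = should_trigger(0.6, 0.4, 2.0, 0.1, limits)
# Probability rose from 0.45 to 0.5 over 5 time units: no check fires.
quiet = should_trigger(0.5, 0.45, 5.0, 0.1, limits)
print(fires, quiet)
```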
20080208849Methods for Identifying Audio or Video Content - The disclosed technology generally relates to methods for identifying audio and video entertainment content. Certain shortcomings of fingerprint-based content identification can be redressed through use of crowdsourcing techniques.08-28-2008
20080208844ENTERTAINMENT PLATFORM WITH LAYERED ADVANCED SEARCH AND PROFILING TECHNOLOGY - This disclosure provides various implementations for locating industry profiles representing members of an entertainment platform community. The software can query a plurality of industry profiles with a first set of search criteria associated with a target member of the entertainment platform community and generate a first cache of industry profiles based on the first set of search criteria, the first cache a subset of the plurality of industry profiles. Further, the software can query the first cache with a second set of search criteria, wherein the second set of criteria are mutually exclusive from the first set of criteria, and generate a second cache of industry profiles based on the second set of search criteria, the second cache a subset of the first cache of industry profiles. The software can then present information from at least one industry profile represented in the second cache to an interface.08-28-2008
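The layered search in entry 20080208844 amounts to querying the result of a first query with a second, non-overlapping set of criteria. A minimal sketch follows, assuming exact-match criteria on hypothetical profile fields (`role`, `city`); the abstract does not describe the matching logic.

```python
from typing import Dict, List

Profile = Dict[str, str]


def query(profiles: List[Profile], criteria: Dict[str, str]) -> List[Profile]:
    """Return the profiles whose fields equal every criterion (exact match assumed)."""
    return [p for p in profiles
            if all(p.get(field) == value for field, value in criteria.items())]


# Hypothetical industry profiles.
profiles = [
    {"name": "P1", "role": "actor", "city": "LA"},
    {"name": "P2", "role": "actor", "city": "NYC"},
    {"name": "P3", "role": "director", "city": "LA"},
]

first_criteria = {"role": "actor"}
second_criteria = {"city": "LA"}
# The abstract calls for mutually exclusive criteria sets; checked here on field names.
assert not set(first_criteria) & set(second_criteria)

first_cache = query(profiles, first_criteria)       # subset of all profiles
second_cache = query(first_cache, second_criteria)  # subset of the first cache

print([p["name"] for p in second_cache])
```

Because the second query runs only against the first cache, the second cache is a subset of the first by construction.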
20080208837Methods and apparatus for term normalization - Methods and data processing apparatus for normalization of mentions of subcellular entities, such as proteins and/or genes, in a natural language biomedical text document, in which the species of the individual mention of a subcellular entity is determined before an identifier is assigned to the individual mention of a subcellular entity and the identified species is taken into account when assigning an identifier to the said individual mention of a subcellular entity.08-28-2008
20080208848System and Method for Managing Bundle Data Database Storing Data Association Structure - A bundle database management system comprises a search server including a bundle definition unit for defining a core word and a relevant word connected to the core word, and connection relation between the core and relevant words to generate and store bundle data; a description definition unit for defining description data corresponding to the core and relevant words; a search request receiving unit for receiving a search request including a specific search word input by a user; a search result page generating unit for generating a search result page including the bundle data retrieved by the search word as a core word and the description data retrieved by the core word; and a search result page transmitting unit for transmitting the search result page to the user; and a user terminal connected to the search server for transmitting the search request and receiving the search result page.08-28-2008
20080208843DOCUMENT SEARCHING SYSTEM AND DOCUMENT SEARCHING METHOD - In a document searching system, a first storing apparatus, a second storing apparatus, and a document managing apparatus are connected to one another. The document managing apparatus stores structure information that shows a hierarchical structure regarding hierarchy positional relationships among the elements in the structured documents stored in the first and the second storing apparatuses. The document managing apparatus extracts an identical element that is a predetermined element in the structured documents stored in the second storing apparatus that matches the element in the structured documents stored in the first storing apparatus. The first storing apparatus stores the structured documents and conducts a search in the stored structured documents for one of the structured documents that contains the received identical element. The second storing apparatus conducts a search for one of the structured documents containing the identical element that matches the received text information.08-28-2008
20080208842APPARATUS AND METHOD FOR SELECTING AND PERFORMING AT LEAST ONE DATA FUNCTION - A method for displaying data items in a mobile terminal includes receiving a user search request, automatically identifying data items which individually comprise the search request, and displaying a distinct number in association with each of the identified data items, wherein each of the identified data items is individually selectable responsive to a corresponding number input by a user.08-28-2008
20080208839Method and system for providing information using a supplementary device - A method and system for providing access to information via a supplementary device is provided. User access to primary information via a primary device is monitored. Key information related to the primary content is obtained by extracting and analyzing metadata sources for the primary information. Then, supplementary information related to the primary information is obtained based on the key information. The supplementary information is provided for user access via the supplementary device.08-28-2008
20080208834Enhanced Search System and Method for Providing Search Results With Selectivity or Prioritization of Search and Display Operations - Application usage in a computing environment is monitored to record information that is indicative of what applications are most extensively or recently used, or otherwise preferred by the user. Applications (or data items of a data type of the application) are selected or prioritized over other applications (or data items) when a search operation is performed.08-28-2008
20080208832SYSTEM AND METHOD FOR DERIVING A HIERARCHICAL EVENT BASED DATABASE OPTIMIZED FOR PHARMACEUTICAL ANALYSIS - A computer implemented method, apparatus, and computer usable program code for inferring a probability of a first inference absent from a database at which a query regarding the inference is received. Each datum of the database is conformed to the dimensions of the database. Each datum of the plurality of data has associated metadata and an associated key. The associated metadata includes data regarding cohorts associated with the corresponding datum, data regarding hierarchies associated with the corresponding datum, data regarding a corresponding source of the datum, and data regarding probabilities associated with integrity, reliability, and importance of each associated datum. The query is used as a frame of reference for the search. The database returns a probability of the correctness of the first inference based on the query and on the data.08-28-2008
20090112837Proactive Content Dissemination to Users - A content repository of a system stores items of content to be disseminated to users. The content repository generates a content profile for each item of content as the item of content is received. The content profile for each item of content includes information regarding the item of content. A user repository of the system generates and stores a user profile for each user. The user profiles are generated from one or more information sources. The user profile for each user includes information regarding the user. A recommendation engine of the system determines which items of content should be delivered to each user based on the content profiles of the items of content and on the user profile of each user, to yield relevant items of content for each user. The recommendation engine then delivers the relevant items of content to each user.04-30-2009
20090112836Information Retrieval Apparatus and Method - An information retrieval apparatus includes a display unit which displays a first display menu showing a plurality of estimated meaning items and a second display menu showing a plurality of retrieval target items corresponding to the meaning items selected on the first display menu. A log storage unit stores a selection log on the first and second display menus. An estimation unit estimates a retrieval target item selection tendency from the selection log. A ranking unit ranks the plurality of retrieval target items on the basis of the selection tendency. The display unit displays a plurality of retrieval target items on the second display menu upon arranging them in accordance with ranking.04-30-2009
20090276414RANKING MODEL ADAPTATION FOR SEARCHING - Search results provided by a search engine (e.g., for the Internet) are improved and/or made more accurate by addressing the limited availability of human labeled training data for certain domains (e.g., languages other than English, within certain date ranges, corresponding to queries over a certain length, etc.). More particularly, a ranking model trained on in-domain data, for which a small amount of human labeled training data (e.g., query/URL pairs) is available (e.g., languages other than English) is adjusted based upon out-domain data, for which a large amount of human labeled training data (e.g., query/URL pairs) is available (e.g., English). Thus, even though the resulting adapted in-domain ranking model is used in the context of in-domain data (e.g., non-English) to provide search results, the search results are improved because they are influenced by an abundance of, albeit out-domain, human labeled training data.11-05-2009
20090276421Method and System for Re-ranking Search Results - A method for re-ranking search results includes: generating the search results from a data source based on a search query from a user; retrieving a re-ranking expression; re-ranking all or part of the documents in the search results based on the re-ranking expression; and displaying all or part of the documents in the search results in the re-ranked order.11-05-2009
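The re-ranking steps of entry 20090276421 can be sketched as follows, assuming the re-ranking expression is a function that maps a document to a sortable value and that "part of the documents" means the first `top_k` results. Both are assumptions for illustration; the abstract does not define the form of the expression.

```python
from typing import Callable, Dict, List, Optional

Document = Dict[str, object]


def rerank(results: List[Document],
           expression: Callable[[Document], float],
           top_k: Optional[int] = None) -> List[Document]:
    """Re-order the first top_k results by the expression, highest value first.

    Results beyond top_k keep their original order; top_k=None re-ranks everything.
    """
    k = len(results) if top_k is None else min(top_k, len(results))
    head = sorted(results[:k], key=expression, reverse=True)
    return head + results[k:]


# Hypothetical search results in their original order.
results = [
    {"id": "d1", "score": 0.9, "year": 2005},
    {"id": "d2", "score": 0.8, "year": 2009},
    {"id": "d3", "score": 0.7, "year": 2008},
    {"id": "d4", "score": 0.1, "year": 2010},
]

# Hypothetical re-ranking expression: prefer newer documents.
newest_first = rerank(results, lambda d: d["year"], top_k=3)
print([d["id"] for d in newest_first])
```

Only the first three results are reordered, so `d4` stays last even though it is the newest document.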
20090276419METHOD AND SYSTEM FOR IMPROVEMENT OF REQUEST PROCESSING - A system and method of processing a request including improving usage and/or performance of resources is disclosed. Information relating to a user request may be provided to one or more resources which process the information and provide a result. A result and/or other information may be provided to a human assistant or guide who may process information to produce a result and/or review a result(s). Information provided by a guide may be processed and provided to a resource, which may improve the performance of a resource. A resource(s) and/or a guide(s) may be selected and/or provided with activities based on ratings and/or rankings associated with a request, which may optimize usage of system resources. Information obtained may be provided for various purposes.11-05-2009
20090276424METHOD AND SYSTEM FOR KEYWORD MANAGEMENT - A keyword management system for managing keywords used when a user terminal connected to a network accesses contents, includes a Burst value calculating unit that calculates a Burst value indicating an increase per unit time of a keyword, an overall Burst value calculating unit that calculates an overall Burst value by correcting the Burst value based on characteristics in the contents of the keyword corresponding to the Burst value, and an output controlling unit that extracts from the contents, a relevant keyword related to the keyword corresponding to the overall Burst value based on time series changes of the overall Burst value, and outputs the keyword and the relevant keyword associated with each other to the user terminal.11-05-2009
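The two values named in entry 20090276424 can be sketched numerically. The abstract defines the Burst value only as an increase per unit time and the overall Burst value as a correction of it, so the difference quotient, the multiplicative correction, and the `content_weight` parameter below are assumptions.

```python
def burst_value(previous_count: int, current_count: int, interval_hours: float) -> float:
    """Increase in a keyword's occurrences per unit time (hours assumed as the unit)."""
    if interval_hours <= 0:
        raise ValueError("interval_hours must be positive")
    return (current_count - previous_count) / interval_hours


def overall_burst_value(burst: float, content_weight: float) -> float:
    """Correct the Burst value by a weight standing in for the keyword's
    characteristics in the contents (a hypothetical multiplicative factor)."""
    return burst * content_weight


# A keyword seen 10 times, then 40 times two hours later.
burst = burst_value(10, 40, 2.0)
overall = overall_burst_value(burst, 1.5)
print(burst, overall)
```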
20090276422Apparatus for matching subject data sets with query data sets - Apparatus for matching subject data sets with a query data set comprising data input means, data processing means, data storage means and data presentation means, 11-05-2009
20090276420Method and system for extending content - The present invention provides a method and system for extending content based on the semantic meaning of content. It divides content into multiple content regions and finds words and/or phrases that are semantically relevant to the current content region and appends these words and/or phrases to the current content region as extended content. The extended content matches semantically with the original content in such a seamless way that users may think it is a part of the content.11-05-2009
20090276417METHOD AND SYSTEM FOR DATABASE QUERY TERM SUGGESTION - A method for automatically providing a plurality of additional database query terms to a user is provided. The method comprises receiving a query term from a user, receiving a plurality of characters from the user in addition to the query term, and selecting a set of records from a database based on the query term, wherein the database comprises records, and the records comprise text translated from audio. The method also determines a plurality of additional query terms based on the plurality of characters, and, for at least one of the plurality of additional query terms, processes at least a portion of the set of records to determine a relevance of the additional query term. Finally, the method includes displaying at least one of the plurality of additional query terms to the user for selection, the display based on the relevance of the at least one of the plurality of additional query terms.11-05-2009
20090276411ISSUE TREND ANALYSIS SYSTEM - A system of analyzing a large document-based propensity over a query language is disclosed. In the system of analyzing the large document-based propensity over the query language, the correlated words and sentences on the query language inputted by the user are searched on the basis of large on-line or off line documents and the general report of analyzing the relationship among the words of the corresponding documents, the propensity of the words and the sentences, the appearance frequency of the recent words and sentences and so on is provided to the user, whereby it can previously predict the propensity (the positive image, the negative image or Non-Applicable), the related word based on the importance and the tendency change through the result of the large document analysis generating for a recent predetermined period according to the query language of the user.11-05-2009
20090282031LOOK-AHEAD DOCUMENT RANKING SYSTEM - A method and system is provided for calculating importance of documents based on transition probabilities from a source document to a target document based on looking ahead to information content of target documents of the source document. A look-ahead importance system generates transition probabilities of transitioning between any pair of source and target documents based on analysis of links to target documents of the source document. The system may calculate the transition probabilities based on the number of links on documents a look-ahead distance away. The system then solves for the stationary probabilities of the transition probabilities. The stationary probabilities represent the importance of the documents.11-12-2009
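The last step of the entry above, solving for the stationary probabilities of a set of transition probabilities, can be illustrated with a minimal sketch. The abstract does not say which solver is used; power iteration is assumed here, and the function name `stationary_distribution` and the 3-document matrix are hypothetical. The look-ahead construction of the transition probabilities themselves is not modeled.

```python
def stationary_distribution(transition, iterations=100):
    """Approximate the stationary probabilities of a row-stochastic matrix.

    transition[src][dst] is the probability of moving from document src
    to document dst. Power iteration is an assumption, not the patent's
    stated method.
    """
    n = len(transition)
    probs = [1.0 / n] * n  # start from a uniform distribution
    for _ in range(iterations):
        # One step of the chain: new_probs = probs * transition
        probs = [
            sum(probs[src] * transition[src][dst] for src in range(n))
            for dst in range(n)
        ]
    return probs


# Hypothetical transition matrix for three documents; each row sums to 1.
transition = [
    [0.0, 0.5, 0.5],
    [1.0, 0.0, 0.0],
    [0.5, 0.5, 0.0],
]
importance = stationary_distribution(transition)
```

For this example matrix the importance values settle near 4/9, 3/9 and 2/9, so the first document would rank highest.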
20090282017NETWORK-COMMUNITY RESEARCH SERVICE - A network-community research service includes a research module to receive a research query from a requesting member belonging to a network community. The research module is configured to answer the research query with a ranked list of research results at least partially prioritized based on network-community activities of non-requesting members.11-12-2009
20090282033Search Engine with Fill-the-Blanks Capability - A client system provides to a server system a fill-the-blank query comprising one or more term segments and one or more missing term identifiers signifying missing information sought by a user. The client system receives from the server system a response to the query, the response including at least one or more potential answers corresponding to the one or more missing term identifiers in the fill-the-blank query, and then displays the response to the query, including displaying the one or more potential answers. Optionally, the client system displays a ranked list of documents containing the one or more potential answers. Optionally, the response to the query further includes snippets of text from one or more documents containing the one or more potential answers. Optionally, the fill-the blank query includes a respective missing term identifier located between two respective term segments.11-12-2009
20090282020AUTO-SELECTION OF MEDIA FILES - Apparatus and methods to control selection of media content provide a mechanism to enhance user interaction with multimedia devices. Additional apparatus, systems, and methods are disclosed.11-12-2009
20090282030SOLICITING INFORMATION BASED ON A COMPUTER USER'S CONTEXT - A user search request is received and context information for the user is identified. The user search request and the context information are then combined to generate search criteria corresponding to the user search request, providing for information solicitation based on a computer user's context.11-12-2009
20090193013METHOD FOR STORING MESSAGES IN A DIRECTORY - A method, system, and computer usable program product for storing messages in a directory executing in a data processing system are provided in the illustrative embodiments. A message is received over a network and identified in the directory. A base message entry that corresponds to the message is selected in a hierarchy of entries in the directory. A message instance entry for the message is created, such that the message instance entry becomes a child entry of the base message entry in the hierarchy.07-30-2009
20080243826System and method for determining semantically related terms - Systems and methods for determining semantically related terms are disclosed. Generally, a semantically related term tool receives a seed set and identifies a plurality of terms that constitute the seed set. For each term of the seed set, the semantically related term tool identifies one or more concept terms associated with terms of the seed set other than the term being processed, determines a plurality of concept terms based on at least one of combinations and permutations of the concept terms associated with terms of the seed set other than the term being processed, and adds the resulting terms to a plurality of semantically related terms. The semantically related term tool removes invalid terms from the plurality of semantically related terms based on a language model and ranks at least a portion of the remaining terms of the plurality of semantically related terms based on a metric indicating a degree of semantical relationship between a term of the plurality of semantically related terms and one or more terms of the set seed.10-02-2008
20090282012LEVERAGING CROSS-DOCUMENT CONTEXT TO LABEL ENTITY - Entities, such as people, places and things, are labeled based on information collected across a possibly large number of documents. One or more documents are scanned to recognize the entities, and features are extracted from the context in which those entities occur in the documents. Observed entity-feature pairs are stored either in an in-memory store or an external store. A store manager optimizes use of the limited amount of space for an in-memory store by determining which store to put an entity-feature pair in, and when to evict features from the in-memory store to make room for new pairs. Features that may be observed in an entity's context may take forms such as specific word sequences or membership in a particular list.11-12-2009
20090287700QUERY EVALUATION USING ANCESTOR INFORMATION - Provided are techniques for processing a query. A query is received, wherein the query is formed by one or more paths, and wherein each path includes one or more steps. A hierarchical document including one or more document nodes is received. While processing the query and traversing the hierarchical document, one or more extraction entries are constructed, wherein each extraction entry includes a step instance match candidate identifying a document node and a step instance ancestor path for the document node, and one or more tuples are constructed using the one or more extraction entries by associating the step instance match candidate from one of the one or more extraction entries with the step instance match candidate from at least one of the one or more other extraction entries.11-19-2009
20090287698ARTIFICIAL ANCHOR FOR A DOCUMENT - Methods, systems, and apparatus, including computer program products, for linking to an intra-document portion of a target document include receiving an address for a target document identified by a search engine in response to a query, the target document including query-relevant text that identifies an intra-document portion of the target document, the intra-document portion including the query-relevant text. An artificial anchor is generated, the artificial anchor corresponding to the intra-document portion. The artificial anchor is appended to the address.11-19-2009
20090287696METHOD AND SYSTEM FOR NAVIGATING AND SELECTING MEDIA FROM LARGE DATA SETS - Some embodiments of the invention provide a method of accessing a data set. The data set includes a set of data elements. The method collects the data elements of the data set. The method receives a lens item. The lens item provides a set of parameters for searching the data set. The method searches the data set by using the lens item to identify a data subset. The method sorts a list of data elements based on the data subset. The sorting generates an ordered list. The method filters the data subset. Filtering the data subset comprises excluding the data elements that are not relevant to the lens item. The method presents the ordered list in a first column of a matrix. The matrix has several cells. The cells of the matrix are based on the data subset. The method selects column headings for the matrix and populates the cells of the matrix. Some embodiments provide a system for providing access to a data set. The system has a set of data elements that comprises a first data source. The system has a first device for collecting the set of data elements. The first device receives a first lens item for searching the data elements. The first device filters the data elements by using the first lens item to generate a first subset. The first device presents the first subset in a variety of views for navigation through the first subset.11-19-2009
20090287694Four Dimensional Search Method for Objects in a Database - Embodiments of the disclosure provide a method and system used for searching among a plurality of entities on a computer network by a user. A computer server in communication with the computer network can include a database with a storage mechanism, a rule set, and an interaction calculation engine. The user can search for a first entity using a location calculation engine in communication with the computer network. The location calculation engine can locate the first entity and determine and display at least a second portion of the plurality of entities relevant to the first entity.11-19-2009
20090287692INFORMATION PROCESSING APPARATUS AND METHOD FOR CONTROLLING THE SAME - An information processing apparatus includes a holding unit configured to hold a plurality of indices associated with each document information stored in the storage unit, wherein each of the indices includes history information describing user information about users who have accessed each document information, and a user ranking unit allocates ranks to users who have accessed the document information that have been accessed by the search user in the past based on the history information included in a plurality of the indices. An index search unit searches the index held by the holding unit based on a keyword specified by the search user using the input unit, and a document ranking unit allocates ranks to the document information associated with the retrieved index, based on the user information about the access users in the index retrieved by the index search unit, and the user information about the users ranked by the user ranking unit.11-19-2009
20090287691PRESENTATION OF QUERY WITH EVENT-RELATED INFORMATION - In an embodiment, a method is provided for presenting a query directed at an information resource. In this method, a number of queries is accessed over a time period. A burst of the number of queries is detected within the time period. It should be noted that a burst is a determinable increase in the number of queries received within the time period relative to a historical number of queries received in a preceding time interval. Event-related information that is associated with the burst in the time is searched, and the query in conjunction with the event-related information is displayed at a display unit.11-19-2009
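The burst definition in the entry above (an increase in query count relative to a historical count from a preceding interval) can be sketched as follows. This is only an illustration: the function name `is_burst`, the use of a mean as the historical baseline, and the `ratio` threshold of 2.0 are assumptions, not details from the abstract.

```python
def is_burst(current_count, historical_counts, ratio=2.0):
    """Return True if current_count exceeds the historical baseline by `ratio`.

    The baseline is the mean of the preceding intervals' query counts;
    both the mean and the ratio threshold are illustrative assumptions.
    """
    if not historical_counts:
        return False  # no history, so no basis for calling a burst
    baseline = sum(historical_counts) / len(historical_counts)
    # Floor the baseline at 1 so a near-zero history does not flag every query.
    return current_count > ratio * max(baseline, 1.0)


# Hypothetical counts: about 10 queries per interval historically.
history = [10, 12, 8]
burst_now = is_burst(50, history)
quiet_now = is_burst(15, history)
```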
20090287690SUPPORT FOR INTERNATIONAL SEARCH TERMS - A search engine server supports delivery of search results using an international search string option by identifying websites that provide support in English as well as the language of the international search string. The international search string is a search string in any of the languages that are listed/supported by the search engine server. The search engine server delivers web links of websites that provide support in both English and the language of the international search string by identifying conjugate English terms, strings or phrases for the international search string, that provide exact or approximate equivalent meaning for searching. In addition, the search engine server also provides web links of websites that provide international language support by utilizing a thesaurus in English that provides synonyms for the conjugate English terms. The search engine server also translates websites where there is no support in the language of the search string.11-19-2009
20090287687SYSTEM AND METHOD FOR RECOMMENDING VENUES AND EVENTS OF INTEREST TO A USER - A system and method is disclosed for recommending venues and events to individual users using a combination of collaborative filtering and integrating social behavioral pattern data gathered and computed via an electronic device. The system and method of the present invention is configured to receive data based on users' past, present and future social activity and interests, which are submitted to the system via an electronic device. When a new data item is made available from sources such as a mobile device, social networks or GPS systems, the system and method analytically breaks down the new item data, compares it to ascertained attributes of item data that a user (i.) indicated interest to in the past, (ii.) has a friend or related network of users that indicated interest in the venue or in an event in the past, and (iii.) indicated interest in the event or venue based upon general social statistics such as male to female ratio, age, and other demographics gathered and computed by the system. The system generates the recommendations using a previously-generated table which maps items to lists of “similar” items thereby making an audience-specific, time-specific and location-specific social recommendation.11-19-2009
20090287684HISTORICAL INTERNET - An Internet infrastructure that supports a timed window search service comprising a search server. The search server receives a search string from a client device and has access to a historical data repository from where different content can be provided for the search based on date/time inputs. The search server includes various modules for web crawling and reverse indexing various search keywords. The search server receives the search string along with certain user-defined criteria such as search in a geographical region or search within browser favorite lists. The search server performs the search operation and delivers the result to the client device. The search server can also retrieve the timed window data and deliver correlating content to the client device. The historical data repository comprises an indexing module, a version manager, and a time-based retrieving module that facilitate searching historical timed window data.11-19-2009
20090287680MULTI-MODAL QUERY REFINEMENT - A multi-modal search query refinement system (and corresponding methodology) is provided. In accordance with the innovation, query suggestion results represent a word palette which can be used to select strings for inclusion or exclusion from a refined set of results. The system employs text, speech, touch and gesture input to refine a set of search query results. Wildcards can be employed in the search either prompted by the user or inferred by the system. Additionally, partial knowledge supplemented by speech can be employed to refine search results.11-19-2009
20090287678System and method for providing answers to questions - A system, method and computer program product for providing answers to questions based on any corpus of data. The method facilitates generating a number of candidate passages from the corpus that answer an input query, and finds the correct resulting answer by collecting supporting evidence from the multiple passages. By analyzing all retrieved passages and that passage's metadata in parallel, there is generated an output plurality of data structures including candidate answers based upon the analyzing. Then, by each of a plurality of parallel operating modules, supporting passage retrieval operations are performed upon the set of candidate answers, and for each candidate answer, the data corpus is traversed to find those passages having the candidate answer in addition to query terms. All candidate answers are automatically scored using the supporting passages by a plurality of scoring modules, each producing a module score. The module scores are processed to determine one or more query answers; and, a query response is generated for delivery to a user based on the one or more query answers.11-19-2009
20090287676SEARCH RESULTS WITH WORD OR PHRASE INDEX - Disclosed are apparatus and methods for providing a word or phrase index regarding a particular set of search results. In specific embodiments, a word or phrase index for summarizing the words or phrases (or a subset of same) within the particular search results may be determined. This index may be similar to the inverted index used by some search engines so that each of a plurality of words or phrases are associated with a plurality of search results (e.g., web pages and/or their cached copies) that contain such each word or phrase. The index is determined based on the search results, and the index for the search results is then provided along with the search results. The entries of the provided search result index are preferably selectable so that a user can access the search results that contain at least one of the listed word or phrase in the index.11-19-2009
20090287673RANKING VISUALIZATION TYPES BASED UPON FITNESS FOR VISUALIZING A DATA SET - Technologies are described herein for ranking visualization types. In order to rank the visualization types, visualization metadata is generated for each of the visualization types and data set metadata is generated for the data set. A suitability score is then computed based upon the visualization metadata and the data set metadata through the use of data mapping rules and chart selection rules. The visualization types are then ranked according to the computed scores. A user interface may then be displayed that includes visual representations corresponding to the visualization types that are ordered according to the ranking. One of the visual representations may then be selected to apply the corresponding visualization type to the data set.11-19-2009
20090287679Evaluation of tamper resistant software system implementations - According to one embodiment of the present invention, a method for evaluating a software system includes defining a rating of the tamper resistance of a software system and breaking down the rating into a plurality of metrics relevant to the tamper resistance of the software system. A score may then be calculated for each metric and the scores may be combined into a composite score for the rating.11-19-2009
20090177641PATIENT MONITORING NETWORK AND METHOD OF USING THE PATIENT MONITORING NETWORK - In one embodiment, a method of using a patient monitoring network is provided. The method comprises steps of storing an association data at a primary server unit, the association data mapping a caregiver with at least one patient associated with the caregiver, storing an object data of at least one patient at a patient monitor coupled to the primary server unit, the object data comprising an identification data and a patient data of the patient, receiving a query at the patient monitor by the caregiver, fetching the association data of the caregiver from the primary server unit, displaying the identification data of at least one patient associated with the caregiver at the patient monitor, obtaining a selection for the patient at the patient monitor and displaying the patient data of the patient at the patient monitor.07-09-2009
20080275862SPECTRAL CLUSTERING USING SEQUENTIAL MATRIX COMPRESSION - A clustering system generates an original Laplacian matrix representing objects and their relationships. The clustering system initially applies an eigenvalue decomposition solver to the original Laplacian matrix for a number of iterations. The clustering system then identifies the elements of the resultant eigenvector that are stable. The clustering system then aggregates the elements of the original Laplacian matrix corresponding to the identified stable elements and forms a new Laplacian matrix that is a compressed form of the original Laplacian matrix. The clustering system repeats the applying of the eigenvalue decomposition solver and the generating of new compressed Laplacian matrices until the new Laplacian matrix is small enough so that a final solution can be generated in a reasonable amount of time.11-06-2008
20080275866DOMAIN INDEPENDENT SYSTEM AND METHOD OF AUTOMATING DATA AGGREGATION AND PRESENTATION - A domain independent system and method for automatically generating at least one presentation-ready report from either detailed or summarized data from database queries. The process of getting useful information requires querying a database to get detailed records, then meaningfully aggregating detailed data based on user experience and business needs and, finally, presenting the data using appropriate reports. This process of data aggregation and presentation can be automated and is accomplished by aggregating detailed data using domain metrics selected based on predefined and configurable rules or past usage; selecting one or more presentations based on predefined and configurable rules or past usage; and displaying one or more presentations based on device constraints and characteristics.11-06-2008
20080275873METHOD OF ENHANCING EMAILS WITH TARGETED ADS - A computer method and system for intercepting email messages, scanning the email messages for key words, determining whether the key words match or relate to key words determined to relate to advertising content, and enhancing the email message by routing the emails to recipients in a manner so that highly relevant, highly targeted advertising tag lines or other content are displayed together with the emails when the emails are accessed and viewed by email recipients.11-06-2008
20080275871SYSTEMS AND MEDIA FOR UTILIZING ELECTRONIC DOCUMENT USAGE INFORMATION WITH SEARCH ENGINES - Systems and media for utilizing electronic document usage information are disclosed. More particularly, hardware and/or software utilizing electronic document usage information to respond to user search requests with search engines are disclosed. Embodiments include receiving a search request from a requesting user and receiving document utilization information associated with one or more electronic documents, where the document utilization information provides an indication of the usage of the electronic documents by one or more users. Further embodiments include generating search results based at least partially on the search request and the document utilization information and transmitting an indication of the search results to the requesting user. Further embodiments include generating statistical information regarding the search results for electronic documents and transmitting the generated statistical information.11-06-2008
20080275869System and Method for A Digital Representation of Personal Events Enhanced With Related Global Content - There is provided a system and method for creating a digital representation of personal events enhanced with related global content. Personal event frameworks are selectable by the user and are provided with insertion points to receive media items therein and are structured to complement type of personal event. Media items are automatically reformatted as necessary. The instant invention allows the user to enhance the event framework by inserting related global content, wherein global content stands for media items matching either the date, location or event type. The additional content is provided from local sources or over the Internet.11-06-2008
20080275867System and Method for Presenting Content to a User - Assisting a user in locating particular content of interest from a collection of content including associated feature values and corresponding features. A user selects one of the plurality of feature values characterizing the collection of content and filters the content using the selected filtering feature value. The system groups the filtered collection using a grouping feature. The grouping feature may be associated with the user-selected filtering feature value and/or may be determined from the feature values of the filtered collection. The process of filtering/grouping may be repeated as many times as needed to locate the particular content of interest.11-06-2008
20080275865SEARCHING AND RANKING CONTACTS IN CONTACT DATABASE - In one aspect, a method may include receiving a request from a first mobile device for a search for contacts meeting a criterion; searching a database of contacts for the contacts meeting the criterion and including the contacts meeting the criterion in search results; determining whether a second mobile device associated with one of the contacts meeting the criterion is within a vicinity of the first mobile device; and ranking the search results based on the determination. In another aspect, the method may further include determining a database distance between a reference contact and each of one or more of the search results, where the request is associated with a reference contact in the database of contacts; and ranking the search results based on the determined database distance; where the database of contacts may include a plurality of subsets of contacts, where contacts in the subsets are linked to another contact in the database of contacts; where determining a database distance between the reference contact and each of the one or more of the search results may include determining the number of links between the reference contact and each of one or more of the search results.11-06-2008
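The "database distance" in the entry above is described as the number of links between a reference contact and a search result. A minimal sketch of that idea, assuming the links form an undirected graph stored as an adjacency dictionary and that the shortest chain of links is what gets counted, is a breadth-first search. The names `database_distance`, `rank_by_distance`, and the sample contacts are hypothetical.

```python
from collections import deque


def database_distance(links, reference, target):
    """Count the fewest links between two contacts, or None if unconnected."""
    if reference == target:
        return 0
    seen = {reference}
    queue = deque([(reference, 0)])
    while queue:
        contact, dist = queue.popleft()
        for neighbor in links.get(contact, ()):
            if neighbor == target:
                return dist + 1
            if neighbor not in seen:
                seen.add(neighbor)
                queue.append((neighbor, dist + 1))
    return None


def rank_by_distance(links, reference, results):
    """Order search results by distance; unconnected contacts go last."""
    def key(contact):
        d = database_distance(links, reference, contact)
        return float("inf") if d is None else d
    return sorted(results, key=key)


# Hypothetical contact links (assumed symmetric).
links = {
    "ann": ["bob"],
    "bob": ["ann", "cara"],
    "cara": ["bob", "dev"],
    "dev": ["cara"],
    "eve": [],
}
ranked = rank_by_distance(links, "ann", ["dev", "eve", "bob", "cara"])
```

Placing unconnected contacts last is a design choice of this sketch; the abstract does not say how they are handled.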
20080275864ENABLING CLUSTERED SEARCH PROCESSING VIA TEXT MESSAGING - Methods and apparatus for searching data, grouping search results into categories that are ordered according to search relevance, and reviewing the search results via text messaging. In one embodiment, a search term is submitted via a search request text message to a short code for a search service. The search service searches for content based on the search term and context data, such as location of a submitting client device. The search results are clustered into categories and ranked by relevance to the search term and context within each category. The categories are also ranked relative to each other. The most relevant search result from the most relevant category is transmitted in an initial result text message, which also includes instructions to access additional results via subsequent text messages. Each result text message also includes a link to a web page of categorized search results for display in a browser.11-06-2008
20080275863SELECTING ADVERTISEMENTS BASED UPON SEARCH RESULTS - Computerized methods and systems for selecting advertisements for presentation based, at least in part, upon search-result items are provided. Upon receiving a search query, search-result items (e.g., uniform resource locators (URLs)) satisfying the search query are determined. The determined search-result items are then compared with search-result criteria associated with advertisements to determine if the search-result criteria are satisfied. The determination whether or not the search-result criteria associated with an advertisement is satisfied is then utilized to determine whether the advertisement is selected for presentation in association with the search-result items. Advertisements selected for presentation may be ranked relative to one another based upon a bid amount associated therewith, with only a pre-determined number of advertisements ultimately being displayed in association with the search-result items.11-06-2008
20090063470DOCUMENT MANAGEMENT USING BUSINESS OBJECTS - A computer-implemented method for processing information includes collecting data objects from one or more data repositories, the data objects having respective properties, which identify the data objects. The properties of the collected data objects are analyzed in order to derive respective identifiers corresponding to the data objects. A text string that matches one of the identifiers of a data object is identified within a context in a document. Responsively to the context, an indication that the identified text string is a valid instance of the data object is generated, and the document is processed responsively to the indication.03-05-2009
20100042608SYSTEM FOR OBTAINING RECOMMENDATIONS FROM MULTIPLE RECOMMENDERS - A personalization network service enables developers to develop recommenders that can be made available to content site operators for providing recommendations to end users. The personalization network service may also be capable of optimizing the use and selection of the recommenders for different end users, groups or segments of end users, content sites, and the like.02-18-2010
20090037405ADAPTIVE CURSOR SHARING - Techniques for sharing cursors are provided. When a new query is issued, a database server determines whether the new query is semantically equivalent to a previous query. If so, then database server computes statistics associated with the new query. Based on the statistics, the database server determines whether compiling the new query would produce an execution plan that satisfies certain criteria. If so, then the cursor is used to execute the new query. In another approach, one cursor sharing technique (CST) is used to determine which cursor to use to execute a first set of semantically-equivalent queries. Statistics are gathered during execution of the first set of queries. The database server determines, based on the statistics, when to switch from using the first CST to a different CST. The different CST is used to determine which cursor to use to execute a second set of queries that are semantically-equivalent to the first set.02-05-2009
20080243820Semantic analysis documents to rank terms - A method, apparatus and computer program product provides for a semantic analyzer to produce and rank semantic terms to reflect their relationship to the theme and topics of a document. The text and the document can have no relationship to any pre-selected keywords before the semantic analyzer performs text extraction. The semantic analyzer extracts text from a document and performs semantic analysis on the extracted text. The semantic analyzer provides a plurality of ranked semantic terms as a result of the semantic analysis and associates semantic terms with the document as semantic keywords. The semantic terms define content to be presented with the document where the content is an advertisement, a link to a remote information resource or a second document.10-02-2008
20080243819SEARCH MACRO SUGGESTIONS RELEVANT TO SEARCH QUERIES - Search macro suggestions are provided to refine a user's search. When a search query is received from an end user, one or more search macros are determined to be relevant to the search query. The search macros are then provided to the end user as suggestions for refining the user's search. In some instances, the end user may choose to select one of the suggested search macros. A search is then performed using the search query and the selected search macro to provide search results to the end user that may be more relevant to the user's search.10-02-2008
20080243831INFORMATION PROCESSING APPARATUS, INFORMATION PROCESSING SYSTEM, AND STORAGE MEDIUM - A first information processing apparatus includes a registration unit that receives, from an information processing apparatus, information of a derivation relationship in which a first document is a parent and a second document generated as a result of an operation performed with respect to the first document is a child and registers the information in a storage unit, and a search result output unit that, in accordance with a search instruction that specifies a designated document and a search condition including a condition that defines a derivation relationship that should exist between the designated document and a search subject document, outputs, as a search result, information concerning a document that satisfies the search condition among documents included in derivation relationships registered in the storage unit.10-02-2008
20080250015CORPUS EXPANSION SYSTEM AND METHOD THEREOF - A system and method are provided for expanding new sample seeds to automatically expand corpora, in which sample seeds are used to collect corpora. The new sample seeds are generated based on the existing sample seeds and the collected corpora. The corpus expansion strategy is determined based on all the sample seeds that have been used and the new sample seeds. The new sample seeds are refined based on the corpus expansion strategy, and the refined new sample seeds are used to further collect corpora. The above steps are repeated until a predefined condition is satisfied. According to the invention, corpora may be automatically expanded from the web or other resources at low cost and in a convenient way to improve the coverage of the corpora.10-09-2008
20080250014Data search method and apparatus for same - A data search apparatus performs a data search by using a search string that has a space character included therein with a calculation of the number of target search data prior to the data search, on an assumption that at least two portions in the search string are used in an AND search. If the calculated number of the target search data exceeds a predetermined value, the space in the search string is excluded and the search string is considered as a single keyword for the data search. The data search by the above-described method yields a preferable search result in the data search apparatus having the AND search capability, because the space in the search string is excluded from an actual search string on a predetermined condition.10-09-2008
20080250010Method and system for determining and pre-processing potential user queries related to content in a network - A framework identifies data that a user would likely be interested to access, then extracts and stores such data for the user to efficiently use when desired. Thereby, the framework allows the user to access several types of information efficiently, without the user having to request the information.10-09-2008
20080250009ASSESSING MOBILE READINESS OF A PAGE USING A TRAINED SCORER - A method and system for ranking pages of a search result based on the mobile readiness of the pages is provided. A mobile-readiness system receives an indication of pages that are to be ranked. The mobile-readiness system evaluates the mobile readiness for each of the pages. Mobile readiness indicates suitability of the page for a mobile device. The mobile-readiness system then ranks the pages based on the generated mobile readiness and some other criterion such as a relevance score or an importance score. The mobile-readiness system may train a classifier to classify pages based on their mobile readiness.10-09-2008
20080250007Document Characteristic Analysis Device for Document To Be Surveyed - An index term extraction device including: input means (10-09-2008
20080256064Pay per relevance (PPR) method, server and system thereof - The present invention relates to a server, system and method of pricing advertisements to be presented within a document, according to their relevance to a user's search query, comprising: (a) receiving from an advertiser at least one keyword for which the advertiser's advertisement is to be presented to said user, or receiving and processing said user's search query that contains at least one keyword; (b) determining the relevance weight of said advertisement to said at least one keyword, according to at least one predefined parameter; and (c) pricing said advertisement according to the determined relevance weight.10-16-2008
20080256055Word relationship driven search - Various technologies and techniques are disclosed for performing searches based upon word relationships. A search term is received from a user in the form of at least one primary word. A data store is searched to determine if the primary word is associated with any content. If so, at least one reference to primary content that has been pre-defined as being related to the primary word is included in the search result. At least one reference to secondary content is included in the search result, if secondary content is found. Secondary contents are those that contain words that have been pre-defined as being related to the primary word. The search results are displayed to the user, with the primary references displayed along with the secondary references, if applicable, in a hierarchical fashion.10-16-2008
20080235203Information Retrieval - A method and apparatus are provided for accessing a relevant information resource in response to a user query. An ontology is provided, defining relationships between a plurality of predefined concepts, and between each of the predefined concepts and one or more predefined context phrases. On receipt of a user query, portions of the received query are compared with the context phrases to identify one or more matching phrases and hence, from the predefined relationships with concepts in the ontology, one or more relevant concepts. Concepts identified in respect of the received user query are used to identify a relevant action using predefined relationships between concepts in the ontology and predefined actions, wherein an action comprises providing access to an information resource.09-25-2008
20090063456METHOD AND SYSTEM FOR TRACKING, EVALUATING AND RANKING RESULTS OF MULTIPLE MATCHING ENGINES - The present invention in various implementations provides for a method and system for removing suspect duplicate data in a database having a plurality of datasets for a suspect processing transaction, using a plurality of matching engines, comparing results of matching engines in a logically predetermined comparative assessment, and thereafter providing an ordered priority of results of matching engines to identify suspect candidates.03-05-2009
20090063445System and Method for Handling Indirect Routing of Information Between Supernodes of a Multi-Tiered Full-Graph Interconnect Architecture - A method, computer program product, and system are provided for selecting, from a plurality of routes through the data processing system, an indirect route for transmitting data. Data that includes address information is received at a first processor that is to be transmitted to a destination processor. Using routing table data structures, indirect route entries are identified that correspond to indirect routes for transmitting data. An accessed priority table data structure comprises a priority entry for each entry in the routing table data structures. The priority entry specifies a priority of a corresponding entry in the routing table data structures. An indirect route entry is selected that corresponds to an indirect route from the routing table data structures, based on specified priorities. Then the data is transmitted from the first processor to the destination processor using a path corresponding to the selected indirect route entry.03-05-2009
20080228763Fast Source File to Line Number Table Association - A mechanism is provided in a debugger for building a file information database while significantly reducing debug startup time. For each line number table, the mechanism of the present invention reads the header section and determines all the source files that contribute to the line number table. The mechanism also makes note of the line number table offset. The mechanism then inserts the source filename into the file information database. In one preferred embodiment, the file information database is implemented as a hash table. Searching time occurs during an interactive debug session; therefore, the searching time is not easily detectable to a user, thus creating the perception of a faster interactive debugging session.09-18-2008
20080228761Contextual data mapping, searching and retrieval - An example method is illustrated as including receiving a first content set, the first content set organized according to a rules set, using the rules set to parse the first content set to generate a first pattern set having a plurality of members, assigning a weighted value to each member of the first and a second pattern set based on a frequency of occurrence of each member in the first and second pattern sets, wherein each member of the first and second pattern sets includes digital content, and determining a relevancy score linking each of the members of the first and second pattern set in a one to one mapping of the members of the first pattern set to each of the members of the second pattern set, wherein the relevancy score is based upon the weighted value assigned to each member of the first and second pattern sets.09-18-2008
20080228758AD SPONSORS FOR MOBILE DEVICES BASED ON DOWNLOAD SIZE - The present invention relates to a method and system for ranking search results and is particularly, but not exclusively, suited to providing search results when the delivery of data corresponding to the search results is metered, such as when data are delivered to terminals connected to mobile networks.09-18-2008
20080228755POLICY CREATION SUPPORT METHOD, POLICY CREATION SUPPORT SYSTEM, AND PROGRAM THEREFOR - A policy creation support system is provided, which is capable of reducing cost for creation of an effective autonomic control policy. The policy creation support system measures information indicating performance of a monitoring target system, with regard to a monitoring item of a designated type, for each resource amount of an expandable resource, and selects one representative measurement value from among measurement values on a resource-amount basis. Then, the policy creation support system outputs the monitoring item, the resource amount serving as the monitoring item, and a range of the measurement value corresponding to the resource amount, as a countermeasure decision condition within a policy, by setting a range including the selected representative measurement value as the range of the measurement value.09-18-2008
20080228753Determining Attribute Associations Using Expanded Attribute Profiles - A method and system for determining attribute associations are presented in which primary attributes in an attribute profile are used to derive secondary attributes which are added to the attribute profile to create an expanded attribute profile. A statistical association between a query attribute and attribute combinations occurring in the expanded attribute profile is determined.09-18-2008
20080228752Technical correlation analysis method for evaluating patents - A technical correlation analysis method for evaluating patents includes the steps of: entering a technical term to determine a user-defined search, and to obtain a set of patent documents as search results; counting the number of times patent references are cited in the patent documents, so as to generate a correlation index of the cited patent references; and ranking the patent references according to the correlation index of the cited patent references.09-18-2008
20080228749Automatic tagging of content based on a corpus of previously tagged and untagged content - An automated mechanism for tagging media files such as podcasts, blog entries, and videos, for example, with meaningful taxonomy tags. The mechanism provides active (or automated) assistance in assigning appropriate tags to a particular piece of content (or media). Included is a system for automatic tagging of audio streams on the Internet, whether from audio files, or from the audio tracks of audio/video files, using the folksonomy of the Internet. The audio streams may be provided by the media author. For example, the author can make a recording to be posted on a website, and use the system to automatically suggest (via prompted author interaction) folksonomically appropriate tags for the media recording. Alternatively, the system can be used in an automated fashion to develop and assign tags without any intervention by the author.09-18-2008
20090119286Method and Apparatus for Utilizing User Feedback to Improve Signifier Mapping - An apparatus for finding resources on a network comprises a finder server having access to: (a) a database including: (i) an index of resources available on a network of interconnected computers on which a plurality of resources reside; and (ii) information regarding user feedback gathered from previous operations of the apparatus by a user and plural previous users; and (b) a learning system operable to access and learn from information contained on the database. The finder server is operable to locate, in response to entry by the user of a resource identity signifier, a single intended target resource intended by the user to uniquely correspond to the resource identity signifier, from among a plurality of resources located on the network, by: receiving a resource identity signifier from the user; accessing the database to determine, based on the information in the database, which, if any, of the indexed resources is likely to be the intended target resource; and directing a computer of the user so as to cause that computer to connect the user to the address of the resource, if any, determined as likely to be the intended target resource.05-07-2009
20090006355GLOBAL RESOURCE METHOD AND SYSTEM - A global resource method and system. The method includes associating by a computing system, groups of suppliers with geographical areas. The computing system receives a selection of a first skill from a requester. The requester is located within a first geographical area. The computing system receives geographical area specification data associated with the first skill. The computing system receives a selection of a first work location associated with the first skill. The computing system receives a selection of a group of suppliers comprising a first supplier associated with the geographical area specification data. The computing system generates a service request document comprising the selection of said first skill. The computing system transmits the service request document to the group of suppliers.01-01-2009
20080281812METHOD AND SYSTEM FOR IDENTIFYING EXPERTISE - A system and method of identifying entities having expertise in one or more subjects. Among other features, the method includes querying a database for documents (e.g., articles, papers, periodicals) relevant to a subject; and calculating a first score for each relevant document. The method also identifies entities affiliated with one or more relevant documents; and calculates a second score for each entity based on the one or more first scores of the one or more relevant documents affiliated with the entity. A step of ranking expertise of the entities based on the respective second scores of the entities is included.11-13-2008
20080281809Automated analysis of user search behavior - Automated analysis of user search behavior is provided. Data on user searches is maintained in a user search database. Relevance factors are determined for each search result included in a given search session where the relevance factors provide an indication of user satisfaction with particular search results included in the session. The relevance factors for each search result are analyzed by a relevance classification module for classifying each search result in terms of its relevance to an associated search query. The result of the relevance classification may assign a relevance classification and associated confidence level to each analyzed search result as to whether the search result is acceptable, unacceptable or partially acceptable relative to the search query that resulted in the search result. Relevance classifications for each analyzed search result may be stored for future use, for example, for diagnostic analysis of the operation of a given search mechanism.11-13-2008
20080281803Method of Transmitting Content With Adaptation of Encoding Characteristics - The invention proposes a method of transmitting a multimedia content from a server to a client device upon request of the client device, said method allowing adaptation of the characteristics of the encoder used for encoding the multimedia content based on the network transmission rate and/or client preference or preferences. The method of the invention consists in encoding the content with various encoder characteristics, thereby providing several encoded multimedia contents, slicing the encoded multimedia contents, thereby providing a plurality of file-based contents, downloading the content file by file while switching from one encoded multimedia content to another so as to change the encoding characteristics, thereby adapting to the network transmission rate and/or client preferences.11-13-2008
20080281814CONSTRUCTION OF TRAINABLE SEMANTIC VECTORS AND CLUSTERING, CLASSIFICATION, AND SEARCHING USING A TRAINABLE SEMANTIC VECTOR - An apparatus and method are disclosed for producing a semantic representation of information in a semantic space. The information is first represented in a table that stores values which indicate a relationship with predetermined categories. The categories correspond to dimensions in the semantic space. The significance of the information with respect to the predetermined categories is then determined. A trainable semantic vector (TSV) is constructed to provide a semantic representation of the information. The TSV has dimensions equal to the number of predetermined categories and represents the significance of the information relative to each of the predetermined categories. Various types of manipulation and analysis, such as searching, classification, and clustering, can subsequently be performed on a semantic level.11-13-2008
20080281813SYSTEM AND METHOD FOR SEARCHING AND RETRIEVING RELATED MESSAGES - A system and method is provided which utilizes a threading service to offer enhanced features for a document management system including an email system. Various enhanced email features may be provided through one or more of the following components: a delete module, a reply module, a profile module, and a search module. The delete module enables a user to delete a selected message, a set of related messages, or the whole set except for the selected message. The reply module enables a user to send a reply message to all addresses associated and involved with an entire set of related messages. The profile module enables a dynamic interest profile to contain all relevant information from an outgoing message and a set of messages related to the outgoing message. The search module enables search results to include documents which match the user's query as well as documents related to the documents which match the user's query.11-13-2008
20080281811Method of Obtaining a Representation of a Text - A method of obtaining a data file (11-13-2008
20090265346System and Method for Retrieving and Organizing Information from Disparate Computer Network Information Sources - A computer implemented method for accessing information from a set of searchable information sources includes analyzing a search query to determine subject matter of the query. A subset of information sources is selected from the set of information sources based upon the subject matter of the query. Analyzing utilizes at least two different criteria for deriving the subject matter of the query. One criterion includes comparing the search query against a set of entity lists. Another criterion includes comparing the search query against a knowledge-base.10-22-2009
20090265345SYSTEMS AND METHODS FOR GENERATING USER SPECIFIED INFORMATION FROM A MAP - An embodiment relates generally to a method of generating user-specified information. The method includes receiving a plurality of points selected on a map to form a first continuous line having one or more vertices. The method also includes generating a closed polygon having a plurality of edges, where at least one edge forms a second continuous line substantially parallel to and spaced apart at a distance from the first continuous line. The method also includes determining a plurality of coordinate pairs each associated with a point on the plurality of edges of the closed polygon and retrieving user specified information for an area enclosed by the plurality of coordinate pairs.10-22-2009
20090265344DOCUMENT PROCESSING DEVICE AND DOCUMENT PROCESSING METHOD - An object of the present invention is to provide a document processing device and document processing method that can provide a search result satisfactory to a user with respect to WWW documents in which a number of links among WWW documents is low and a number of accesses by users is low. An access pattern collection unit 10-22-2009
20090265342AVOIDING MASKED WEB PAGE CONTENT INDEXING ERRORS FOR SEARCH ENGINES - Multiple non-host client sites provide cached user copies of web pages and/or web content, or summaries thereof, to a server. Obtaining data from non-host sources for indexing purposes avoids masked web page content indexing errors for search engines. The server aggregates, summarizes and indexes the web pages and/or web content in an index of cached content, in conjunction with updating, generating and storing a search index using an indexing agent such as a web crawler or spider. In response to receiving search requests from end users, the search engine uses comparisons between the index of cached content and the index of crawled content to identify potential page masking errors for specific search results and appropriately rank or omit results with a high risk of masking errors in a search result list.10-22-2009
20090265341System and method for assisting user searches in support system - The invention concerns a method for assisting user searches in a support system and a system for performing the method which comprises the steps of providing a support data structure with nodes comprising support information of a support database, and providing at least one behavioral data structure comprising information about the time the user(s) spend at said nodes, and information about the transition probabilities between each upper node and its lower nodes, and calculating for each lower node that is located below a current node, navigated to by a user of the support data structure, the expectation value of the time gained by navigating directly to that lower node, and selecting at least one of the lower nodes based on said expectation value.10-22-2009
20090265340Proximity search for point-of-interest names combining inexact string match with an expanding radius search - A point-of-interest mapping search system that combines inexact string searches with a proximity search to provide an extremely high probability of return of a set of search results in an initial search response that are useful to the user. Relevance of any particular point-of-interest item in a combined inexact string/proximity is dependent on both (1) a quality of the name match; and (2) a proximity to the starting location (or other relevant search center point) of the POI search. The inexact string name/proximity search is performed efficiently by iteratively expanding a search radius around a given location, searching concentric circles of proximity until a specified target number of relevant results have been found. It is the combination of the use of a combined inexact string match together with a proximity search performed against a database of geo-referenced business names that provides advantageous results.10-22-2009
20090265338CONTEXTUAL RANKING OF KEYWORDS USING CLICK DATA - Techniques are provided for ranking the entities that are identified in a document based on an estimated likelihood that a user will actually make use of the annotations. According to one disclosed approach, usage data that indicates how users interact with annotations contained in documents presented to the users is collected. Based on the usage data, weights are generated for features of a feature vector. The weights are then used to modify feature scores of entities, and the modified feature scores are used to determine how to annotate documents. Specifically, a set of entities are identified within a document. A ranking for the identified entities is determined based, at least in part, on (a) feature vector scores for each of the identified entities, and (b) the weights generated for the features of the feature vector. The document is then annotated based, at least in part, on the ranking.10-22-2009
20090265337TRAIL-BASED EXPLORATION OF A REPOSITORY OF DOCUMENTS - Techniques that support trail-based exploration by a user of a repository of documents are described herein. In one embodiment, trail definition data that specifies a trail is received. The trail includes an ordered series of waypoints including a trailhead, intermediate waypoints, and one or more trailends. In some embodiments, deadends may also be defined in the trail. A particular waypoint in the ordered series of waypoints is established as a current waypoint. Search terms can be received from a user to cause a search to be performed. It is then determined whether the search satisfies matching criteria associated with a waypoint that immediately follows the current waypoint in the ordered series of waypoints. If so, the user advances to the next waypoint. Otherwise, the user remains at the current waypoint. Finally, if a trailend is reached, then an action such as rewarding the user in some way may be performed.10-22-2009
20090265336Method Of Detecting A Reference Sequence Of Events In A Sample Sequence Of Events - A method of detecting a reference sequence of events in a sample sequence of events, wherein each event is of a certain event type and holds a set of data attributes, includes the steps of: picking candidate combinations of events from said sample sequence so that the event types within each candidate combination match the event types in the reference sequence, calculating an overall similarity score for each candidate combination from at least (i) an event occurrence score based on occurrence deviations of the events of a candidate combination with respect to the matching events of the reference sequence and (ii) an attribute match score based on similarity deviations between the data attributes of the events of a candidate combination and the data attributes of the matching events of the reference sequence, and identifying the candidate combination with the best overall similarity score as the detected reference sequence.10-22-2009
20090265335Automated Latent Star Schema Discovery Tool - A method, computer program product, and data processing system for computer-aided design of multidimensional data warehouse schemas are disclosed. A preferred embodiment of the present invention provides a software tool for identifying a latent star schema structure within an existing database. This software tool performs a heuristic analysis of the existing database schema to locate potential keys and measurement fields. Database tables within the existing schema are scored heuristically as to their suitability as fact tables based on the key candidates and measurement fields. For each fact table, other tables from the existing schema are identified as possible dimension tables. Data from the database is then used to test the suitability of the fact tables and dimension tables. The identified fact tables and their associated dimension tables are then reported to the user to reveal a basic star schema structure, which can be used as a basis for further design.10-22-2009
20090265334IMAGE QUERYING WITH RELEVANCE-RELATIVE SCALING - Queries may be issued against an image store to produce a set of image instances representing images in the image store that relate to the query. The relevance of the images to the query may be depicted by scaling the image instances according to the predicted relevance of the image to the query. The image instances may be further positioned within the image instance set query result, e.g., by clustering according to image relatedness or by similar predicted relevance of the images to the query terms of the query. The image instances may also be presented as smoothly zoomable images, such that the user may zoom in on the images in an efficient manner that facilitates realtime, gradual zooming with reduced resampling inefficiency.10-22-2009
20090265333PRE-PURCHASE DEVICE INTEROPERABILITY VALIDATION - An interoperability assessment between two or more devices can be based on the devices' specifications and on empirical evidence of interoperability. Comparisons between the devices' capabilities can provide an initial assessment of interoperability, which can be further supported, or contradicted, by empirical evidence. Interoperability determinations can leverage existing data collection, such as error reporting and user identities to obtain estimates of empirical usage of devices, and to provide for a level of automation for requesting users. Interoperability determinations can also be offered, with identity protection limitations, for users other than the requesting user to facilitate gift-giving or agent purchasing.10-22-2009
20090265332System and Methods for Evaluating Feature Opinions for Products, Services, and Entities - A system for evaluating a review having unstructured text comprises a segment splitter for separating at least a portion of the unstructured text into one or more segments, each segment comprising one or more words; a segment parser coupled to the segment splitter for assigning one or more lexical categories to one or more of the one or more words of each segment; an information extractor coupled to the segment parser for identifying a feature word and an opinion word contained in the one or more segments; and a sentiment rating engine coupled to the information extractor for calculating an opinion score based upon an opinion grouping, the opinion grouping including at least the feature word and the opinion word identified by the information extractor.10-22-2009
20090265331CREATING BUSINESS VALUE BY EMBEDDING DOMAIN TUNED SEARCH ON WEB-SITES - Domain specific topics, and optionally uniform resource locators (URLs) can be received from a user, and from those domain specific topics and URLs, domain tuned search definitions are generated for a given domain. The domain tuned search definitions are saved and the user is provided with a definition of a domain tuned search interface that is embedded on a site specified by the user. When someone reviewing the user's web site performs a search using the domain tuned, embedded search interface, a search engine is invoked which performs a search on the user's input query, and then returns domain specific search results. The search engine searches for domain specific search results over web sites in addition to the web site that the user is currently reviewing, so the search is more precise than a general web search but broader than a specific site search.10-22-2009
20090265330CONTEXT-BASED DOCUMENT UNIT RECOMMENDATION FOR SENSEMAKING TASKS - Techniques for locating information in a document relevant to an interest of a user are provided. Information defined by the user of a document browser is collected. A context model is generated using the collected information. A document selected by the user is obtained. The document is divided into one or more segments. A relevance value is computed for each of the one or more segments by comparing each of the one or more segments to the context model. The relevance value represents a relationship to an interest of the user. Each of the one or more segments with the computed relevance value is presented in a defined organizational area of a display. The one or more segments presented on the display are linked to a corresponding one or more segments in the document.10-22-2009
20090265329SYSTEM AND METHOD OF DATA CACHING FOR COMPLIANCE STORAGE SYSTEMS WITH KEYWORD QUERY BASED ACCESS - A method of data caching for compliance and storage systems that provide keyword search query based access to documents computes a value for each data document based on a document information-retrieval relevancy metric for user keyword queries and the recency and frequency of each query. The values are adapted to changing query frequencies and popularities. Then selecting and evicting documents from a cache can be based on the values according to a knapsack solution. A weight is computed for each query such that recent, more frequent queries get a higher weight. An information-retrieval metric is used for measuring a relevancy of a document for a query. A weighted sum is taken of the information-retrieval metric times a query weight over all queries.10-22-2009
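The weighted sum described in the entry above can be illustrated with a minimal Python sketch. The function names, the exponential-decay form of the query weight, the `half_life` parameter, and the greedy value-per-size selection (a simple stand-in for the knapsack solution) are all illustrative assumptions and are not taken from the application.

```python
import math

def query_weight(frequency, age_seconds, half_life=3600.0):
    """Hypothetical query weight: grows with frequency, halves every half_life seconds."""
    return frequency * math.exp(-math.log(2) * age_seconds / half_life)

def document_value(relevance_by_query, weights):
    """Weighted sum of (relevance metric x query weight) over all queries."""
    return sum(weights[q] * rel for q, rel in relevance_by_query.items())

def choose_cached(docs, capacity):
    """Greedy knapsack approximation: keep documents with the best value per unit size."""
    ranked = sorted(docs, key=lambda d: d["value"] / d["size"], reverse=True)
    kept, used = [], 0
    for d in ranked:
        if used + d["size"] <= capacity:
            kept.append(d["id"])
            used += d["size"]
    return kept
```

Documents left out of the list returned by `choose_cached` would be the eviction candidates; an exact knapsack solver could replace the greedy step where optimality matters.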
20090265328PREDICTING NEWSWORTHY QUERIES USING COMBINED ONLINE AND OFFLINE MODELS - Methods and apparatus are described for identifying newsworthy search queries employing a machine learning approach which combines offline and online modeling to achieve a high level of accuracy as well as timeliness and scalability.10-22-2009
20080288486METHOD AND SYSTEM FOR AGGREGATE WEB SITE DATABASE PRICE WATCH FEATURE - Signature schema documents may be pre-defined using a query language to provide instructions for application by an engine to extract data from web pages of respective web sites. For a particular web page, signature schema instructions identify a web page family for the web page and extract desired data from the web page in accordance with its web page family. A server may receive data from a web site and apply signature schema instructions maintained in a repository coupled to the engine. Extracted data can be cached to a database coupled to the engine to facilitate querying of the data to enable aggregate results to be presented to a client machine (e.g. a wireless device). The aggregate database can be monitored based upon defined user criteria such as for price changes of an item and provide appropriate notification to the client machine when changes occur.11-20-2008
20080288485STANDARDS-BASED LEARNING SYSTEMS AND METHODS - A user creating or modifying a lesson in an LMS can indicate a standard, such as a state-defined educational standard, for which additional content is wanted and the LMS will automatically retrieve content from an external resource, such as an LOR, that corresponds to the standard.11-20-2008
20080288482Leveraging constraints for deduplication - A deduplication algorithm that provides improved accuracy in data deduplication by using aggregate and/or groupwise constraints. Deduplication is accomplished by satisfying as many of these constraints as possible rather than imposing them inflexibly as hard constraints. Additionally, textual similarity between tuples is leveraged to restrict the search space. The algorithm begins with a coarse initial partition of data records and continues by raising the similarity threshold until the threshold splits a given partition. This sequence of splits defines a rich space of alternatives. Over this space, an algorithm finds a partition of the input that maximizes constraint satisfaction. In the context of groupwise aggregation constraints for deduplication all SQL (structured query language) aggregates are allowed, including summation.11-20-2008
20090037403GENERALIZED LOCATION IDENTIFICATION - A location identification system is described. In various embodiments, the location identification system identifies geographic location information in response to received search queries by processing geographic information to identify spatial or geometric regions, determining region intersection information that identifies spatial relationships between the geometric regions, and building an index of regions of constant attributes by associating intersecting geometric regions. In various embodiments, the location identification system can include a vector database wherein the vector database comprises geometric information including at least (a) spatial information geographically describing items and their locations and (b) textual attributes associated with the items or their locations, and an index of regions of constant attributes wherein the index associates textual attributes with items and their locations so that a proximity of two locations can be identified.02-05-2009
20080288483Efficient retrieval algorithm by query term discrimination - Described is an efficient retrieval mechanism that quickly locates documents (e.g., corresponding to online advertisements) based on query term discrimination. A topmost subset (e.g., two) of search terms is selected according to their ranked importance, e.g., as ranked by inverted document frequency. The topmost terms are then used to narrow the number of rows of an inverted query index that are searched to find document identifiers and associated scores, such as computed offline by a BM25 algorithm. For example, for each document identifier of each important term, a fast search within each of the narrowed subset of rows (that also contain that document identifier) may be performed by comparing document identifiers to jump a pointer within each other row, followed by a binary search to locate a particular document. The scores of the set of particular documents may then be used to rank their relative importance for returning as results.11-20-2008
20080288481Ranking online advertisement using product and seller reputation - Described is a technology by which online advertisements for returning with a query response are ranked according to reputation. The reputation may correspond to a product or service and/or seller reputation. In one example, a set of relevant advertisement items are located and ranked using reputation data as a factor. For example, for each item, a ranking value is based on a mathematical combination of a product reputation score, a seller reputation score and a relevance score, with the items ranked by their computed values. The scores may be weighted differently. The reputation data may be mined from a review source, such as customer reviews available on the web. In one example implementation, a 3-gram model that considers terms in the review along with the two terms preceding each term is used to analyze the reviews to determine whether each review is positive or negative with respect to the reputation.11-20-2008
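As a rough illustration of the ranking value in the entry above, the following Python sketch combines the three scores linearly. The linear form, the default weights, and the field names `product_rep`, `seller_rep`, and `relevance` are assumptions made for the example; the abstract only says the scores are combined mathematically and may be weighted differently.

```python
def rank_ads(ads, w_product=0.3, w_seller=0.2, w_relevance=0.5):
    """Sort advertisement items by a weighted sum of three scores (assumed linear)."""
    def ranking_value(ad):
        return (w_product * ad["product_rep"]
                + w_seller * ad["seller_rep"]
                + w_relevance * ad["relevance"])
    # Highest combined value first.
    return sorted(ads, key=ranking_value, reverse=True)
```

Changing the weights shifts how much reputation can compensate for a weaker relevance score.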
20080306937Using search trails to provide enhanced search interaction - It has been found that user navigation that follows search engine interactions provides implicit endorsement of resources (such as web resources) that are preferred by users, and which may be particularly valuable for exploratory search tasks. Thus, a combination of past searching and browsing user behavior is analyzed to identify additional information that augments search results delivered by a search engine. The additional information may include a display of hyperlinks to locations which are derived from the past searching and browsing user behavior, given a specific input query. The additional information may be provided to supplement web search results.12-11-2008
20080313176PATH-BASED RANKING OF UNVISITED WEB PAGES - Path-based ranking of unvisited Web pages for WWW crawling is provided, via identifying all the paths beginning with a “seed” URL and leading to visited relevant web pages as a “good-path set”, and for each unvisited web page, identifying the paths beginning from the “seed” URL leading to it as a “partial-path set”; classifying all the visited web pages and labeling each web page with the labels of a class or classes it belongs to; training a statistical model for generalizing the common patterns among the paths of the “good-path set”; and evaluating the “partial-path set” with the statistical model and ranking the unvisited web pages with the evaluation results.12-18-2008
20080313174METHOD AND SYSTEM FOR UNIFIED SEARCHING ACROSS AND WITHIN MULTIPLE DOCUMENTS - A user-interface system and method for searching among multiple documents and searching for subsections within individual documents using a single search interface on an input-constrained user device having a screen and a keypad.12-18-2008
20080313178DETERMINING SEARCHABLE CRITERIA OF NETWORK RESOURCES BASED ON COMMONALITY OF CONTENT - A method, article of manufacture, and apparatus for determining keywords to be used by a search engine. In one embodiment, a list of hyperlinks contained in an electronic document is identified by a searching program. The searching program then accesses the resource content (e.g., HTML) from each resource pointed to by the hyperlinks. The resource content of each resource is examined to determine whether a commonality exists in a manner directed to identifying keywords for each resource. These keywords may then be used by a search engine to return more accurate results to user queries.12-18-2008
20080313177ADDING DOMINANT MEDIA ELEMENTS TO SEARCH RESULTS - A method and system for determining dominance of the media elements of display pages is provided. The dominance system provides a scoring mechanism for scoring the dominance of media elements of display pages based on features of each media element of the display page. To generate the scores for the media elements of the display page, the dominance system first identifies the media elements and then identifies the features of the media elements. The dominance system then scores the identified media elements using the provided scoring mechanism and the identified features.12-18-2008
20080313175METHOD AND SYSTEM FOR INTERACTION-BASED EXPERTISE REPORTING - A task-based method and system for expertise reporting provides a display of a knowledge worker's expertise in structured documents based on their access and use of those documents.12-18-2008
20080313170Method and apparatus for keyword mass generation - A method and apparatus in accordance with the invention which, for any given keyword, generates a numeric value that defines keyword relevance based on the number and importance of a keyword's forward link and back link keyword neighbors.12-18-2008
20100005083FREQUENCY BASED KEYWORD EXTRACTION METHOD AND SYSTEM USING A STATISTICAL MEASURE - Frequency based keyword extraction method and system utilizing a statistical measure is disclosed which generates keywords within a page and/or document that can distinguish the document from an average document. A simple frequency threshold parameter can be utilized to determine a number of common stop words if a word in the document possesses a frequency in a corpus that is more than the threshold parameter. A statistical confidence interval of the frequency in the document can be compared against a frequency confidence interval of the word in the corpus. The extracted keyword possesses a greater intra-document frequency confidence interval than the frequency confidence interval of the word within the corpus. A statistical hypothesis test can also be utilized to determine the keyword by calculating a test statistic and testing whether the test statistic is greater than some threshold.01-07-2010
20080275861Inferring User Interests - The subject matter of this specification can be embodied in, among other things, a method that includes determining, for a portion of users of a social network, label values each comprising an inferred interest level of a user in a subject indicated by a label, associating a first user with one or more second users based on one or more relationships specified by the first user, and outputting a first label value for the first user based on one or more second label values of the one or more second users.11-06-2008
20080281810Meta search engine - In an information retrieval method a meta search engine (11-13-2008
20090063467Persona management in a geo-spatial environment - A method and system of persona management in a geo-spatial environment are disclosed. In one embodiment a method of persona management includes creating a plurality of persona profiles associated with a first member of a community network, determining a plurality of locations associated with each of the persona profiles, displaying the persona profiles at the locations on a geo-spatial map, and managing the persona profiles using the geo-spatial map. The method may include accessing one of the persona profiles, determining a context of expression associated with the persona profiles, generating a communication between the persona profiles and/or a contact associated with the persona profiles based on the context of expression, and sending the communication to the contact using the community network.03-05-2009
20080201327Identity match process - An online computer system enables users to identify and contact, if they so desire, users with similar attributes. The primary method of matching identity matches facial and at least one other physical, astrological, or other defined attribute. This matching is done by computer comparison of the photographs and other data provided by the users. The users may be provided with information to contact any matches.08-21-2008
20080281805MEDIA CONTENT TAGS - A tag file associated with a content file provides a user with access to related content. A content provider can request that information be associated with selected content, such that when a user views the selected content a selectable element is generated and displayed to the user to provide easy access to the related content. Information such as keywords associated with the selected content also can be used to search for related content. Related content information is placed in tags of the tag file for the selected content, such that at a selected or other appropriate time information relating to the related content is displayed to the user. When a user selects the selectable element, the related content is located and displayed in place of, or in addition to, the selected content. Such an approach is useful for digital media networks such as IPTV applications.11-13-2008
20080243805Automatic Creation of E-Books - A system searches for segments among multiple publications dealing with a given topic or set of topics, and compiles these segments into a custom-created electronic-book. In a commercial environment, such custom-created e-books are offered for sale to a user or set of users who have expressed interest in the given topic or set of topics. Optionally, the system is aware of the publications that are in a user's existing library, and avoids the inclusion of redundant material in the custom-created e-book for that user.10-02-2008
20080270375LOCAL NEWS SEARCH ENGINE - The present system relates to a method for indexing a document comprising the acts of retrieving in the content of the document true landmarks associated with said content, selecting one true landmark as representing the document, and indexing the document to include a reference to the selected true landmark, wherein the retrieving act further comprises the acts of identifying landmarks through the comparison of the content of the document to a landmark database, and retaining only true landmarks among the identified landmarks using a natural language approach, and wherein the selecting act further comprises the acts of assigning a weight to each true landmark based on at least one criterion, and selecting the true landmark that represents the document based on the assigned weights.10-30-2008
20090012954ELECTRONIC PROFILE RANKING - A method is implementable in an electronic system coupled to an electronic device, which is, in turn, coupled to a display device. A web page displayable on the display device is served to the electronic device. The displayed web page includes a user interface comprising a data-input field. At least one search term entered by a user of the electronic device and pertaining to a vocational characteristic is received from the electronic device. A set of profiles associated with respective entities is accessed. Each profile includes indicators of a plurality of vocational characteristics corresponding to the associated entity. The at least one search term is compared to the indicators associated with each profile of the set. Each profile of the set is ranked according, at least in part, to the existence of at least one positive match between the at least one search term and the indicators associated with each profile of the set and a predetermined weight assigned to each said positive match.01-08-2009
20080313173METHOD AND APPARATUS FOR RATING, DISPLAYING AND ACCESSING COMMON COMPUTER AND INTERNET SEARCH RESULTS USING COLORS AND/OR ICONS - A new and safe way to display, limit, and rate search results. New Methods and Apparatus for Rating, Displaying and Accessing common computer and Internet Search results using Colors. It is known that search results can be filled with both inappropriate and offensive materials. This invention provides for new means of displaying search results, while also allowing for end-users to clearly identify the differences between all search results, and to which they can or cannot have access.12-18-2008
20080208840Diverse Topic Phrase Extraction - Systems and methods for implementing diverse topic phrase extraction are disclosed. According to one implementation, multiple word candidate phrases are extracted from a corpus and weighted. One or more documents are re-weighted to identify less obvious candidate topics using latent semantic analysis (LSA). Phrase diversification is then used to remove redundancy and select informative and distinct topic phrases.08-28-2008
20080270390Criteria-Specific Authority Ranking - Disclosed is a method of ranking linkable nodes based on intrinsic scores assigned to the nodes.10-30-2008
20080215570Medical literature database search tool - This invention provides a method of identifying clinically relevant, evidence based medical literature on diseases and their treatment to physicians, nurses and other healthcare personnel. This method accesses a medical literature database that is then searched using a medical literature classification system identifier for a disease integrated with an evidence based medicine search filter. In conjunction with identifying the evidence based medical literature, a database of articles selected and reviewed by experts that concern the disease is also searched. The results of both searches are then displayed to the user. This invention can be implemented through the use of an encoded computer readable medium.09-04-2008
20080228750"Query-log match" relevance features - Techniques for generating features that are used to rank documents in a search results page are provided. A query is received and may be modified before being compared to queries in a query log of previously-issued queries. The comparisons may be made in a variety of ways. The comparisons may allow query terms to be ordered and terms to be inserted. Relevance features are generated from the results of the comparisons. The documents that are referenced in a search results page (generated as a result of the query) are ranked based on the generated relevance features.09-18-2008
20080235216METHOD OF PREDICITNG AFFINITY BETWEEN ENTITIES - In one embodiment, the invention includes a method of predicting affinity between a first entity and a second entity including associating a first plurality of characteristic tags with the first entity. The first plurality of characteristic tags are preferably associated with a first reference entity, generating a comparison matrix, and calculating a similarity score between the first entity and the second entity using the comparison matrix, wherein the second entity is associated with a second plurality of characteristic tags. In another embodiment, the invention includes a method of relating characteristic tags, including selecting a first characteristic tag from a first plurality of characteristic tags, selecting a second characteristic tag from a second plurality of characteristic tags, and relating the first characteristic tag and the second characteristic tag.09-25-2008
20080208833CONTEXT SNIPPET GENERATION FOR BOOK SEARCH SYSTEM - A book search system and media for generating a book index corresponding to a collection of books and for providing context snippets related to a search string formulated by a user based on the book index are provided. The book index includes a word hash that represents unique words and an offset to a location list that stores locations for each instance of the unique word. The book search system receives the search string from the user, parses the search string to locate phrases and words, and traverses the book index to generate a list of locations for each word or phrase included in the search string. The book search system utilizes a variable-sized container having a maximum size to store subsets of each word or phrase included in the list of locations to generate the context snippets for the search string.08-28-2008
20080208847Relevance ranking for document retrieval - Documents and/or document clusters are ranked with respect to their geographical locations and/or user specific (e.g., user input) relevance. Highly relevant documents and/or document clusters are assigned higher ranks than less relevant documents and/or clusters. In this way, ranked lists of documents and/or clusters, top clusters (e.g., top stories), top documents (e.g., most important articles), etc. may be served (e.g., presented, delivered, etc.) to users.08-28-2008
20080208831CONTROLLING SEARCH INDEXING - Computer readable media, systems, and methods for controlling search indexing are described. In embodiments, a search index control instruction is received and, if permitted by the search index control instruction, content pertaining to the received instruction is indexed and presented in accordance therewith. In one embodiment, receiving the search index control instruction includes traversing the Internet with a web crawler and analyzing one or both of a robots.txt file and source code associated with a website of interest to locate instructions. Search index control instructions may include, by way of example only, exclusionary instructions (e.g., excluding specified domains from linking to portions of the content associated with a website) and modification instructions (e.g., permitting indexing and presentation of content associated with a website but only in a modified form to reduce the risk of content theft).08-28-2008
20080243828Search and Indexing on a User Device - Search may be performed on a user device, such as a handheld electronic book reader device. A search query term may be received. Text of a collection of electronic items stored in memory of the user device may be searched for the queried term. Search results may be returned identifying locations in the electronic items at which the queried term appears.10-02-2008
20080243838COMBINING DOMAIN-TUNED SEARCH SYSTEMS - The claimed subject matter provides systems and/or techniques that effectuate combining domain-tuned search systems. The system can include mechanisms that obtain queries, written descriptions, or illustrative web-pages regarding a particular area of interest, and generate a definition related to the area of interest. The definition contains a list of paths with associated weights employed to identify a pre-established first domain-tuned search system related to the area of interest. The first domain-tuned search system thereafter can be combined with a second domain-tuned search system related to another area of interest and presented to a user for utilization in re-ranking generic search results to be specific to the first and second domains of interest if combined, or to only the first domain if weights for the second domain are logically subtracted.10-02-2008
20080281806SEARCHING A DATABASE OF LISTINGS - A database having listings rather than long documents is searched using a term frequency-inverse document frequency (Tf/Idf) algorithm.11-13-2008
20080208846Web site search and selection method - According to the web site search and selection method, in response to a search query a relevance score is assigned to each page of the web sites addressed by the search engine. Then, for each web site addressed by the search engine, the relevance scores of the individual pages are added together, after weighting them by a correction factor indicative at least of the number of pages of the site itself. In this manner, in response to the search query an overall relevance value for the sites addressed by the search engine is obtained.08-28-2008
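The site-level aggregation in the entry above can be sketched in a few lines of Python. The abstract does not give the correction factor, so the `1 / sqrt(n)` default here is purely a placeholder assumption, as are the function and parameter names.

```python
import math

def site_relevance(page_scores, correction=lambda n: 1.0 / math.sqrt(n)):
    """Sum per-page relevance scores, scaled by a factor that depends on page count.

    `correction` is a hypothetical stand-in for the correction factor,
    which the abstract only says reflects the number of pages of the site.
    """
    n = len(page_scores)
    if n == 0:
        return 0.0
    return correction(n) * sum(page_scores)
```

With this placeholder factor, a large site with many weakly relevant pages does not automatically outrank a small site with a few highly relevant ones.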
20080215571PRODUCT REVIEW SEARCH - This disclosure describes various exemplary methods, computer program products, and user interfaces that provide results for a product review search with opinion snippets and opinion visual graphs. This disclosure describes identifying user opinions by extracting passages that contain subjective opinions from web pages; ranking the user opinions by incorporating sentiment orientations and sentiment topics, where the sentiment orientations are positive or negative; and generating review snippets to indicate user sentiment orientations and to describe user opinions toward product features. This disclosure improves a user product search experience from the following aspects: understanding the product review from snippets instead of browsing the web page; obtaining more information by reading reviews in a shorter time period; and obtaining overall opinions of users of the web through visualized opinion summarization.09-04-2008
20080215564Query rewrite - A method and apparatus for rewriting of search engine queries is provided. Queries are rewritten by applying a set of rules. The rules represent domain knowledge and can be created by developers or users outside the search engine. There are two types of rules, production rules and definitions. Production rules specify how a query can be modified. Definition type rules specify a vocabulary for matching or modification of query terms. The modified query is issued to a search engine generating more focused and relevant results.09-04-2008
20080250012IN SITU SEARCH FOR ACTIVE NOTE TAKING - A system and method that facilitates and effectuates in situ search for active note taking. The system and method includes receiving gestures from a stylus and a tablet associated with the system. Upon recognizing the gesture as belonging to a set of known and recognized gestures, the system creates an embeddable object, initiates a search with terms indicated by the gesture, associates the search results with the created object and inserts the object in close proximity with the terms that instigated the search.10-09-2008
20080243832Method and System for Parsing Languages - Embodiments of systems and methods for comparing attributes of a data record are presented herein. In some embodiments, a weight is based on a comparison of the name (or other) attributes of data records. In some embodiments, an information score may be calculated for each of two name attributes to be compared to get an average information score for the two name attributes. The two name attributes may then be compared against one another to generate a weight between the two attributes. This weight can then be normalized to generate a final weight between the two business name attributes. Comparing attributes according to embodiments disclosed herein can facilitate linking data records even if they comprise attributes in languages which do not use the Latin alphabet.10-02-2008
20080243813LOOK-AHEAD DOCUMENT RANKING SYSTEM - A method and system is provided for calculating importance of documents based on transition probabilities from a source document to a target document based on looking ahead to information content of target documents of the source document. A look-ahead importance system generates transition probabilities of transitioning between any pair of source and target documents based on analysis of links to target documents of the source document. The system may calculate the transition probabilities based on the number of links on documents a look-ahead distance away. The system then solves for the stationary probabilities of the transition probabilities. The stationary probabilities represent the importance of the documents.10-02-2008
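Solving for stationary probabilities, as mentioned in the entry above, can be illustrated with plain power iteration in Python. This sketch assumes a row-stochastic transition matrix `P` for a chain that converges (irreducible and aperiodic); how the application derives the look-ahead transition probabilities is not reproduced here, and the function name and iteration count are assumptions.

```python
def stationary(P, iterations=200):
    """Approximate the stationary distribution pi with pi = pi * P by power iteration.

    P[i][j] is the probability of moving from document i to document j;
    each row is assumed to sum to 1.
    """
    n = len(P)
    pi = [1.0 / n] * n  # start from the uniform distribution
    for _ in range(iterations):
        pi = [sum(pi[i] * P[i][j] for i in range(n)) for j in range(n)]
    return pi
```

For the two-document matrix `[[0.5, 0.5], [0.25, 0.75]]` the iteration converges to approximately one third and two thirds, so the second document would be ranked as more important.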
20080235219DECISION MAKING AND IMPLEMENTATION SYSTEM - A system and method for generating recommendations of analyses of circumstances in business, accounting, science, medicine and other fields. An algorithm is generated using an interactive generation process based on decision tree type inquiries. The algorithm is translated into a computer language and code and loaded onto a computer, preferably on a network. A user inputs data concerning a particular topic, and the algorithm processes the data to generate and display a set of recommendations or analyses. The user inputs additional data which the system uses to refine the initial recommendations or analyses, and this process is repeated until arriving at a final set of recommendations or analyses. The organization and content of sets of display screens changes dynamically as data is input and processed. The data may include degrees of certainty relating to certain data, which is used in both determining a set of recommendations or analyses and expressing a degree of certainty about such recommendations or analyses.09-25-2008
20080215573Inter-Frequency Neighbor List Searching - Techniques for inter-frequency neighbor list searching are disclosed. Embodiments disclosed herein address the need for inter-frequency neighbor list searching. In one embodiment, a searcher is deployed to search a PN space with a first set of search parameters and to return search results. A subset of those results is selected, along with a previously saved search result, to form a set of PN locations for a second search. The second search is performed on a window around each of the PN locations, using a second set of search parameters. The maximum peak from the second search is saved for use in future iterations. In one embodiment, the subset is selected as the highest energy level peaks from the first search. In one embodiment, a maximum peak is deemed to correspond to a valid base station when the position of that maximum peak is within a pre-determined time offset from a previous maximum peak.09-04-2008
20080235218WIDE-SPECTRUM INFORMATION SEARCH ENGINE - A method and computer program product for comparing documents includes segmenting a judgment matrix into a plurality of information sub-matrices where each submatrix has a plurality of classifications and a plurality of terms relevant to each classification; evaluating a relevance of each term of the plurality of terms with respect to each classification of each information submatrix of the information submatrices; calculating an information spectrum for a first document based upon at least some of the plurality of terms; calculating an information spectrum for a second document based upon at least some of the plurality of terms; and identifying the second document as relevant to the first document based upon a comparison of the calculated information spectrums.09-25-2008
20080235217System and method for creating, verifying and integrating metadata for audio/video files - The present invention discloses a system and method for ensuring the integrity and format of metadata. In the preferred embodiment, a local database is created into which metadata information can be stored. Since the database is maintained locally, it can be guaranteed to have correct and complete metadata information. Metadata searches are preferably performed hierarchically, such that the local database is checked first for the required data. If the data is not resident in the local database, the traditional search of third-party databases is performed. Information retrieved from third-party databases is then verified, such as manually. Once the metadata has been checked and approved, the metadata is then stored locally. A set of rules is also created, which define the requirements and the file manipulations that must be performed on the metadata for each type of target device.09-25-2008
20080235215DATA SEARCH METHOD, RECORDING MEDIUM RECORDING PROGRAM, AND APPARATUS - A data search method causes a computer to search data stored in a search target apparatus based on keywords entered as search conditions.09-25-2008
20080235214System and method for event search - The present invention discloses a system that provides for online search of events using electronic devices, where the system comprises a portal page, a poster tree structure, and a search engine configured to search events using what functionality, where functionality and when functionality. Also disclosed is a method that provides online search of events using electronic devices, where the method comprises the steps of: opening a portal page configured to search events, moving a pointer or cursor to a what main category, optionally moving the cursor or pointer to a what sub-category, choosing a what main category or what sub-category, moving the cursor or pointer to a where field, choosing a geographical area of interest, and moving the cursor or pointer to a when field and choosing an event date.09-25-2008
20080235213Utilization of copyright media in second generation web content - Content provision apparatus for suggesting media content from a media database to augment new textual content being published as a blog or the like, comprises: a text retrieval unit for retrieving new textual content over a network; a text analysis unit for analyzing the retrieved textual content; a search unit for using the analyzing of the retrieved textual content to search the media database to find media content suitable for the new textual content; and a dispatch unit for dispatching to an author of the text retrieval unit a suggestion for augmenting the new textual content. The suggestion is typically in the form of an email but alternatively may be sent using an RSS feed or via a comment for the blog. Preferably the media content can be pasted straight into the blog. A feature allows the content to carry advertising as a pop up label or link, and the advertising fee pays for the media usage rights.09-25-2008
20080235212Evaluating real estate properties - The present invention is an improved system and method for analyzing multiple real estate properties. The system includes a pool of properties that are searched based on user-defined search criteria. The system identifies comparison properties from the search pool. The comparison properties include attributes that match or are a near match to the search criteria. The system compares each of the comparison properties to at least one average value and demonstrably depicts the comparison to the user.09-25-2008
20080235211Optimization method and process using tree searching operation and non-overlapping support constraint requirements - A method and process provides a selection process designed to select optimized results from a plurality of possible results represented in a search tree. A tree search is employed, wherein bounds are used to prune at least one node or branch of the search tree. A non-overlapping support constraint in conjunction with the tree search is invoked to further prune the search tree. An optimized search tree is stored into a memory, following the invoking of the non-overlapping support constraint, and the optimized search tree is employed in additional processing operations.09-25-2008
20080235206USING SCENARIO-RELATED INFORMATION TO CUSTOMIZE USER EXPERIENCES - Methods for using scenario solution-related information to generate customized user experiences are provided. Upon receiving a user query, a plurality of results is returned, each result being representative of a scenario solution which may be utilized to address a particular issue relevant to the received query. At the time of authoring, each scenario solution is organized based upon one or more keywords and/or one or more categories (i.e., namespaces). Data associated with a namespace/keyword corresponding to a returned search result may be mined to determine information beyond basic scenario solution search results that may be of interest to the user. As the namespace(s)/keyword(s) in association with which to organize a particular executable scenario solution is determined by the author of the scenario solution, other information associated with the same namespace/keyword (and/or a namespace/keyword having a relationship thereto) is likely to be more relevant than information organized based upon keywords alone.09-25-2008
20080275872SYSTEM AND METHOD FOR EFFICIENTLY SEARCHING A FORWARDING DATABASE THAT IS SPLIT INTO A BOUNDED NUMBER OF SUB-DATABASES HAVING A BOUNDED SIZE - A method, apparatus, and storage medium product are provided for forming a forwarding database, and for using the formed database to more efficiently and quickly route packets of data across a computer network. The forwarding database is arranged into multiple sub-databases. Each sub-database is pointed to by a pointer within a pointer table. When performing a longest-match search of incoming addresses, a longest prefix matching algorithm can be used to find the longest match among specialized “spear prefixes” stored in the pointer table. After the longest spear prefixes are found, the pointer table will direct the next search within a sub-database pointed to by that spear prefix. Another longest-match search can be performed for database prefixes (or simply “prefixes”) within the sub-database selected by the pointer. Only the sub-database of interest will, therefore, be searched and all other sub-databases are not accessed. Using a precursor pointer and a sub-database of optimally bounded size and number ensures power consumption be confined only to the sub-database being accessed, and that higher speed lookup operations can be achieved since only the sub-database of interest is being searched.11-06-2008
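The two-level lookup described in the entry above can be sketched as follows. This is a minimal illustration, not the patented implementation: prefixes are modeled as bit strings, the pointer table as a plain dictionary keyed by spear prefix, and the names `longest_match`, `lookup`, and `POINTER_TABLE` as well as the port labels are hypothetical. A real forwarding engine would use a trie or hardware table rather than a linear scan.

```python
def longest_match(address_bits, prefixes):
    """Return the longest entry in `prefixes` that is a prefix of address_bits, or None."""
    best = None
    for p in prefixes:
        if address_bits.startswith(p) and (best is None or len(p) > len(best)):
            best = p
    return best


def lookup(address_bits, pointer_table):
    """Two-stage search: pick a sub-database via its spear prefix, then search only that one."""
    # Stage 1: longest-match search among the spear prefixes in the pointer table.
    spear = longest_match(address_bits, pointer_table)
    if spear is None:
        return None
    # Stage 2: longest-match search restricted to the selected sub-database.
    sub_db = pointer_table[spear]
    prefix = longest_match(address_bits, sub_db)
    # Simplification (assumption): no fallback route if the sub-database has no match.
    return sub_db[prefix] if prefix is not None else None


# Hypothetical pointer table: spear prefix -> sub-database (prefix -> next hop).
POINTER_TABLE = {
    "0": {"01": "port-A"},
    "10": {"10": "port-B", "1011": "port-C"},
    "11": {"110": "port-D"},
}

print(lookup("10110000", POINTER_TABLE))
```

Only the sub-database selected in stage 1 is touched in stage 2, which is the property the abstract credits for the reduced power consumption and faster lookups.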
20090157647Method and Apparatus for Discovering and Classifying Polysemous Word Instances in Web Documents - A method and apparatus for discovering polysemous words and classifying polysemous words found in web documents. All document corpora in any natural language have words that have multiple usage contexts or words that have multiple meanings. Semantic analysis is not feasible for classifying all word occurrences in all documents on the web, which contain trillions of words in total. In addition, semantic analysis typically cannot distinguish multiple usages of a given meaning of a given word. In one embodiment of this invention, polysemous words in natural languages can be discovered by analyzing the co-occurrence of other words with the polysemous word in web documents. In one embodiment, the multiple meanings and usages of a polysemous word can be determined by analyzing the co-occurrences of other words with the polysemous word. In one embodiment, overcorrelation tables and three-word correlation tables are generated to analyze the words found in web documents.06-18-2009
20080270389METHOD AND SYSTEM FOR IMPROVEMENT OF RELEVANCE OF SEARCH RESULTS - A system and method for improving the relevance of search results is disclosed. Voters who may be human searchers or guides may review search results or other review items associated with a search request or other reference item. A review may be activated based on a usage indicator(s) which may improve utilization of guides. A vote by a voter may be weighted based on a voting history associated with the voter and one or more reference voters who may be designated by the system. A voter may be presented with a group of items for review including simultaneously. A number of comparison voting sessions or elections may be used to determine a rating or ranking of a review item associated with a reference item.10-30-2008
20080270391SYSTEM FOR PROVIDING MULTI-VARIABLE DYNAMIC SEARCH RESULTS VISUALIZATIONS - A system is provided for enabling a user to search for documents that the user has previously viewed on its local machine. The system includes three main components: the desktop integration module, the index module, and the graphical user interface module. The desktop integration module is an application which monitors documents with which the user interacts for predetermined events, and obtains content data and metadata from the monitored documents. The index module indexes the content data and metadata received from the desktop integration module. The graphical user interface module then permits a user to utilize the desktop integration module and index module by allowing a user to search for a document.10-30-2008
20080270387METHOD AND SYSTEMS FOR SEARCHING AND DISPLAYING SEARCH RESULTS USING PROXIMITY CRITERIA - Search parameters and proximity criteria may be used to perform a proximity search. The proximity criteria may indicate a desired proximity among the search parameters in order for there to be a match. When there is a document that includes the search parameters that satisfy the proximity criteria, the search parameters in the document may be formatted.10-30-2008
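A proximity search of the kind summarized above might look like the sketch below. It is an assumption-laden illustration: proximity is measured in word positions, the proximity criterion is a maximum span `max_gap`, and formatting is done by bracketing the matched terms. The function names `proximity_match` and `highlight` are hypothetical and do not come from the entry.

```python
import itertools
import re


def proximity_match(text, terms, max_gap):
    """Return sorted word positions of the tightest window containing all terms,
    or None if no window spans at most max_gap word positions."""
    words = re.findall(r"\w+", text.lower())
    positions = []
    for term in terms:
        hits = [i for i, w in enumerate(words) if w == term.lower()]
        if not hits:
            return None  # a search parameter is missing entirely
        positions.append(hits)
    best = None
    # Try every combination of one position per term and keep the smallest span.
    for combo in itertools.product(*positions):
        span = max(combo) - min(combo)
        if span <= max_gap and (best is None or span < best[0]):
            best = (span, combo)
    return None if best is None else sorted(best[1])


def highlight(text, terms):
    """Format every occurrence of the terms by wrapping it in square brackets."""
    pattern = r"\b(" + "|".join(re.escape(t) for t in terms) + r")\b"
    return re.sub(pattern, lambda m: "[" + m.group(0) + "]", text, flags=re.IGNORECASE)


doc = "The patent describes a search engine that ranks results"
if proximity_match(doc, ["search", "results"], max_gap=5) is not None:
    print(highlight(doc, ["search", "results"]))
```

Enumerating all position combinations is exponential in the number of terms; it is used here only because it keeps the matching rule easy to read.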
20090138458APPLICATION OF WEIGHTS TO ONLINE SEARCH REQUEST - A machine-implemented search method comprises inputting a search query from a user, and before the query is executed, inputting from the user a weighting factor that has a specified relationship to the query. The method further includes initiating a search by causing the query to be applied according to the weighting factor, and returning a result of the search to the user. The weighting factor may represent, for example, a weight to be given to one of multiple information sources that are available to be searched in response to the query, such as an online search engine or a merchant online commerce web site. Alternatively, the weighting factor may represent a weight to be given to a term in the query.05-28-2009
20090100047METHOD AND SYSTEM OF MANAGING AND USING PROFILE INFORMATION - A method and system for matching a search request to a human assistant and/or other items based on information indicated in a profile associated with the search request is described. A ranking of a guide is determined based on matching of information associated with the guide and information associated with a search request. Profile information such as demographic, geographic, personality, areas of interest, people, hobbies, etc. may be used in addition to other information such as keywords or categories which are associated with a request in order to select a guide. Items such as a search result, an advertisement, a search resource, a previous query, etc. may be selected based on profile information associated with the item. Profile information may be associated with an item based on profile information associated with a guide and/or a user who has expressed an opinion regarding the item.04-16-2009
20090089282NATURAL LANGUAGE BASED SERVICE SELECTION SYSTEM AND METHOD, SERVICE QUERY SYSTEM AND METHOD - The present invention relates to a natural language based service selection system for complementing incomplete queries, which comprises a semantic analyzing device which analyzes an incomplete query from a user semantically, a service selecting device which complements the incomplete query based on the semantic-analyzed query so as to acquire the corresponding selected service, and a retrieving device which retrieves an answer according to the selected service. The present invention also relates to a natural language based service selection method as well as a service query system and method thereof, and thus can process an incomplete query from a user and provide a selected service.04-02-2009
20090089275USING USER PROVIDED STRUCTURE FEEDBACK ON SEARCH RESULTS TO PROVIDE MORE RELEVANT SEARCH RESULTS - The present invention discloses a solution of using user provided structure feedback to index electronic documents. In the solution, a search engine can serve search results based on an indexed store of electronic documents to at least one user. Structure feedback can be received concerning the search results. The structure feedback can identify at least one structure element of an electronic document and at least one user specified semantic tag for the structure element. The indexed store can be changed to incorporate the structure feedback. The changed index store can be used when subsequently serving search results. The search engine can be a Web search engine and/or a desktop search engine.04-02-2009
20090030892SYSTEM OF EFFECTIVELY SEARCHING TEXT FOR KEYWORD, AND METHOD THEREOF - A system of the present invention stores: a first index which designates lists of keywords contained in texts from identifications of the respective texts; a second index which designates lists of texts containing keywords from identifications of the respective keywords; and the number of texts containing the respective keywords. Then, upon receiving an input of a text search condition, the system calculates an estimation of search time by the first index and an estimation of search time by the second index, and determines which one of the first and second indexes makes a search faster. Then, by using the index which has been determined to make the search faster, the system searches for keywords which appear in texts satisfying the text search condition with higher frequency.01-29-2009
20090063469User Based Document Verifier & Method - Temporal qualities of electronic documents are verified using a variety of techniques, including human based feedback. The invention can be used in environments such as automated news aggregators, search engines, and other electronic systems which compile information having temporal qualities.03-05-2009
20090063464System and method for visualizing and relevance tuning search engine ranking functions - The present invention is directed towards systems and methods for generating a visual representation indicating performance of a system capable of accepting one or more inputs and producing an ordered set of one or more responsive outputs. The method of the present invention comprises selecting one or more benchmark inputs and generating an ordered output set for each of the one or more benchmark inputs, a given output set comprising one or more output items responsive to a given benchmark query. One or more pixels representing the one or more output items comprising the one or more output sets are generated, a given pixel containing a visual representation indicating a degree to which the output item represented by the pixel is relevant with respect to the benchmark input to which the output item is responsive. The one or more pixels representing the one or more output items comprising the one or more output sets are arranged in a circle in a manner indicative of the performance of the system.03-05-2009
20090063471SYSTEMS AND METHODS FOR PROVIDING A CONFIDENCE-BASED RANKING ALGORITHM - A method for using a confidence based ranking algorithm is described. At least one search parameter is received. The at least one search parameter is used to identify at least one data record with confidence values. A results list with one or more data records is created. The results list is ordered according to the confidence values within the data records. The results list is sent.03-05-2009
20090049037Temporal Document Sorter and Method - Electronic documents are classified and compared according to their temporal qualities. The content of a document relating to an event is analyzed to identify temporal components. These components can be compared with corresponding counterparts in other documents to identify a relative temporal order. The invention can be used in environments such as automated news aggregators, search engines, and other electronic systems which compile information having temporal qualities.02-19-2009
20090119291MICROHUBS AND ITS APPLICATIONS - A system and method of crawling at least one website comprising at least one URL includes maintaining a lookup structure comprising all of the URLs known to be on a website; calculating a hub score for each webpage of the website to be recrawled, wherein the hub score measures how likely the to be recrawled webpage includes links to fresh content published on the website; sorting all the to be recrawled pages by their hub scores; and crawling the to be recrawled pages in order from highest hub scores to lowest hub scores. The calculating comprises computing a first value equaling a percentage of a number of new relative URLs on the to be recrawled page; computing a second value equaling a percentage of a previous hub score of the to be recrawled page; and computing the hub score as a sum of the first and the second values.05-07-2009
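The hub-score calculation in the entry above can be illustrated with a short sketch. The abstract only says each term is "a percentage" of its input, so the weights `new_weight` and `decay` (both 0.5 here) are illustrative assumptions, as are the function names and the sample site data.

```python
def hub_score(links_on_page, known_urls, previous_score, new_weight=0.5, decay=0.5):
    """Sum of a fraction of the new-URL count and a fraction of the previous hub score."""
    new_links = set(links_on_page) - known_urls  # URLs not yet in the lookup structure
    first_value = new_weight * len(new_links)
    second_value = decay * previous_score
    return first_value + second_value


def recrawl_order(pages, known_urls):
    """Sort the pages to be recrawled from highest hub score to lowest."""
    scores = {
        url: hub_score(info["links"], known_urls, info["previous_score"])
        for url, info in pages.items()
    }
    return sorted(scores, key=scores.get, reverse=True)


# Hypothetical site: URLs already known, and the pages queued for recrawling.
known = {"/a", "/b"}
pages = {
    "/index": {"links": ["/a", "/c", "/d"], "previous_score": 2.0},
    "/archive": {"links": ["/a", "/b"], "previous_score": 1.0},
    "/news": {"links": ["/e", "/f", "/g"], "previous_score": 0.0},
}
print(recrawl_order(pages, known))
```

Carrying part of the previous score forward lets a page that has historically surfaced fresh links stay near the front of the queue even after one quiet crawl.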
20090119289Method and System for Autocompletion Using Ranked Results - A set of ordered predicted completion strings is presented to a user as the user enters text in a text entry box (e.g., a browser or a toolbar). The predicted completion strings can be in the form of URLs or query strings. The ordering may be based on any number of factors (e.g., a query's frequency of submission from a community of users). URLs can be ranked based on an importance value of the URL. Privacy is taken into account in a number of ways, such as using a previously submitted query only when more than a certain number of unique requesters have made the query. The set of ordered predicted completion strings is obtained by matching a fingerprint value of the user's entry string to a fingerprint-to-table map which contains the set of ordered predicted completion strings.05-07-2009
20090119288APPARATUS AND METHOD FOR SEARCHING MEDIA DATA - An apparatus and method of searching media data is provided. The method of searching media data includes selecting attributes from a displayed category, calculating degrees of correspondence between the selected attributes and media data, and generating specified signals in accordance with the calculated degrees of correspondence.05-07-2009
20090119285QUERY UTILIZATION - Methods and system for query utilization are described. A rate of a plurality of queries to a data source may be determined for a plurality of time periods. The plurality of queries may be associated with a term. A cost may be associated with a normal-to-deviated query state transition and a deviated-to-normal query state transition. A normal query state or a deviated query state may be assigned to a particular query on a particular time period of the plurality of time periods based on the rate of queries for the particular time period and the cost of the normal-to-deviated query state transition and the deviated-to-normal query state transition. A query burst may be identified during the plurality of time periods based on assignment of the normal query state or the deviated query state to the plurality of queries. The query burst may have the normal query state, the normal-to-deviated query state transition, and the deviated query state during a time period.05-07-2009
20090119283System and Method of Improving and Enhancing Electronic File Searching - The invention is a system and a method for improving and enhancing electronic file searching, the system employing the method, and the method comprising (a) accepting context as input; and (b) reducing context to keywords; and (c) weighting keywords; and (d) searching electronic data store(s) using said weighted keywords; and (e) displaying search results.05-07-2009
20090119274NAMED ENTITY EXTRACTING APPARATUS, METHOD, AND PROGRAM - A named entity extracting apparatus that extracts a named entity suitable for a user by enabling an order to be set in which the named entity is extracted from texts includes: an extraction order reading unit 05-07-2009
20090070324RELATED INFORMATION TRANSMISSION METHOD, RELATED INFORMATION TRANSMISSION SERVER, TERMINAL APPARATUS AND RELATED INFORMATION TRANSMISSION SYSTEM - A related information transmitting method, comprising the steps of: 03-12-2009
20090187553METHOD AND SYSTEM FOR FACILITATING VERIFICATION OF AN ENTITY BASED ON BUSINESS REQUIREMENTS - A method, system and computer program product for facilitating verification of an entity against a reference database. The entity comprises a plurality of attributes. The method obtains a set of attributes from the plurality of attributes based on a set of predefined parameters. Further, the method selects a set of algorithms corresponding to each attribute belonging to the set of attributes. Thereafter, the method executes one or more algorithms belonging to a set of algorithms corresponding to each attribute.07-23-2009
20090164460Digital television video program providing system, digital television, and control method for the same - Disclosed are a digital television video program providing system, a digital television, and a control method for the same. The digital television video program providing system includes a video analyzing unit for extracting search terms from information data associated with a played video, a search keyword extracting unit for selecting a search keyword from the extracted search terms based on a reliability factor acquired by an extraction method of the extracted search terms, a video search unit for searching for a playable video based on the selected search keyword, and a search result providing unit for providing search results for the video.06-25-2009
20090089279Method and Apparatus for Detecting Spam User Created Content - The present invention provides methods, apparatuses and systems directed to automatically detecting spam user created content. In a particular implementation, there is provided a method for processing spam contents, which comprises: maintaining a plurality of key information databases; receiving user-created content and at least one of a service ID and a content category ID of the user-created content from one or more users of a user-created content hosting site; selecting one of the plurality of key information databases based on at least one of the service ID and the content category ID; extracting second key information from the received user-created content; searching the selected key information database for first key information related to the second key information; classifying the received user-created content as spam content based on the extracted second key information and/or the first key information related to the second key information; and conditionally storing the user-created content in a network accessible data store available to users of the user-created content hosting site based on classifying the user-created content as spam or non-spam content. Said first and second key information may comprise at least one of predetermined type(s) of data, word(s) and phrase(s) in said contents, wherein said data comprises a user ID, a universal resource locator, a site address, an account number and/or a telephone number. In addition, said method may further comprise: determining whether the extracted second key information corresponds to predefined restricted information; and if the extracted second key information corresponds to the predefined restricted information, removing the extracted second key information and/or replacing the extracted second key information with predefined different information.04-02-2009
20090182730Method for semantic based storage and retrieval of information - A method of storing semantically similar documents on proximally located peers in a structured peer to peer overlay network, where each peer is assigned a unique identifier and each document includes one or more words belonging to at least one hierarchical structured collection of words. A method of searching and retrieving documents, corresponding to a search query, from a structured peer to peer overlay network is also provided.07-16-2009
20090182728Managing an Archive for Approximate String Matching - In one aspect, in general, a method is described for managing an archive for determining approximate matches associated with strings occurring in records. The method includes: processing records to determine a set of string representations that correspond to strings occurring in the records; generating, for each of at least some of the string representations in the set, a plurality of close representations that are each generated from at least some of the same characters in the string; and storing entries in the archive that each represent a potential approximate match between at least two strings based on their respective close representations.07-16-2009
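One way to picture the archive of potential approximate matches is the sketch below. It assumes a specific kind of close representation, namely the string itself plus every variant with one character deleted; the abstract does not say which representations are used, so this choice and the names `close_representations` and `build_archive` are illustrative only.

```python
from collections import defaultdict
from itertools import combinations


def close_representations(s):
    """The string itself plus every variant obtained by deleting one character."""
    reps = {s}
    for i in range(len(s)):
        reps.add(s[:i] + s[i + 1:])
    return reps


def build_archive(strings):
    """Return a set of (a, b) pairs whose close representations overlap."""
    by_rep = defaultdict(set)
    for s in set(strings):
        for rep in close_representations(s):
            by_rep[rep].add(s)
    archive = set()
    # Any two strings sharing a representation are recorded as a potential match.
    for group in by_rep.values():
        for a, b in combinations(sorted(group), 2):
            archive.add((a, b))
    return archive


print(sorted(build_archive(["smith", "smyth", "smit", "jones"])))
```

With single deletions, two strings share a representation when they differ by one substitution, one insertion, or one deletion, so the archive can be built without comparing every pair of strings directly.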
20090182727SYSTEM AND METHOD FOR GENERATING TAG CLOUD IN USER COLLABORATION WEBSITES - A system and method for searching a collaborative website and displaying one or more words in a tag cloud. The system includes a search engine structured to search a collaborative website and a tag cloud generator configured to produce a tag cloud, which includes one or more words associated with one or more documents and configured to be weighted or scored according to importance within an online community, and based on a search term entered into the search engine. The method includes scoring one or more words within the collaborative website and displaying the one or more words via a tag cloud according to the score.07-16-2009
20090182725DETERMINING ENTITY POPULARITY USING SEARCH QUERIES - Systems, methods, and computer-readable media for determining the Internet search popularity of an entity are provided. Embodiments of the present invention include receiving a group of Internet search records and assigning a popularity ranking based on the number of times an entity descriptor associated with an entity occurs within the group of Internet search records created over a designated time period. An entity descriptor is one or more terms commonly used to identify an entity. The trend in an entity's popularity rank may also be calculated. An entity's popularity rank and trend in popularity rank may be presented in a graph or in a list.07-16-2009
20090182732Query based operation realization interface - Method and apparatus for an operation realization interface. In some embodiments, the method and apparatus provides a user query interface for a user to enter a query to describe her goal for a product. Then, the method and apparatus processes the query to determine appropriate operations of the product that, when realized, realize the user's goal. Finally, the method and apparatus manipulates the product to realize the appropriate operations so as to realize the user's goal.07-16-2009
20090182731Search method and system using thinking system - The present invention relates to a system and method for information processing using artificially constructed apparatus. More specifically, the present invention provides a system and method that can search for information in a document structure and provide precise results by analyzing the inputs and search results using the executing system and the knowledge structure of the thinking system. In one preferred embodiment of the present invention, the search terms are divided into subject terms and corresponding feature terms, and document entry files comprising respective subject terms and corresponding feature terms will provide access to documents including subject terms and corresponding feature terms.07-16-2009
20090182726Bloom Filter for Storing File Access History - A method of producing a search query result that incorporates information about previously accessed search results includes retrieving a list of results responsive to a search request from a user at a first client. A Bloom filter is applied to the results in the list of results to identify one or more first results, if any, in the list of results that the user has previously accessed. A result list is generated. The result list includes at least a portion of the list of results, based at least in part on the identified one or more first results. The result list is sent to the first client.07-16-2009
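The role of the Bloom filter in the entry above can be shown with a small sketch. The filter size, the number of hash functions, the SHA-256 based hashing, and the names `BloomFilter` and `annotate_results` are all assumptions made for illustration; the abstract does not specify them.

```python
import hashlib


class BloomFilter:
    """Compact set membership test: no false negatives, occasional false positives."""

    def __init__(self, num_bits=1024, num_hashes=3):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = bytearray(num_bits // 8 + 1)

    def _positions(self, item):
        # Derive num_hashes bit positions by salting the item with a seed.
        for seed in range(self.num_hashes):
            digest = hashlib.sha256(f"{seed}:{item}".encode("utf-8")).digest()
            yield int.from_bytes(digest[:8], "big") % self.num_bits

    def add(self, item):
        for pos in self._positions(item):
            self.bits[pos // 8] |= 1 << (pos % 8)

    def __contains__(self, item):
        return all(self.bits[pos // 8] & (1 << (pos % 8)) for pos in self._positions(item))


def annotate_results(results, history):
    """Pair each result with a flag saying whether the filter reports a prior access."""
    return [(url, url in history) for url in results]


history = BloomFilter()
history.add("example.com/a")  # record a previously accessed result
for url, seen in annotate_results(["example.com/a", "example.com/b"], history):
    print(url, "previously accessed" if seen else "new")
```

Because a Bloom filter can report a result as previously accessed when it was not, a flag produced this way is best treated as a hint rather than a guarantee.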
20090182723RANKING SEARCH RESULTS USING AUTHOR EXTRACTION - Architecture that extracts author information from general documents and uses the author information for search results ranking. The architecture performs automatic author value extraction and makes the extracted value available at index time for subsequent use at query processing and results ranking. Machine learning (e.g., a perceptron algorithm) is employed and a set of input features for the perceptron algorithm is utilized for author value extraction. The extracted author value is converted into a feature for input to a ranking function for generating a ranking score for each document. The input features can also be weighted according to weighting criteria.07-16-2009
20090187551SEARCH RESULTS WHEN SEARCHING FOR RECORDS OF A BUSINESS OBJECT - Embodiments of the invention provide systems and methods for searching records of a business object using a keyword search. According to one embodiment, a method of searching for one or more records of a business object can comprise converting one or more records of the business object to one or more keyword lists and searching for records of the business object based on the keyword lists and the search criteria. Converting records of the business object to one or more keyword lists can comprise converting one or more attributes of each of the records to one or more keyword-value pairs and saving the keyword-value pairs in the keyword lists. Searching for records of the business object based on the keyword lists comprises applying a keyword search to the keyword-value pairs of the keyword lists.07-23-2009
20090164464Matching Process System And Method - A method for profile matching includes receiving a plurality of user profiles, each user profile comprising traits of a respective user. The method includes receiving a preference indication for a first user profile of the plurality of user profiles. The method also includes determining a potential match user profile of the plurality of user profiles based on the preference indication for the first user profile. The method also includes presenting the potential match user profile to a second user.06-25-2009
20090177648SYSTEMS AND METHODS FOR ORGANIZING AND MANAGING TRUSTED HEALTH CARE REFERENCE INFORMATION - Computer systems, computer readable media, and methods for receiving a search query are provided. Responsive to the search query, data is searched. The data is organized into a plurality of therapeutic categories, each respective category comprising a plurality of documents that relate to the respective category, each document organized into a plurality of predetermined fields of information. Each document is associated with a single therapeutic category. The searching comprises (i) identifying a subset of the plurality of therapeutic categories that pertain to the search query, (ii) identifying a subset of documents that pertain to the search query, and (iii) identifying a subset of fields of information contained in documents in the data that pertain to the search query. The subset of the plurality of therapeutic categories, the subset of documents that pertain to the search query, and the subset of fields of information that pertain to the search query are outputted.07-09-2009
20090164455SYSTEM AND METHOD FOR PERFORMING UNICODE MATCHING - System and method for performing Unicode matching for comparing and merging similar data objects having Unicode strings that are equivalent yet not exact matches. Unicode characters are characterized by number of strokes, stroke order, radicals, geometry, phonemes in association with input method editor and keyboard characteristics such as location of a character on an IME or keyboard (or number of GUI interface interactions used in entering the character, e.g., via tapping where “a” on a mobile device keyboard takes 1 tap of a key, “b” takes 2 taps). These characteristics associated with code points and IMEs/keyboards are utilized to create subdomains for matching and determining “distance” to other Unicode code points (e.g., number of keyboard keys away). Allows for determining whether close, yet incorrect data entry may have taken place. Enables merging of duplicate data objects into a master data object where minor differences or spelling errors actually represent duplicate data.06-25-2009
20090164446USER FEEDBACK FOR SEARCH ENGINE BOOSTING - A system, method and program product that utilizes user feedback as a boosting mechanism for closed loop content space search processes, such as site-specific web search engines. A search engine is disclosed that includes: a system for searching a database of content items such as web pages; a data collection system for collecting user feedback from users viewing displayed content items regarding information appearing in said displayed content items; a scoring system for assigning a score to content items from the database based on the user feedback; and a system for ranking a set of search results based on the score assigned to content items in the set of search results.06-25-2009
20090164462DEVICE AND A METHOD FOR ANNOTATING CONTENT - A device and a method for annotating content is provided. The device may comprise a means to analyse the content (06-25-2009
20090164450SYSTEMS AND METHODS OF RANKING ATTENTION - The disclosure describes systems and methods of ranking user interest in physical entities based on the attention given to those entities as determined by an analysis of communications from devices over multiple communication channels. The attention ranking systems allow any “Who, What, When, Where” entity to be defined and ranked based, at least in part, on information obtained from communications between users and user proxy devices. An entity rank is generated for each entity known to the system, in which the entity rank is derived from the information in communications that are indicative of user actions related to the entity. The entity ranks are then used to modify the display of information or data associated with the entities. The system may also generate a personal rank for each entity based on the relation of the entity to a specified user.06-25-2009
20090164454SYSTEM AND METHOD FOR SEARCHING VENUES BASED ON SIMILARITY VALUES - A method, system and computer program product for searching for a venue based on a similarity value is disclosed. In one embodiment, the method includes receiving one or more selected attributes. The method further includes receiving one or more attribute-specific optional factor values and one or more venue-specific optional factor values. The method further includes searching a database containing a plurality of venue records each having one or more attributes and identifying the venue records having one or more attribute matches, the attribute matches being the selected attributes. The method further includes generating an attribute-specific similarity value and a venue-specific similarity value, and a total similarity value.06-25-2009
20090164463Destination input systems, methods, and programs - Destination input devices, methods, and programs that store a plurality of destination data items and a plurality of related terms related to the destination data items, a plurality of generic names associated with each of the related terms. The devices, methods, and programs select at least one of the destination data items that are stored in a memory based on an ordinary search of the character that is input as the search key, extract one of the related terms, the generic names, and the destination data items based on a fuzzy search of the character that is input as the search key; and display in list form, search results that correspond to the character that is input, the generic names that are selected by the fuzzy search and the at least one of the destination data items that is selected by the ordinary search.06-25-2009
20090164458Methods and systems employing a cohort-linked avatar - Avatars, methods, apparatuses, computer program products, devices and systems are described that carry out obtaining at least one item description; determining an indication of fit between at least one aspect of the item description and at least one cohort-linked avatar; and transmitting the indication of fit to at least one entity.06-25-2009
20090164457Information collection, filtering and distribution method and system - A method and a corresponding system for collecting, filtering and distribution of electronic information comprising the following steps: collecting information items from information channels of different types, filtering the information items according to filtering specifications, assigning the filtered information items to information queues and supplying the information queues to information consumers.06-25-2009
20090164456EXPANDING A QUERY TO INCLUDE TERMS ASSOCIATED THROUGH VISUAL CONTENT - A method for expanding a query to include additional terms associated through visual content is provided. A bipartite graph is constructed based on a database of visual content and associated textual content. One partition of the bipartite graph contains visual content and the other partition of the bipartite graph contains textual content. Weighted edges between nodes in the two partitions represent associations between the visual content and textual content in the database. Random walks on the bipartite graph are performed to derive probabilistic association scores between textual content that are indirectly associated with each other through visual content. The query is expanded to include additional terms whose equivalent textual content is highly associated with the query's equivalent textual content.06-25-2009
20090164453System and method for providing real-time search results on merchandise - A search suggestion system and method for a product/service database which provides an improved, bifurcated search result algorithm. A vectored index of a product/service database is first generated. As a search query is typed, the letters/words are processed through a lexicographical matching module, compared to the index, and a subset of the index is identified. The subset is then ranked according to (1) the user's history, (2) most popular sales data, (3) most often viewed products, and (4) lexicographical weights. The highest ranked items are then displayed in a drop-down list to the user.06-25-2009
20090164452APPARATUS AND MEHTOD FOR PERSONALIZATION ENGINE - An apparatus and a method for a personalization engine for providing a user preference matching score for a media content item. Any of a plurality of media processing applications can submit a request including identification of the media content item and associated meta-data, and receive in response the user preference matching score. The requesting application can take actions responsive to the received user preference matching score. The user preference matching score is derived from information, collected by the personalization engine from a plurality of sources, including data pertaining to a plurality of pre-defined fields that reflect the user's expressed preferences and the user's previous usage of other media content items. In deriving the user preference matching score, different weighting factors can be assigned to data in each of the pre-defined fields based on, for example, the source of the data and weighting factors specified by the requesting application.06-25-2009
20090164451SYNDICATING HUMOR - A method and apparatus for altering a page presenting search results is provided. The query dispatcher receives one or more query terms. Based on the query terms, the search engine generates a set of search results and advertisements. A parallel search dispatched by the entertainment item rating and selection engine generates a set of content items based on the one or more query terms and an additional one or more constraint terms. The entertainment item rating and selection engine selects a content item from the set of content items. The selection may be random, based on past user responses, or based on responses of users belonging to particular clusters. The entertainment item injector then replaces one of the search results or advertisements with the content item. The content item is presented to the user on a search results page. The content item contains a feedback mechanism to collect user responses. The entertainment item rating and selection engine then derives the quality of the entertainment item from the collected user responses.06-25-2009
20090164447CONTENT SEARCHING FOR PORTALS HAVING SECURE CONTENT - Searching content of a portal comprising a plurality of portal content elements, at least one of which is a secure portal content element. Each secure portal content element is associated with a unique identifier. Search parameters and credentials are received from a user, and a preliminary result set satisfying the search parameters is generated. For each secure portal content element in the preliminary result set, the credentials and the unique identifier are used to determine whether the user is permitted to access that secure portal content element. The preliminary result set is used to generate a final result set. A result identification is presented to the user, identifying the secure portal content elements that are included in the final result set and which the user is permitted to access, with such secure portal content elements being distinguished from any secure portal content elements to which the user is denied access.06-25-2009
20090187560STRING PATTERN CONCEPTUALIZATION METHOD AND PROGRAM PRODUCT FOR STRING PATTERN CONCEPTUALIZATION - A conceptualization method uses maximum or other substrings of a string pattern to find specific N-tuples of substring triples with N≧2 and m=1 . . . N inside a reference set (SET_r_i) of strings (STR_n_i). Each N-tuple is considered as a candidate for representing related concepts. Each concatenation of the substring triples is an explicit member of the reference set (SET_r_i). Each middle substring out of middle substrings is unequal to another middle substring out of middle substrings within the substring triples found inside the reference set (SET_r_i). Each prefix substring (X_i) is equal to all other prefix substrings (X_i) within the substring triples found inside the reference set (SET_r_i). Each suffix substring (Z_i) is equal to all other suffix substrings (Z_i) within the substring triples found inside the reference set (SET_r_i). Either the prefix substring (X_i) or the suffix substring (Z_i) is not empty.07-23-2009
20090187556COMPUTER METHOD AND APPARATUS FOR GRAPHICAL INQUIRY SPECIFICATION WITH PROGRESSIVE SUMMARY - A computer method and system provides for graphical specification of inquiries and includes a corresponding progressive summary. The inquiries operate on stream data. Users graphically specify an inquiry in a graphical user interface according to an ontology. The invention system generates a plain-text translation of the graphical description of the inquiry and displays the generated plain-text description in a progressive summary in the graphical user interface. The system continually updates and generates the display of the plain-text description during user construction of the inquiry. This provides feedback to the user for improved construction of the inquiry.07-23-2009
20090144261SYSTEMS AND METHODS FOR SOLVING MULTIPLE INTERACTING STATE-SPACE SEARCH PROBLEMS - A combinatorial search method and system is implemented in a computer control system for utilizing state-space planning of operations for multi-step production processes. The planner considers various possible combinations of actions, searching for one that correctly transforms the initial state of the object (or commodity) into the specified desired final state, where each combination of actions the planner considers is called a search node. Each node contains a plan representing a series of actions of a plurality of machines on a single object and also containing the predicted state of the object with those actions applied either forward or backward. The state of the object consists of the set of attributes of the object. The method and system include multiple individual state-space search operations having a plurality of nodes, at least some of the nodes include children, and the children of the nodes represent potential solutions to existing problems to be solved, and the multiple state-space search operations are linked into a single search tree.06-04-2009
200901384683D MODEL RETRIEVAL METHOD AND SYSTEM - The present invention provides a 3D model retrieval system designed to extract feature vectors of 3D models to retrieve a similar model. Image feature vectors are extracted by subjecting target 3D models to rendering from various directions by using a random rotation generator and a 2D image generator. Then, the image feature vectors are registered in an image feature vectors database. Image feature vectors are extracted by subjecting a query 3D model to rendering from various directions by using another random rotation generator and another 2D image generator. The image feature vectors are compared to the contents of the image feature vectors database, thereby retrieving a 3D model.05-28-2009
20090138461METHOD FOR DISCOVERING DESIGN DOCUMENTS - Techniques for obtaining a lineage of a schema in one or more documents are provided. The techniques include using a schema to find a document that is most relevant to the schema, obtaining one or more relevant portions of the most relevant document that is related to the schema, constructing a first probe set from the one or more relevant portions of the document, using the first probe set to discover one or more documents for obtaining lineage information, discovering a second probe set from the one or more documents, and recursively using the second probe set to discover a related document.05-28-2009
20090138462SYSTEM AND COMPUTER PROGRAM PRODUCT FOR DISCOVERING DESIGN DOCUMENTS - Techniques for obtaining a lineage of a schema in one or more documents are provided. The techniques include using a schema to find a document that is most relevant to the schema, obtaining one or more relevant portions of the most relevant document that is related to the schema, constructing a first probe set from the one or more relevant portions of the document, using the first probe set to discover one or more documents for obtaining lineage information, discovering a second probe set from the one or more documents, and recursively using the second probe set to discover a related document.05-28-2009
20090138460System and Method of Determining Relationship Information - Systems and methods of determining relationship information are provided. A system may include processing logic and memory accessible to the processing logic. The memory may include instructions executable by the processing logic to access communication data associated with at least one first party. The memory may also include instructions executable by the processing logic to analyze a plurality of communications between the at least one first party and at least one second party to determine relationship information descriptive of a relationship between the at least one first party and the at least one second party. The communication data may include a call log and an email log, and the plurality of communications may include at least one call and at least one email message.05-28-2009
20090138459System and Method of Searching for Video Content - A method of searching video content includes searching the video content according to criteria defined by a user, and sending a notice to an electronic calendar with at least one entry that meets the criteria. A graphical user interface performing the method is also disclosed.05-28-2009
20090138463OPTIMIZATION OF RANKING MEASURES AS A STRUCTURED OUTPUT PROBLEM - Methods, systems, and apparatuses for generating relevance functions for ranking documents obtained in searches are provided. One or more features to be used as predictor variables in the construction of a relevance function are determined. The relevance function is parameterized by one or more coefficients. An ideal query error is defined that measures, for a given query, a difference between a ranking generated by the relevance function and a ranking based on a training set. According to a structured output learning framework, values for the coefficients of the relevance function are determined to substantially minimize an objective function that depends on a continuous upper bound of the defined ideal query error.05-28-2009
20090138464Method for removing network effects from search engine results - A method and apparatus for ranking results from a search engine query is described. In one embodiment, the search engine provides results from a search query. The results contain a list of web pages where each web page has one or more inbound links. The search engine computes the growth of the number of inbound links of each web page over a predefined period of time. The search engine ranks each web page based on a function of its respective computed growth of the number of inbound links.05-28-2009
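As a rough illustration of the preceding entry (20090138464), ranking pages by the growth of their inbound links could be sketched as below. The abstract only says the rank is "a function of" the computed growth, so the plain difference in counts used here is an assumption, as are the names `rank_by_link_growth`, `counts_start` and `counts_end`.

```python
def rank_by_link_growth(counts_start, counts_end):
    """Rank pages by growth in inbound links over a period (hypothetical sketch).

    counts_start / counts_end map page -> inbound-link count at the start
    and end of the period. Pages missing from counts_start are treated as
    having had zero inbound links (an assumption).
    """
    growth = {
        page: counts_end[page] - counts_start.get(page, 0)
        for page in counts_end
    }
    # Highest growth first; ties are broken by page name for determinism.
    return sorted(growth, key=lambda page: (-growth[page], page))
```

A page with many links but little recent growth ranks below a smaller page that is gaining links quickly, which is the effect the entry describes.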
20090287681MULTI-MODAL SEARCH WILDCARDS - A multi-modal search system (and corresponding methodology) that employs wildcards is provided. Wildcards can be employed in the search query either initiated by the user or inferred by the system. These wildcards can represent uncertainty conveyed by a user in a multi-modal search query input. In examples, the words “something” or “whatchamacallit” can be used to convey uncertainty and partial knowledge about portions of the query and to dynamically trigger wildcard generation.11-19-2009
20090049030SYSTEM AND METHOD FOR REDUCING THE MULTIPLE LISTING OF A MEDIA ITEM IN A PLAYLIST - A system and method for reducing the multiple listing of a media item in a playlist are disclosed. When a media item recommendation is received from a recommender, a playlist may be reviewed to determine whether there is a current listing of the media item in the playlist. If there is a current listing, a resultant listing may be provided. If the current listing is based on a recommendation from the recommender, the media item recommendation may be disregarded. In such case, the resultant listing comprises the current listing. Alternatively, the resultant listing may be provided such that the information in the received media item replaces the information in the current listing in the playlist to avoid multiple listings. Or, the resultant listing may comprise information contained in the media item recommendation merged with information in the current listing. In such case, information regarding both the received media item and the current listing may be preserved while avoiding multiple listings of the media item in the playlist. Further, providing the resultant listing may comprise removing all information in the current listing if the media item is removed from the playlist. In this manner, the cluttering of the playlist with multiple listings of the media item and redundant information associated with the media item may be avoided.02-19-2009
20090282022WEB BROWSER ACCESSIBLE SEARCH ENGINE THAT IDENTIFIES SEARCH RESULT MAXIMA THROUGH USER SEARCH FLOW AND RESULT CONTENT COMPARISON - An Internet infrastructure contains a search server that delivers search result pages of web sites to client devices based upon a search string. Maxima categories are provided that sort search results or web pages based upon popularity and/or context similarity. A web browser contained within a client device is coupled to display various search result pages of web sites delivered by the search server. A maxima determination module within the search server responds to the delivery of the initial search string by first categorizing search results' applicability to the search string on the basis of maxima or by generating maxima categories with search results contained therein that correlate to the search string. These search results within each applicable maximum are then sorted on the basis of popularity within each of the maxima categories to effectuate popularity ranks for each search result or web page. User interaction with search results is monitored to better select search maxima and popularity ranks for subsequent search result requests for this search string, whereby the desirability of search results provided to the user improves over time.11-12-2009
20090276415SYSTEM AND METHOD FOR AUTOMATICALLY PROCESSING CANDIDATE RESUMES AND JOB SPECIFICATIONS EXPRESSED IN NATURAL LANGUAGE INTO A COMMON, NORMALIZED, VALIDATED FORM - Systems and methods for automatically processing candidate resumes and job specifications expressed in natural language into a common, normalized, validated form. Candidate resumes and job specifications are received in electronic form and expressed in natural language. The natural language expressions of the candidate resumes and job specifications are analyzed to extract elements expressed in candidate resumes and job specifications. Each extracted element is validated against a database of valid words or phrases. The extracted, validated elements are converted for each candidate resume or job specification into a corresponding set of synonymous elements. The synonymous elements are expressed in a common form used across all candidate resumes and job specifications processed by the method. A set of candidate resumes is matched with a corresponding job specification by comparing the set of elements expressed in common form for the resumes with the set of elements expressed in common form for the job specification.11-05-2009
20090006364EXTENDING A SEED LIST TO SUPPORT METADATA MAPPING - Embodiments of the present invention address deficiencies of the art in respect to crawling content and provide a method, system and computer program product for metadata processing for seed lists for structured content sources. In one embodiment, a method for processing metadata for a seed list can include extracting metadata from a seed list for application content, storing the metadata in a repository, associating the metadata with fields of the application content, crawling the fields of the application content by reference to the metadata, and indexing the fields. In an aspect of the embodiment, the method further can include annotating the application to produce metadata for the fields of the application content. In yet another aspect of the embodiment, the method can include mapping the metadata to a document schema generic to a plurality of heterogeneous application content.01-01-2009
20090006362HIERARCHICAL SEEDLISTS FOR APPLICATION DATA - Embodiments of the present invention address deficiencies of the art in respect to crawling content and provide a novel and non-obvious method, system and computer program product for seed lists for hierarchically structured content sources. In one embodiment, a method for crawling seed lists for hierarchically structured content sources can be provided. The method can include specifying a depth of crawling for hierarchically structured content, crawling only seed lists at the specified depth among other seed lists in a hierarchy of seed lists mimicking the hierarchically structured content, and returning indexed data for the crawled seed lists. Optionally, an administrator user interface can be provided for specifying the depth of crawling for the hierarchically structured content.01-01-2009
20100005094APPARATUS AND METHOD FOR ANALYZING PATENT CLAIM VALIDITY - A computer system, method, and storage medium with embedded code automate analysis of validity of patent document claims. In embodiments, the computer system receives an identifier of the patent document and a claim, retrieves text of the patent document, parses the text to identify contextually important key terms of the claim, and then formulates one or more queries that include key terms and a priority date relating to the patent document. The system launches the queries and receives search results. From the results, anticipatory candidate members and obviousness candidate members are determined. If the total number of the members is excessive, the queries are reformulated more restrictively, and the search repeated. The system determines contextual relevance of the members and arranges the members in order of their relevance.01-07-2010
20100005093KNOWLEDGE FILTER - A method and system for sharing knowledge is disclosed. The method and system comprises receiving information input into a database and organizing items of information in the database. The method and system further includes collecting ratings and comments associated with each item of information and allowing users to access and sort items of information according to selected rating criteria in order to find the most reliable and/or valuable information from the database. In a second aspect, an interface for providing information concerning a subject is disclosed. The interface comprises a first area that shows the subject and contributor name; and a second area that shows the content of the information item. The interface includes a third area that shows ratings related to the subject; and a fourth area that allows users to submit ratings for the information item. Accordingly, a knowledge sharing system and interface are provided which allow every member of a knowledge sharing group to benefit from aggregate knowledge, experience and opinions of other members of the group. The system and method allow individual members to easily locate the information from a collectively generated knowledge base that is most consistent with that individual's personal measures of value in the information.01-07-2010
20090006372METHOD AND APPARATUS TO REORDER SERACH RESULTS IN VIEW OF IDENTIFIED INFORMATION OF INTEREST - Various embodiments described herein provide systems, methods, and software to automatically reorder search results presented to users based on information specific to the user or the computing environment of the user. Some embodiments include a data store holding user or environment specific data that is used to identify search results that are more likely to be relevant to the user. These and other embodiments are described in greater detail herein.01-01-2009
20090006390Compiling Information Obtained By Combinatorial Searching - Some embodiments, among others, include a search for sensitive information. Once a result of the search has been obtained, a score is assigned to the obtained result in accordance with a predefined criterion.01-01-2009
20090006387SYSTEM AND METHOD FOR MEASURING THE QUALITY OF DOCUMENT SETS - Systems and methods are described that calculate the interestingness of a set of one or more records in a database, either absolutely (i.e., compared to an overall collection of records) or relative to some other set of records. In one embodiment, the measure is a relative entropy value that has been normalized. Various applications of the measure are described in the context of an information retrieval system. These applications include, for example, guiding query interpretation, guiding view selection and summarization, intelligent ranges, event detection, concept triggers and interpreting user actions, hierarchy discovery, and adaptive data mining.01-01-2009
20090006383SYSTEM AND METHOD FOR MEASURING THE QUALITY OF DOCUMENT SETS - Systems and methods are described that calculate the interestingness of a set of one or more records in a database, either absolutely (i.e., compared to an overall collection of records) or relative to some other set of records. In one embodiment, the measure is a relative entropy value that has been normalized. Various applications of the measure are described in the context of an information retrieval system. These applications include, for example, guiding query interpretation, guiding view selection and summarization, intelligent ranges, event detection, concept triggers and interpreting user actions, hierarchy discovery, and adaptive data mining.01-01-2009
20090006382SYSTEM AND METHOD FOR MEASURING THE QUALITY OF DOCUMENT SETS - Systems and methods are described that calculate the interestingness of a set of one or more records in a database, either absolutely (i.e., compared to an overall collection of records) or relative to some other set of records. In one embodiment, the measure is a relative entropy value that has been normalized. Various applications of the measure are described in the context of an information retrieval system. These applications include, for example, guiding query interpretation, guiding view selection and summarization, intelligent ranges, event detection, concept triggers and interpreting user actions, hierarchy discovery, and adaptive data mining.01-01-2009
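The three entries above describe a normalized relative entropy measure without giving a formula. A minimal sketch, assuming term-frequency distributions and one possible normalization (dividing by the upper bound -log2 of the smallest collection probability), might look like this. The function names and the choice of normalization are assumptions for illustration, not the applications' method.

```python
import math
from collections import Counter


def relative_entropy(subset_terms, collection_terms):
    """Relative entropy D(P || Q) in bits, P = subset, Q = whole collection.

    Assumes every term in the subset also occurs in the collection,
    which holds when the subset is drawn from the collection.
    """
    p = Counter(subset_terms)
    q = Counter(collection_terms)
    n_p, n_q = sum(p.values()), sum(q.values())
    total = 0.0
    for term, count in p.items():
        p_t = count / n_p
        q_t = q[term] / n_q
        total += p_t * math.log2(p_t / q_t)
    return total


def normalized_relative_entropy(subset_terms, collection_terms):
    """Scale D(P || Q) into [0, 1] by its bound -log2(min Q) (an assumed choice)."""
    q = Counter(collection_terms)
    bound = -math.log2(min(q.values()) / sum(q.values()))
    if bound == 0:  # collection has a single distinct term
        return 0.0
    return relative_entropy(subset_terms, collection_terms) / bound
```

A subset whose term distribution mirrors the collection scores 0, while a subset concentrated on rare terms scores closer to 1.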
20090006380System and Method for Tracking Database Disclosures - A system and method is provided for identifying the source of an unauthorized database disclosure. The system and method stores a plurality of past database queries and determines the relevance of the results of the past database queries (query results) to a sensitive table containing the unauthorized disclosed data. The system and method also ranks the past database queries based on the determined relevance. A list of the most relevant past database queries can then be generated which are ranked according to the relevance, such that the highest ranked queries on the list are most similar to said disclosed data. Three techniques used in embodiments of the invention include partial tuple matching, statistical linkage and deviation probability gain.01-01-2009
20090006376Closest User Terminal Search Method for a Telecommunication Network and Service Node Applying Such a Method - Service node for a telecommunication network (01-01-2009
20090006374RECOMMENDATION SYSTEM WITH MULTIPLE INTEGRATED RECOMMENDERS - A recommendations system is provided in various embodiments for selecting items to recommend to a user. The system includes a recommendation engine with a plurality of recommenders, and each recommender identifies a different type of reason for recommending items. In one embodiment, each recommender retrieves item preference data and generates candidate recommendations responsive to a subset of that data. The recommenders also score the candidate recommendations. In certain embodiments, a normalization engine normalizes the scores of the candidate recommendations provided by each recommender. A candidate selector selects at least a portion of the candidate recommendations based on the normalized scores to provide as recommendations to the user. The candidate selector also outputs the recommendations with associated reasons for recommending the items.01-01-2009
20090006373RECOMMENDATION SYSTEM WITH MULTIPLE INTEGRATED RECOMMENDERS - A recommendations system is provided in various embodiments for selecting items to recommend to a user. The system includes a recommendation engine with a plurality of recommenders, and each recommender identifies a different type of reason for recommending items. In one embodiment, each recommender retrieves item preference data and generates candidate recommendations responsive to a subset of that data. The recommenders also score the candidate recommendations. In certain embodiments, a normalization engine normalizes the scores of the candidate recommendations provided by each recommender. A candidate selector selects at least a portion of the candidate recommendations based on the normalized scores to provide as recommendations to the user. The candidate selector also outputs the recommendations with associated reasons for recommending the items.01-01-2009
20090006368Automatic Video Recommendation - Automatic video recommendation is described. The recommendation does not require an existing user profile. The source videos are directly compared to a user selected video to determine relevance, which is then used as a basis for video recommendation. The comparison is performed with respect to a weighted feature set including at least one content-based feature, such as a visual feature, an aural feature and a content-derived textural feature. Multimodal implementation including multimodal features (e.g., visual, aural and textural) extracted from the videos is used for more reliable relevance ranking. One embodiment uses an indirect textural feature generated by automatic text categorization based on a set of predefined category hierarchy. Another embodiment uses self-learning based on user click-through history to improve relevance ranking.01-01-2009
20090006367SEARCH-BASED FILTERING FOR PROPERTY GRIDS - Technologies for search-based filtering of a property grid. Such filtering allows a user to enter a search term into an easily recognized search text box, or apply a user or pre-defined term to a property grid, thus reducing the set of properties visible so that the user has a smaller list to search to find the one on which they desire to operate. The search term is typically applied to all properties shown in the property grid. Elements that match the search term are made visible in an updated property grid while those that do not match are not presented. Also, the search term may be applied to more than just the name of the property. It may be applied to a category within which the property appears, the type of the property, or any of a number of attributes or tags that may be applied to the property.01-01-2009
20090006359AUTOMATICALLY FINDING ACRONYMS AND SYNONYMS IN A CORPUS - Acronym and synonym pairs can be identified and retrieved automatically in a corpus and/or across an enterprise based on customer settings globally or for a single instance. Possible acronym and synonym term pairs can be identified using a rule such as a heuristic, user-defined rule. Rules selected by the user can be used to rank acronym and synonym pairs using factors such as occurrence frequency and maximum term length. A rule interpreter engine executes the user defined rule set to properly identify and retrieve the user selected acronym and synonym pairs through the utilization of a shallow pause read step. Finally, the user selected acronym and synonym pairs are ranked according to the user preferences, and can be displayed or held for subsequent use in searching.01-01-2009
20090006357DETERMINING QUALITY MEASURES FOR WEB OBJECTS BASED ON SEARCHER BEHAVIOR - Techniques are provided for generating quality measures for items, including web pages, based on a “random searcher” behavior model. The random searcher behavior model takes into account “implicit” links between items, instead of or in addition to the explicit links. After identifying the implicit links between items, the implicit links may be used as the basis for generating quality measures for the items to which the implicit links point. A variety of types of implicit links are described. To facilitate the generation of quality measures for items based on implicit links, a graph of the implicit links may be constructed in a manner similar to a webgraph.01-01-2009
20090006354SYSTEM AND METHOD FOR KNOWLEDGE BASED SEARCH SYSTEM - The present invention provides functionality for conducting a knowledge based search by finding search results from limited topic domains. According to one embodiment, the method of the present invention includes retrieving the context of a given user and identifying a plurality of characteristics associated with the user's context. The one or more characteristics associated with the user's context are displayed to the user and the user may select from the displayed characteristics. One or more items of content are retrieved based upon the user's selection and presented to the user on the user's client device.01-01-2009
20090177652MOBILE SEARCH SERVICE - At least some embodiments of this invention provide for a way to mix mobile content found as a result of searching and/or browsing on the Internet. Aspects of the invention provide software, systems (meaning software and hardware to run the software) or an exchange of signals with users to provide a mobile content service. Other related aspects provide methods for providing or using such a search service. According to one aspect there is provided a query server to provide a search service for searching computer accessible content, the query server being arranged to receive a search query from a user on a mobile device, output said search query to multiple sources of indexable information, input an individual list of results from each of said multiple sources together with a scoring for each result wherein each result has a position in its associated individual list determined by its scoring, combine said lists of results to form a single combined list wherein results in said single combined list are ranked using a combination of their scoring and position in their respective individual list and send said combined list of search results to a user's mobile device.07-09-2009
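The list-combining step in the entry above can be illustrated with a minimal sketch. The abstract does not say how scoring and position are combined, so the weighted sum of score and reciprocal position below, the `combine_results` name, and the `score_weight` parameter are all assumptions made for illustration, not the claimed method.

```python
def combine_results(lists, score_weight=0.5):
    """Merge per-source result lists into one ranked list.

    Each input list holds (item, score) pairs. The blend of score and
    reciprocal position is an assumed formula, chosen only to illustrate
    ranking by "a combination of scoring and position".
    """
    combined = {}
    for results in lists:
        # Position within a source list is determined by its scoring.
        ranked = sorted(results, key=lambda r: r[1], reverse=True)
        for position, (item, score) in enumerate(ranked, start=1):
            value = score_weight * score + (1 - score_weight) / position
            # Keep the best value when several sources return the same item.
            combined[item] = max(combined.get(item, 0.0), value)
    return sorted(combined, key=combined.get, reverse=True)


source_a = [("x", 0.9), ("y", 0.4)]
source_b = [("z", 0.8), ("x", 0.2)]
merged = combine_results([source_a, source_b])
```

With these sample inputs, `x` leads its own list with a high score, `z` leads the second list, and `y` trails, so the merged order is `x`, `z`, `y`.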
20090177649Answer Search System and Method - An answer searching system and method is disclosed herein. The answer searching system includes a user, a database, a number of experts and a service platform. The user sends at least one question by a communication device. The database stores the experts' data. The experts correspond to the experts' data stored in the database. The service platform receives the questions sent from the user and searches the database for a suitable expert's data in accordance with the question. The service platform then contacts the expert to answer the question and sends the expert's answer back to the user. The user gives feedback to the expert, and the feedback is saved in the expert's data in the database.07-09-2009
20090177653IMAGE PROCESSING APPARATUS AND IMAGE PROCESSING METHOD - In an image processing apparatus according to the present invention, a scanning unit reads out image information regarding an original document; an analyzing unit extracts layout information regarding character regions and character addition information added to characters within the character regions from the image information; an OCR processing unit converts the character regions included in the layout information extracted by the analyzing unit into character information; an extracting unit extracts one or more keywords comprised of a plurality of characters from the character information; a searching unit obtains meta-information by use of the extracted keywords; and an electronic document generating unit generates an electronic document according to description of a predetermined format by adding the meta-information to the character information. According to the present invention, it is possible to properly add secondarily usable information to digitized information.07-09-2009
20090177651INFORMATION PROCESSING DEVICE AND METHOD, PROGRAM, AND RECORDING MEDIUM - An information processing device includes: a user information obtaining unit configured to obtain information relating to data of content that a user has used; a meta information obtaining unit configured to obtain content meta information corresponding to content that the user has used; a first vector generating unit configured to generate a first user preference vector with each of the obtained content meta information as elements thereof; a second vector generating unit configured to generate a second user preference vector wherein the generated first user preference vector is analyzed and the number of elements of the first user preference vector is compressed; and a user identifying unit configured to identify a user corresponding to a second user preference vector having a high similarity to a second user preference vector determined beforehand from multiple second user preference vectors.07-09-2009
20090019039LAYERED AUGMENTATION FOR WEB CONTENT - Embodiments of the present disclosure include methods (and corresponding systems and computer program products) that augment content in web pages with resources and provide the resources based on user interaction with the augmented content in the web pages. The disclosed embodiments analyze a web page to identify a keyword, locate a piece of reference data matching the identified keyword, generate an association of the located piece of reference data and the keyword, and embed the association in an augmented web page. Upon receiving a request from a client computer corresponding to a pointer being positioned over the keyword in the augmented web page, the disclosed embodiments determine relevant resources, and transmit the resources to the client computer for display in a multi-layered dialog box, such that a viewer can access the plurality of resources by interacting with the multi-layered dialog box without leaving the augmented web page.01-15-2009
20090019034MEDIA DISCOVERY AND PLAYLIST GENERATION - A computer implemented method and system that generates a video playlist having recommended videos based on a user query object is disclosed. A user query object is used to search for a number of web pages. Summaries are generated for the returned web page search results. Valuable terms and phrases from those summaries may be extracted and used to search video storage sites based on the original user query. Playable videos returned from the video storage sites may be compared to the user query or to the extracted terms and phrases in order to rank the videos, and the most relevant videos may be returned. Those videos may be displayed to the user as a playlist in an Internet browser having an embedded video player.01-15-2009
20090187558METHOD AND SYSTEM FOR DISPLAYING SEARCH RESULTS - Methods and systems related to the display of primary and secondary search results are provided. Search results are displayed to the user without requiring the user to perform any tasks to view the entire set of search results. The user may then request secondary searches based on the displayed primary search results through performing a single action. Secondary search results are displayed along with the primary search results.07-23-2009
20090187554SPECIFYING WEIGHTED SEARCH TERMS FOR A SEARCH ENGINE - A content searching data processing system can be configured for specifying weighted search terms for a search engine. The system can include a search engine executing in a host server. The search engine can be coupled to a search index and can be configured for communicative coupling to different content browsers executing in respective clients over a computer communications network. Finally, the system can include weighted search term logic coupled to the search engine. The logic can include program code enabled to render a search term entry user interface in which variable weights are specified for corresponding search terms, to assign the variable weights to the corresponding search terms and to issue a search to the search engine with the search terms and variable weights and to return results of the search to a requesting one of the content browsers.07-23-2009
20090063477RESEARCH RAPIDITY AND EFFICIENCY IMPROVEMENT BY ANALYSIS OF RESEARCH ARTIFACT SIMILARITY - Methods for comparing query-related objects are provided. In one embodiment, a first plurality of query-related objects for a first user is compared to a second plurality of query-related objects for a second user to determine a degree of similarity between the first and second plurality of query-related objects. A notification of the degree of similarity is issued.03-05-2009
20090063453APPARATUS, SYSTEM, AND METHOD FOR EXECUTING A DISTRIBUTED SPATIAL DATA QUERY - An apparatus, system, and method for executing a distributed spatial data query. The present invention allows a client to perform spatial queries against spatial data stored in various formats in various separate databases. A view of the data is created in the relevant databases, wherein the spatial data is converted to WKB and stored as a BLOB. A federated server contains nicknames for the various database views, and also contains views of the data where the BLOB is converted back to a spatial data type. The federated server presents to clients an application view of the distributed heterogeneous spatial data such that the clients can treat the data as if it were a homogenous data source. Also taught is incorporating distributed non-spatial data into the application view by creating a nickname and a view on top of the nickname which derives spatial information from the non-spatial location information.03-05-2009
20090063447UPDATING RETRIEVABILITY AIDS OF INFORMATION SETS WITH SEARCH TERMS AND FOLKSONOMY TAGS - Provided are techniques for updating retrievability aids. A search request including one or more search terms is received. Each of the one or more search terms is captured. A list of topics is provided to a user as search results. User selection of a topic in the list of topics is received. After reviewing the topic, the user adds one or more folksonomy tags to the topic. The one or more folksonomy tags added by the user to the topic are captured. Each of the one or more search terms and each of the one or more folksonomy tags are mapped to the topic. For each of the search terms, based on a number of times that the search term has been used to search for the topic, the search term is added to one or more retrievability aids. For each of the one or more folksonomy tags, based on a number of times that the folksonomy tag has been applied to the topic, the folksonomy tag is added to at least one of the one or more retrievability aids.03-05-2009
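The count-based promotion described in the entry above can be sketched as follows. The abstract only says that terms and tags are added "based on a number of times" they were used, so the fixed `threshold`, the `RetrievabilityAid` class, and its `record` method are hypothetical names and choices introduced for illustration.

```python
from collections import Counter


class RetrievabilityAid:
    """Toy retrievability aid: promotes a search term or folksonomy tag
    once it has been seen for a topic at least `threshold` times.
    The threshold rule is an assumption, not taken from the abstract."""

    def __init__(self, threshold=3):
        self.threshold = threshold
        self.counts = Counter()  # (topic, term) -> number of times captured
        self.index = {}          # topic -> set of promoted terms

    def record(self, topic, term):
        """Capture one use of `term` (search term or tag) for `topic`."""
        self.counts[(topic, term)] += 1
        if self.counts[(topic, term)] >= self.threshold:
            self.index.setdefault(topic, set()).add(term)


aid = RetrievabilityAid(threshold=2)
aid.record("printing", "duplex")
aid.record("printing", "duplex")
aid.record("printing", "toner")
```

Here `duplex` reaches the assumed threshold and is added to the aid for the `printing` topic, while `toner`, seen only once, is not.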
20090063446SYSTEM AND METHOD FOR PROVIDING VECTOR TERMS RELATED TO INSTANT MESSAGING CONVERSATIONS - The method according to one embodiment of the present invention comprises retrieving one or more terms or phrases comprising an instant messaging conversation in which one or more users are participating. One or more term vectors comprising one or more vector terms associated with the one or more retrieved terms or phrases comprising the instant messaging conversation are generated and one or more vector terms are selected from said term vectors. The one or more selected vector terms are displayed to the one or more users participating in the instant messaging conversation. An indication of a user selection of a given displayed vector term is received and one or more content items responsive to the selected vector term are identified.03-05-2009
20090063444System and Method for Providing Multiple Redundant Direct Routes Between Supernodes of a Multi-Tiered Full-Graph Interconnect Architecture - A method, computer program product, and system are provided for selecting, from a plurality of routes through the data processing system, a direct route for transmitting data. Data that includes address information is received at a first processor that is to be transmitted to a destination processor. Using routing table data structures, direct route entries are identified that correspond to direct routes for transmitting data. An accessed priority table data structure comprises a priority entry for each entry in the routing table data structures. The priority entry specifies a priority of a corresponding entry in the routing table data structures. A direct route entry is selected that corresponds to a direct route from the routing table data structures, based on specified priorities. Then the data is transmitted from the first processor to the destination processor using a path corresponding to the selected direct route entry.03-05-2009
20090063442METHOD AND SYSTEM FOR PROVIDING VALUE HELP FEATURES TO INPUT FIELDS GENERATED FOR DYNAMICALLY SELECTED COLUMNS - A method and system for providing value help features to input fields generated for dynamically selected columns is provided. A user interface element is generated for a dynamic key from the metadata. The user interface element has a name field and a value input field. A generic query having an attribute group is provided. The attribute group includes a name attribute, a code value attribute and an identifier value attribute. A sequence number is extracted from the metadata. It is determined from the sequence number whether the user interface element is to be bound to the attribute group or not. A field type of the dynamic key is determined from the metadata. The name field is bound to the name attribute. The value input field is bound to the code value attribute if the field type is a code type. The value input field is bound to the identifier value attribute if the field type is an identifier type. A query descriptor is determined from the metadata. The query descriptor includes a first input parameter node and a first result node. The first input parameter node has a first attribute. The query descriptor is copied to a dummy query. The dummy query includes a second input parameter node and a second result node. The first input parameter node is identical to the second input parameter node and the first result node is identical to the second result node. The second input parameter node includes a second attribute identical to the first attribute. A screen is generated from the dummy query. The screen has a first input field bound to the second attribute and a result table bound to the second result node. A second input field is populated with a value of the second attribute from the second result node.03-05-2009
20090327283TECHNIQUES FOR WEB SITE INTEGRATION - Disclosed is a method and device for finding documents, such as Web pages, for presentation to a user, automatically or in response to a user expression of interest, which documents are part of a Web site being accessed by the user, and which documents relate to a document, such as a Web page, being accessed in the Web site. The method takes advantage of information retrieval techniques. The method generates the search query to use to find documents by reference to the text of the document in the Web site being accessed by the user. The method further uses a weighting function to weigh the terms used in the search query.12-31-2009
20090327280Methods And Systems For Increasing Protein Food Safety - Methods, systems, and devices for increasing protein food safety are provided. According to one embodiment, a method in a computer system for increasing protein food safety includes steps: (a) receiving contamination level data; (b) accessing from a database stored data comprising prior contamination level data, prior interventions associated with the prior contamination level data, and prior actual results associated with the prior contamination level data; (c) selecting a subset of the prior contamination level data, the prior interventions, and the prior actual results, where the prior contamination level data is similar to the contamination level data; (d) determining if an effective intervention is set forth in the subset based at least partially on the prior actual results in the subset; and (e) if an effective intervention is not set forth in the subset, causing an intervention to be output that is increased relative to the intervention in the subset.12-31-2009
20090327282SOCIAL MOBILE SEARCH - Search and retrieval of information shared between members of a social mobile network is facilitated. The search returns results in an order that is based on relevance measurements unique to each member. A comprehensive map of the interactions and behaviors that take place between the members of a social mobile network is discovered and maintained. Such maps are used to assign a unique relevance measurement for each member of each social mobile network.12-31-2009
20090327281METHOD AND SYSTEM FOR RANKING WEB PAGES IN A SEARCH ENGINE BASED ON DIRECT EVIDENCE OF INTEREST TO END USERS - A method and system for ranking Web pages in a Web search engine is described. One illustrative embodiment receives a Web search query from a particular user, the query including at least one keyword; identifies one or more Web pages that contain the at least one keyword; determines, for each of the one or more Web pages, a raw page ranking; adjusts the raw page ranking of each of at least one Web page among the one or more Web pages based on direct evidence of how interesting that Web page is to users to produce an adjusted page ranking, the direct evidence being derived from clickstream data collected from the users; and presents, as search results, the at least one Web page to the particular user in accordance with the adjusted page rankings.12-31-2009
20090327277Methods and apparatus for reusing data access and presentation elements - A method and system for editing a document is provided in which fragments, such as data and data presentation information, are identified, and metadata is generated for each fragment. A record associated with each fragment and related metadata is entered into a fragment database. The fragment database can be searched by users to locate fragments for insertion into a document. The located fragment and any dependent fragments can then be inserted into the document.12-31-2009
20090327273METHOD FOR PROVIDING SEARCH AREA COVERAGE INFORMATION - A method for displaying search area coverage is provided. A searched area (12-31-2009
20090327274PREFETCHING DATA FOR DOCUMENT RANKING - The subject matter disclosed herein relates to prefetching data for use in ranking of electronic documents via a document ranking component.12-31-2009
20090327271Information Retrieval with Unified Search Using Multiple Facets - Information retrieval with unified search between heterogeneous objects is described. The method includes: indexing a first object as a document in a search index; referencing a second object related to the first object in a facet of the document; and storing a relationship strength between the first and second objects in the facet of the document in the search index. Multiple heterogeneous objects can be related to the first object and referenced in multiple facets of the document, each with its relationship strength to the first object. Scoring an indirect object by indirect relation to a query object can be carried out by aggregating the relationship strengths between the indirect object and the retrieved objects multiplied by the retrieved objects' direct scores of relationship strength to the query object.12-31-2009
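The indirect scoring in the entry above (aggregating relationship strengths multiplied by the retrieved objects' direct scores) can be sketched as below. The dictionary layout, the `score_indirect` name, and the sample object names are assumptions for illustration, and summation is assumed as the aggregation.

```python
from collections import defaultdict


def score_indirect(direct_scores, facets):
    """Score objects that are only indirectly related to the query.

    direct_scores: {retrieved_object: direct score against the query}
    facets: {retrieved_object: {related_object: relationship strength}}
    Each indirect score is the sum of strength * direct score over all
    retrieved objects that reference the related object.
    """
    totals = defaultdict(float)
    for obj, direct in direct_scores.items():
        for related, strength in facets.get(obj, {}).items():
            totals[related] += strength * direct
    return dict(totals)


direct_scores = {"doc_a": 0.5, "doc_b": 0.25}
facets = {
    "doc_a": {"alice": 1.0, "bob": 0.5},
    "doc_b": {"alice": 0.5},
}
indirect = score_indirect(direct_scores, facets)
```

In this sample, `alice` is referenced by both retrieved documents and scores 1.0 × 0.5 + 0.5 × 0.25 = 0.625, while `bob` scores 0.5 × 0.5 = 0.25.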
20090327270Using Variation in User Interest to Enhance the Search Experience - Searches can be enhanced by custom-tailoring results based on a consideration of the variability of the goals of a search given a query. In an example embodiment, a system to enhance searching includes a search interface, a search-goal variability determiner, and a search experience enhancer. The search interface accepts a query from a user as input for a search. The variability determiner determines the variability in user interest (e.g., goals) for the query. The measure of variability in user interest may reflect the degree of variation in the goals of different users or groups of users for the query. The search experience enhancer enhances a search experience for the user responsive to the variability in user interest (e.g., in terms of search goals).12-31-2009
20090327268PROVIDING TARGETED INFORMATION FOR ENTERTAINMENT-ORIENTED SEARCHES - Systems and methods for providing immediate access to comprehensive information and answers on a set of related search engine results pages for common searches executed in the entertainment domain relating to, for instance, music, musicians, movies and celebrities. Upon receipt of a keyword-based search query, a decision is made regarding what the user actually wanted to see as a search result. This information is then automatically presented in a dedicated region of the keyword search results page, typically with links to more refined information. Upon selection of a link, the refined information is also displayed in a dedicated region of the keyword search results page. In this way, the user does not have to navigate multiple, different user interfaces on a variety of different web sites in order to view the information desired.12-31-2009
20090327267BASING SEARCH RESULTS ON METADATA OF PRIOR RESULTS - Embodiments of the invention provide a method, system, and media for determining search results based on a query. One embodiment of the method includes receiving an initial query, inspecting an initial set of query-related information that is associated with the query, which is the fruit of analyzing aggregated user-interaction data, which includes information related to how users have previously interacted with former search results that were presented in response to the query. This information includes prior metadata associated with the former search results. Embodiments further include presenting an initial set of search results based on the initial set of query-related information, gathering current user-interaction data, and updating the initial set of query-related information based on the current user-interaction data. In this way, an embodiment of the invention helps, among other things, map a semantic meaning of a query to results that bring about a satisfying user experience.12-31-2009
20090327266Index Optimization for Ranking Using a Linear Model - Technologies are described herein for providing a more efficient approach to ranking search results. One method reduces an amount of ranking data analyzed at query time. In the method, a term is selected, at index time, from a master index. The term corresponds to a number of documents greater than a threshold. A set of documents that includes the term is selected based on the master index. A rank is determined for each document in the set of documents that contains the term. Each document in the set of documents that contains the term is assigned to a high ranking index or a low ranking index based on the simple rank.12-31-2009
20090327263BACKGROUND CONTEXTUAL CONVERSATIONAL SEARCH - A method of generating search queries based on digitized audio from conversations, including: providing a database having a global hot-list of universal popular keywords or phrases and a personalized entity list comprising keywords and phrases used with a frequency above a determined threshold value in conversations involving a user; monitoring a conversation between at least two people, including the user; identifying words or phrases in digitized audio of the monitored conversation through speech recognition; comparing the identified words or phrases to the keywords and phrases in the database to find any matches; generating a search string, without the user requesting a search, based on words or phrases found to match the keywords or phrases stored in the database; submitting the search string to a search engine as a search query; and serving a set of search results returned by the search engine to a display device of the user.12-31-2009
20090327262Management of Deletion Requests for Related Documents in a Content Management System - A method and system for the management of deletion requests for related documents in a content management system. When a user requests to delete a related document associated with a record in the content management system, the system determines whether there are other documents associated with the record, collects the documents and determines if any of the collected documents are parent documents, removes the association between each collected document and the record, and initiates a deletion process to delete the parent documents.12-31-2009
20090327261SEARCH TECHNIQUES FOR RICH INTERNET APPLICATIONS - A computing device includes one or more rich internet application (RIA) client engines. Each RIA client engine includes a corresponding private RIA storage area. The computing device also includes a per-RIA public storage area for each RIA. The per-RIA public storage area includes a subset of data items in the private RIA storage area of the corresponding RIA client engine. A search engine of the computing device may search the data items in the one or more per-RIA public storage areas and link to content in the private RIA storage area of the corresponding RIA client engine at a given data item matching a search request.12-31-2009
20090055393METHOD AND SYSTEM FOR FACILITATING INFORMATION SEARCHING ON ELECTRONIC DEVICES BASED ON METADATA INFORMATION - A method and system for facilitating information searching for a user of an electronic device is provided. One implementation involves, on a client side, obtaining metadata for content accessed by a user via the electronic device, displaying terms based on said metadata for user selection, receiving a user selection including receiving selection of one or more of said terms from the user, forming a query based on the user selection to search for related data, and extracting data of interest to the user based on said query.02-26-2009
20090055387Apparatus and method for targeted distribution of search index fragments over a wireless communication network - A system and method for identifying portions of an index related to prior search requests sent from a wireless data processing device and transmitting the portions of the index to the wireless data processing device to be used for local searches. Specifically, a method according to one embodiment of the invention comprises: collecting information related to a plurality of content located over a network; automatically generating and continually updating an index for the plurality of content as new content is identified; analyzing search requests transmitted from a wireless data processing device; based on the analysis, identifying portions of the index relevant to the search requests; transmitting the portions of the index to the wireless data processing device; and executing subsequent search requests using the portions of the index stored on the wireless data processing device.02-26-2009
20090055380Predictive Stemming for Web Search with Statistical Machine Translation Models - Techniques for determining when and how to transform words in a query to return the most relevant search results while minimizing computational overhead are provided. A dictionary is generated based upon words used in a specified number of previous most frequent search queries and comprises lists of transformations that may include variants based upon the stems of words, synonyms, and abbreviation expansions. When a query is received from a user, candidate queries are generated based upon replacing particular words in the query with a transformation of the particular words. Candidate queries are selected that have a high probability of returning relevant results by computing values of the query using language model scoring and translation scoring. The selected candidate queries and the original query are executed to return search results. The search results are displayed to the user with the words in the original query and the transformed words in bold.02-26-2009
20090055374METHOD AND APPARATUS FOR GENERATING SEARCH KEYS BASED ON PROFILE INFORMATION - Methods and apparatus for performing a search using a search keyword and associated aliases for the search keyword are disclosed. According to one aspect of the present invention, a method includes obtaining a search keyword via a user interface and automatically determining if there is at least one alias for the search keyword by searching a first database using the search keyword. The first database is a profile database that is configured to include a plurality of profiles that contain contact information, including a first profile that contains the search keyword. The method also includes automatically searching for at least one document using the alias if there is one, and the search keyword. The document is associated with a document data source.02-26-2009
20090144256Workflow control in a resource hierarchy - Illustrative embodiments provide a computer implemented method, an apparatus and a computer program product for workflow management control in a resource hierarchy. In one embodiment, the computer implemented method comprises, receiving data, from a plurality of target data sources, into a collection, and synthesizing the received data in the collection to establish a resource hierarchy. The collection is then queried, using criteria in a request for a resource from a requester to provide a selected resource from the collection, forming a response, the selected resource of the response being a best fit result, and returning the response to the requester.06-04-2009
20090144266Search method for entries in a database - A method is provided of searching for one or more text entries in a database. The method includes steps of receiving a search term. The search term may contain a single element (e.g., word, number or abbreviation), or a group of elements such as a combination of words, numbers and/or abbreviations. A pre-processing step is performed on the search term. The pre-processing step includes adding one or more substitutions to the search term in the event that an element of the search term has an equivalent, removing exclusion words from the search term, and removing noise characters. The pre-processing step creates a search string for use in searching of the database. A search of the database is performed for entries that match, at least in part, the search string. The search results are ordered and returned. The method can be coded as software instructions and provided or furnished as a standalone software product.06-04-2009
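The pre-processing step described in the entry above (add substitutions, remove exclusion words, remove noise characters, then match entries at least in part) can be sketched roughly as follows. This is only an illustration under stated assumptions: the substitution table, the exclusion-word set, the noise-character rule, and the ordering by number of matched elements are all hypothetical and are not taken from the patent.

```python
import re

# Hypothetical tables: the abstract does not list actual substitutions or exclusion words.
SUBSTITUTIONS = {"st": ["street"], "rd": ["road"]}
EXCLUSION_WORDS = {"the", "of", "and"}
# Assumption: anything that is not a word character or whitespace counts as noise.
NOISE_CHARS = re.compile(r"[^\w\s]")


def preprocess(search_term: str) -> list[str]:
    """Turn a raw search term into a list of search-string elements."""
    cleaned = NOISE_CHARS.sub(" ", search_term.lower())
    elements = [e for e in cleaned.split() if e not in EXCLUSION_WORDS]
    search_string = []
    for element in elements:
        search_string.append(element)
        # Add equivalents alongside the original element, if any exist.
        search_string.extend(SUBSTITUTIONS.get(element, []))
    return search_string


def search(entries: list[str], search_term: str) -> list[str]:
    """Return entries matching at least one element, best matches first."""
    wanted = set(preprocess(search_term))
    scored = []
    for entry in entries:
        words = set(NOISE_CHARS.sub(" ", entry.lower()).split())
        hits = len(wanted & words)
        if hits:
            scored.append((hits, entry))
    # Assumed ordering: more matched elements ranks higher (the sort is stable).
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [entry for _, entry in scored]
```

For instance, `preprocess("The Bank of Main St.")` drops "the" and "of", strips the trailing period, and adds "street" as an equivalent of "st".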
20090144274METHOD AND SYSTEM FOR FILTERING A TABLE - A method for filtering a table may include creating a filter in response to a user selecting data in a first table. The method may also include applying the filter to at least one other table in response to a user selecting at least one column in the at least one other table.06-04-2009
20090144263SEARCH RESULTS USING A PANEL - Techniques are described to improve search results using a panel. A search engine deploys one or more network traffic monitors. Traffic monitors analyze network traffic and find HTTP requests made to search engines. When a search query is spotted, the traffic monitor records the sequence of user requests, including search engine, search terms, and sites visited. A sequence of queries where a user visits one search engine, enters a query, visits zero or more sites from the results listings, and visits a second search engine, enters a query and visits one or more websites and stops searching is used to determine whether to increase or decrease a relevance value between a search term and the sites visited.06-04-2009
20090144264THIRD-PARTY INFORMATION OVERLAY ON SEARCH RESULTS - Embodiments of the present invention provide systems and methods for integrating third-party information, such as third-party rating information, over the search results. The integrated third-party information in search results provides users additional information to determine which search results to click on for more details. In one embodiment, the methods and systems allow users to choose which third-party data sources to include (or overlay) in their search results. Whenever a user issues a search request to a search engine, which returns search results that correspond to relevant third-party overlay data, the search engine will return a list of the search results integrated with the corresponding third-party data. The integrated third-party information augments the titles, abstracts and link descriptions of search results to help the user determine which search results in the list are relevant. The information, such as rating and review information, provided by third parties trusted by the user can also help the user judge the quality of products and services described in the search results.06-04-2009
20090144265Search engine for searching research data - Searching research data includes receiving one or more search parameters describing desired data, identifying one or more columns or tables of one or more databases that comprise data relevant to the one or more search parameters, dynamically constructing a plurality of instructions for extracting the data from one or more databases, the one or more databases hosted on one or more platforms, and extracting the data from the one or more databases using the plurality of instructions.06-04-2009
20090144262SEARCH QUERY TRANSFORMATION USING DIRECT MANIPULATION - A search query transformation system and method for transforming and refining a search query are described. Embodiments of the system and method use various graphical components and controls. Direct manipulation ensures that the searcher is driving the changes in the search queries using a pointing device. Embodiments of the search query transformation system and method include a search query re-weighting user interface (UI) component for graphically adjusting and re-weighting weights of search terms, and a search query term replacement UI component for graphically replacing a search term in a query or adding a synonym to the query. Embodiments of the system and method also include a search query suggestion component, which provides query revision recommendations to a searcher that are tailored to the direct manipulation query refinement interface.06-04-2009
20090144260ENABLING SEARCHING ON ABBREVIATED SEARCH TERMS VIA MESSAGING - System and method for processing a search query using partial indexing to enable use of abbreviated search terms in the query. A mobile device sends a search request (e.g. a text message) to a server over a network. Search request terms can include subsets of feature identifiers and function as partial indexes. The search request can include additional context (e.g. to indicate desired service such as restaurant or transportation, or additional geographic information). The server matches the terms to an interim search result such as one or more geographic locations, and then provides information regarding the interim result to the mobile device. Partial indexing of a database or of one or more tables of the database (e.g. for a geographic area) can be adjusted to balance a minimum term size (e.g. minimum number of characters) against an average, maximum, or median number of matching locations or services.06-04-2009
20090144276COMPUTERIZED DATA MINING SYSTEM AND PROGRAM PRODUCT - Under the present invention, a data exploration system, a customized model system and an existing model system are provided. The data exploration system analyzes user data to identify statistical information such as data distribution, data relationships, data outliers and invalid or missing data values. The customized model center iteratively generates customized data mining models in parallel based on permutations of the user data, user-provided business parameters and/or a set of model generation algorithms. The existing model system provides users with a library of existing data mining models, assembled based on the business parameters, from which they can choose one or more. In any event, any customized or existing data mining models selected can be run against the user data in parallel.06-04-2009
20090144273SYSTEM AND METHOD FOR MUSIC AND COMPATIBILITY MATCHING - An exemplary system includes a searching and matching subsystem configured to communicate with an access device and a commercial device over a data communication network, the searching and matching subsystem including, a session module configured to assign a session identifier to a session initiated by an access device, the access device being associated with the session, a data store configured to store playlists of said access devices, and a compatibility module configured to identify compatible user playlists when directed by an access device.06-04-2009
20090144271DYNAMIC CLIENT INTERACTION FOR SEARCH - A system for guiding a search for information is presented. The system comprises a user interface that accepts a phrase and receives at least one suggestion based at least in part on the phrase. The system also includes a phrase suggestion engine that matches the phrase with the at least one suggestion. Methods of using the system are also provided.06-04-2009
20090012955Method and system for continuous, dynamic, adaptive recommendation based on a continuously evolving personal region of interest - Embodiments of the present invention are directed to flexible, user-adapted, continuous searching, on behalf of a particular user, for points of interest relevant to the user's current location within a specifically computed personal region of interest. In a general case, the personal region of interest is computed as a function of the user's level of disposition towards the searched-for points of interest. The level of disposition towards the searched-for points of interest may, in turn, be based on two or more of the user's location, the current date and time, a history of the user's interaction with the POI-searching system, including user-initiated searches and user selections from displayed search results, a user profile developed for, and continuously updated on behalf of, the user, and a current context for the search, as specified by a search query or by other context-specifying means. The personal region of interest generally defines an abstract area, volume, or hypervolume within which method and system embodiments of the present invention search for points of interest.01-08-2009
20090012951SYSTEM AND METHOD FOR EFFICIENT ISSUANCE OF QUERIES - System and method for efficient issuance of queries, such as DirXML script queries, by a policy for a value of an attribute of an object of the target system are described. In one embodiment, the method comprises, responsive to issuance by a policy of a query for a value of a designated attribute of a designated object of a target system, checking a result cache associated with the target system to determine whether the value for the designated attribute of the designated object is stored therein; responsive to a determination that the value for the designated attribute of the designated object is stored in the result cache, returning the value stored in the result cache to the policy; and responsive to a determination that the value for the designated attribute of the designated object is not stored in the result cache, querying the target system for the value of the designated attribute of the designated object.01-08-2009
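The cache-then-query flow in the entry above can be sketched as below. The class name, the `query_target` callable, and the `(object, attribute)` cache key are hypothetical names chosen for illustration. Storing a freshly queried value back into the cache is also an assumption: the abstract only says the target system is queried on a miss.

```python
from typing import Any, Callable, Dict, Tuple


class CachedQuerier:
    """Answers attribute queries from a result cache before asking the target system."""

    def __init__(self, query_target: Callable[[str, str], Any]) -> None:
        # query_target stands in for the real target-system query (hypothetical).
        self._query_target = query_target
        self._result_cache: Dict[Tuple[str, str], Any] = {}

    def query(self, obj: str, attribute: str) -> Any:
        key = (obj, attribute)
        if key in self._result_cache:
            # Cache hit: return the stored value without touching the target system.
            return self._result_cache[key]
        # Cache miss: query the target system for the value.
        value = self._query_target(obj, attribute)
        # Assumption: remember the value so later queries for it become hits.
        self._result_cache[key] = value
        return value
```

A policy that asks for the same attribute of the same object twice would reach the target system only once under this sketch.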
20090012950METHOD AND SYSTEM FOR SEARCHING ACROSS INDEPENDENT APPLICATIONS - A method and system are provided for searching across independent applications. A first seedlist (01-08-2009
20090012956Retrieval of Structured Documents - This disclosure relates to performing a query for a search term of a database containing a plurality of structured documents. Those structured documents that do not include the search term are ferreted or filtered out during an initial search. Matched structured documents which are those structured documents that do contain the search term are evaluated by ranking the individual elements based on how well each individual element matches the search term, and indicating to the user the ranking of the individual elements wherein the individual elements can be accessed by the user.01-08-2009
20080319972SHORT PERIOD SEARCH KEYWORD - A search engine provides a registration of short period search terms to provide memorable key words that allow a person to view a specific Web page without having to remember a long and complicated uniform resource locator. A search engine is modified to weight a specific search term to the highest value for a short period of time. A content provider may pay a price to have a memorable search term registered for a period of time. When the search engine receives a search request with the registered search term, the particular page will be placed on top of the search results due to the increased weight.12-25-2008
20090254539User Intention Modeling For Interactive Image Retrieval - A system performs user intention modeling for interactive image retrieval. In one implementation, the system uses a three stage iterative technique to retrieve images from a database without using any image tags or text descriptors. First, the user submits a query image and the system models the user's search intention and configures a customized search to retrieve relevant images. Then, the system extends a user interface for the user to designate visual features across the retrieved images. The designated visual features refine the intention model and reconfigure the search to retrieve images that match the remodeled intention. Third, the system extends another user interface through which the user can give natural feedback about the retrieved images. The three stages can be iterated to quickly assemble a set of images that accurately fulfills the user's search intention. The system can be used for image searching without text tags, can be used for initial text tag generation, or can be used to complement a conventional tagged-image platform.10-08-2009
20080319985STORAGE MEDIUM, DATA EXTRACTION APPARATUS AND METHOD - One or more extraction conditions for designating data to be extracted can be input in a program. When one or more extraction conditions are input, a data extraction is carried out for each of the extraction conditions and the extracted data is output to an output destination in accordance with the extraction condition that the present data satisfies.12-25-2008
20090006369AUTO-SUMMARY GENERATOR AND FILTER - A system that facilitates data presentation and management is provided. The system includes at least one database to store a corpus of data relating to one or more topics and a summarizer component to automatically determine a subset of the data over the corpus of data relating to at least one of the topic(s), wherein the subset forms a summary of at least one topic.01-01-2009
20080319992SYSTEM AND METHOD FOR PROVIDING PERSONALIZED ONLINE INFORMATION - A method and system for providing personalized and integrated online services for communications and commercial transactions both in private and public venues. The invention provides personalized information that is conveniently accessible through a network of public access stations (or terminals) which are enabled by a personal system access card (e.g., smart card). The invention also provides advertisers the opportunity to directly engage actual and potential user-consumers with selected advertising or marketing content based on each user's profile and usage history.12-25-2008
20090024620Method and Apparatus for Providing Search Result Using Language Chain - Provided is a multiple information retrieval apparatus and method using a language chain to provide directly executable multiple information when a search result regarding a query word is provided. The retrieval apparatus includes: an I/O display window inputting query words; a search button which approves the input query words over the Internet; a search engine database containing information about the query words; a search query word information source having information which sets the query words as a title; a language chain which connects the grouped words with at least one connector; a parsing key for selecting the grouped words, and an image or video display window which can display images or videos using a web-browser. Accordingly, the retrieval apparatus provides multiple domains regarding query words, indices of electronic publishing materials and home page lists.01-22-2009
20090024618SYSTEM AND METHOD FOR INDEXING WEIGHTED-SEQUENCES IN LARGE DATABASES - The present invention provides an index structure for managing weighted-sequences in large databases. A weighted-sequence is defined as a two-dimensional structure in which each element in the sequence is associated with a weight. A series of network events, for instance, is a weighted-sequence because each event is associated with a timestamp. Querying a large sequence database by events' occurrence patterns is a first step towards understanding the temporal causal relationships among the events. The index structure proposed herein enables the efficient retrieval from the database of all subsequences (contiguous and non-contiguous) that match a given query sequence both by events and by weights. The index structure also takes into consideration the nonuniform frequency distribution of events in the sequence data.01-22-2009
20090024617VIDEO PLAYER FOR EXHIBITING CONTENT OF VIDEO SIGNALS WITH CONTENT LINKING TO INFORMATION SOURCES - A method and apparatus for retrieving information relevant to tracked objects appearing in a display of a video signal is disclosed. The method is performed by a viewing computer having stored thereon an augmented display tool. In response to a user requesting the video signal, a content directory storing content information relevant to the tracked objects is acquired from a video-overlay server. The augmented display tool causes the viewing computer to acquire and display the video signal and record a time measurement and spatial coordinates of each point selected by a viewer using a pointing device. The augmented display tool uses the content directory to find an object identifier corresponding to each selected point and extracts relevant information from a global object directory maintained at the video-overlay server.01-22-2009
20090024615System and Method for Creating and Searching Medical Ontologies - A method for creating and searching medical ontologies includes providing a semi-structured information source comprising a plurality of articles linked to each other, each article having one or more sections and each article is associated with a concept, creating a directed unlabeled graph representative of the information source, providing a plurality of labels, labeling a subset of edges, and assigning each unlabeled edge an equal probability of being assigned one of the labels. For each node, the probability of each outgoing edge is updated by smoothing each probability by an overall probability distribution of labels over all outgoing edges of each node, and the probability of each incoming edge is updated the same way. A label with a maximum probability is assigned to an edge if said maximum probability is greater than a predetermined threshold to create a labeled graph.01-22-2009
20090024616CONTENT RETRIEVING DEVICE AND RETRIEVING METHOD - A content retrieving device has: a content storing unit in which are stored a plurality of contents that are associated with one or more character strings; a thesaurus storing unit in which is stored a thesaurus that includes vertical relationship information between character strings; an inputting unit by which a character string is inputted; an extracting unit extracting an associated character string that is associated with an inputted character string, by using the thesaurus and on the basis of association degree information that expresses association degrees between character strings included in the thesaurus by numerical values determined in accordance with the vertical relationship information between the character strings; and a retrieving unit retrieving contents associated with the associated character string and the inputted character string.01-22-2009
20090024614SYSTEMS AND METHODS FOR ONLINE CONTENT SEARCHING - A search engine that can be configured to combine information related to a web page (channel) or content file view and/or “click throughs” with revenue information in order to determine relevance of the various matched content listed in the search results, is disclosed. By combining revenue information with page view/click through information, potentially more relevant results can be presented to the user for viewing.01-22-2009
20090024612FULL TEXT QUERY AND SEARCH SYSTEMS AND METHODS OF USE - The invention is a method for textual searching of text-based databases including databases of compiled internet content, scientific literature, abstracts for books and articles, newspapers, journals, and the like. Specifically, the algorithm supports searches using full-text or webpage as query and keyword searches allowing multiple entries and an information-content based ranking system (Shannon Information score) that uses p-values to represent the likelihood that a hit is due to random matches. Additionally, users can specify the parameters that determine hits and their ranking with scoring based on phrase matches and sentence similarities.01-22-2009
20090024611System and method for transmitting securities information - A system for transmitting information about securities comprising: a relational database and an analysis processor. The relational database is used to store a relational table that contains a plurality of relations concerning users of securities. When the analysis processor receives information about securities, the analysis processor determines a plurality of users who match the information about securities according to the relational table, and then transmits the information about securities to the plurality of users. When the system is transmitting information about securities, relational tables are established in order to reduce search loops that are required for determining the users, thus speeding up operations of the system and reducing system loads.01-22-2009
20090024608Determining a subset of documents from which a particular document was derived - Embodiments of the present invention pertain to determining a subset of documents from which a particular document was derived. According to one embodiment, similarity measurements indicating similarities between contents of documents are received. A subset of the documents that the particular document was derived from is determined based on dates the documents were created and the similarity measurements without requiring document tracking information to be associated with the documents to determine the subset.01-22-2009
20090024607QUERY SELECTION FOR EFFECTIVELY LEARNING RANKING FUNCTIONS - A learning system for a search ranking function model may include a computer program that iteratively refines the model using new queries and associated documents from an unlabeled training set. The unlabeled training set may include a set of queries for which the associated documents have not been labeled as “relevant” or otherwise labeled. The new queries may be selected based on a similarity to and an accuracy of each neighbor from a labeled training set, such as a labeled validation set. Upon selection, the documents associated with the new queries may be labeled. The new queries and their associated documents may be accumulated into a labeled training set, such as a labeled training set, and a refined model may be learned based on the augmented labeled training set. The model may be iteratively refined until it is determined that the model is adequate.01-22-2009
20090024604DYNAMIC METADATA FILTERING FOR CLASSIFIER PREDICTION - A classifier is used to predict relevant results with arbitrary filtering conditions specified by the user. The classifier model is stored as a database table and joined with a metadata properties table instead of calculating the query result probability using the full classifier model. A user-specified query based filter is applied to the joined tables to obtain the list of documents satisfying the filter. The probability is then computed using the sub-model.01-22-2009
20090024603Method and system for performing search using acronym - Techniques for facilitating efficient local search using acronym are disclosed. According to one aspect of the techniques, a graphic user interface is provided to accept inputs from a user; letters successively entered as the inputs from the user are received; and titles are then progressively reduced and displayed in accordance with the letters, wherein the titles have words each beginning with one of the letters.01-22-2009
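The progressive narrowing in the entry above can be illustrated with a small sketch. It assumes one reading of the abstract: each letter typed so far must match, in order, the initial of the corresponding word at the start of the title. The function names and sample titles are hypothetical.

```python
def matches_acronym(title: str, letters: str) -> bool:
    """True if the typed letters are the initials of the title's leading words."""
    initials = "".join(word[0].lower() for word in title.split())
    return initials.startswith(letters.lower())


def narrow(titles: list[str], letters: str) -> list[str]:
    """Keep only the titles still consistent with the letters entered so far."""
    return [title for title in titles if matches_acronym(title, letters)]


# Each extra letter shrinks the displayed list (sample titles are illustrative only).
titles = ["The Dark Knight", "Toy Story", "The Departed", "Die Hard"]
after_t = narrow(titles, "t")
after_td = narrow(titles, "td")
after_tdk = narrow(titles, "tdk")
```

Re-running `narrow` on the full list for every keystroke keeps the sketch simple. Filtering the previous result instead would give the same titles, since a longer prefix can only remove matches.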
20090024605METHOD AND SYSTEM FOR USER AND REFERENCE RANKING IN A DATABASE - A method and system for user and reference ranking in a database or index. Users may provide reference anchors to references and those may be rated by other users. References may be rated by users and the feedback of those ratings may dynamically alter a prior user's score or ranking. The weight of a user's rating thereafter may be influenced by his score or ranking. Users or the system may create categories, which may be cross-referenced, and users may provide ratings of those categories as well. Users may also suggest search fields for categories. Feedback ratings for the search fields and categories may also affect a user's score or ranking.01-22-2009
20090024602Method and apparatus for searching a video library by genre - A graphic user interface is provided to allow a user to search for a title of interest among a plurality of titles by genre. In one embodiment, a list of types classifying the titles is displayed. A user is allowed to select one or more of the types to narrow down the list so that a title of interest can be readily located in the form of movie banners. As types are being selected, the list is progressively reduced. When one of the selected types is relaxed, the list is increased. In any case, movie titles in the list fall into a category classified by either one or all of the selected types.01-22-2009
20090240679Selecting Accommodations on a Travel Conveyance - The subject matter of this specification can be embodied in, among other things, a process that includes receiving digital accommodation criteria for an accommodation assignment requested for a passenger and accessing accommodation properties that specify characteristics of accommodations offered on a travel conveyance. A first portion of the accommodation properties are base properties and a second portion of the accommodation properties are derivation properties derived from the base properties during execution of a software program configured to access the accommodation properties. The process also includes assigning weights to the accommodation properties based on a comparison between the accommodation properties and the received accommodation criteria. The process includes determining total weighting scores for each of one or more of the accommodations based on an aggregation of the assigned weights for the accommodation properties associated with the accommodation and outputting the requested accommodation assignment.09-24-2009
20090222439Name-based filters utilized in full-text search engine - Techniques for filtering a full-text search result in a full-text search engine level are described herein. According to one embodiment, a filter is defined via a definition statement using a filter name that identifies a filter object representing an implementation of the filter. In response to a search query received at the ORM system from an application client, where the search query identifies the filter via the filter name, a full-text search engine is invoked to perform a full-text search in a database based on one or more keywords in the search query. The filter object is identified based on the filter name extracted from the search query and the filter object associated with the search query is invoked using the filter name of the filter object to filter a search result generated from the full-text search engine. Other methods and apparatuses are also described.09-03-2009
20090204609Determining Words Related To A Given Set Of Words - In one embodiment, display of a user entry window of a graphical user interface is initiated. Search terms entered into the user entry window to initiate a first search are received. One or more first search results from a corpus of documents are determined according to the search terms. Display of the search terms at a current search terms window of the graphical user interface is initiated. Display of the first search results at a search results window of the graphical user interface is initiated. Display of the first search suggestions at a search suggestion window of the graphical user interface is initiated.08-13-2009
20090083262SYSTEM FOR ENTITY SEARCH AND A METHOD FOR ENTITY SCORING IN A LINKED DOCUMENT DATABASE - A system has a processor coupled to access a document database that indexes keywords and instances of entities having entity types in a plurality of documents. The processor is programmed to receive an input query including one or more keywords and one or more entity types, and search the database for documents having the keywords and entities with the entity types of the input query. The processor is programmed for aggregating a respective score for each of a plurality of entity tuples across the plurality of documents. The aggregated scores are normalized. Each respective normalized score provides a ranking of a respective entity tuple, relative to other entity tuples, as an answer to the input query. The processor has an interface to a storage or display device or network for outputting a list including a subset of the entity tuples having the highest normalized scores among the plurality of entity tuples.03-26-2009
20090083261INFORMATION DISPLAY APPARATUS, INFORMATION DISPLAY METHOD, AND COMPUTER PROGRAM PRODUCT - A keyword expressing a search target and an instance of information associated with the keyword are extracted from a character string included in a web document acquired for the keyword, based on a topic ontology, a relationship between the instances is visualized in a first topic graph expressed by a size of a topic node and a length of a topic link, and a reference relationship between web documents, which are an information source, is visualized in a blog graph expressed by a blog node and a blog link.03-26-2009
20090083260System and Method for Providing Community Network Based Video Searching and Correlation - Systems and methods are described which allow a more accurate determination of relationships among videos in terms of their subject matter, context and social preferences. Rather than relying on user-specified metadata to relate videos, the present embodiments use social affinity to determine related subject matter. The process begins with a user accessing any particular video that has a unique identifier. Once the video is accessed, a list of users is found, who have added the video to their collection. From all these users, a set of all the videos that appear in their collections is compiled. Based on this information, a subset of videos which appear in a significant number of collections, can be deemed to be related to the selected video. This subset of related videos can further be analyzed to verify the metadata of the selected video and to provide suggestions and/or corrections regarding that metadata.03-26-2009
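The collection-based process in the entry above (find the users who hold the video, gather the videos in their collections, keep those that recur often) can be sketched as follows. The data layout, a mapping from user to a set of video identifiers, and the `min_count` threshold standing in for "a significant number of collections" are assumptions made for illustration.

```python
from collections import Counter


def related_videos(video_id: str,
                   collections: dict[str, set[str]],
                   min_count: int) -> list[str]:
    """Videos that co-occur with video_id in at least min_count user collections."""
    # Step 1: users who have added the selected video to their collection.
    owners = [user for user, videos in collections.items() if video_id in videos]
    # Step 2: count how often every other video appears across those collections.
    counts: Counter = Counter()
    for user in owners:
        for other in collections[user]:
            if other != video_id:
                counts[other] += 1
    # Step 3: keep videos above the assumed threshold, most frequent first.
    return [vid for vid, n in counts.most_common() if n >= min_count]


# Illustrative data only.
sample = {
    "ann": {"v1", "v2", "v3"},
    "bob": {"v1", "v2"},
    "cat": {"v1", "v2", "v4"},
    "dan": {"v5", "v3"},
}
related = related_videos("v1", sample, min_count=2)
```

No user-specified metadata is consulted here. Relatedness comes only from co-occurrence in collections, which is the social-affinity idea the abstract describes.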
20090083259INFORMATION PROVIDING SYSTEM, INFORMATION PROVIDING METHOD AND INFORMATION PROVIDING RECORD MEDIUM - An information providing system which provides accumulated information items in compliance with requests has an association unit which totals access logs to the information items in each predetermined access unit. The association unit associates the plurality of information items accessed in the predetermined access unit as relevant information items. The information providing system also has an information providing unit which provides a requested information item when any of the plurality of information items associated by the association unit has been requested and which simultaneously provides any other information associated with the requested information or an access portion to the other information.03-26-2009
20090083258Methods and Apparatus for Improved Neighborhood Based Analysis in Ratings Estimation - Systems and techniques for estimation of item ratings for a user. A set of item ratings by multiple users is maintained, and similarity measures for all items are precomputed, as well as values used to generate interpolation weights for ratings neighboring a rating of interest to be estimated. A predetermined number of neighbors are selected for an item whose rating is to be estimated, the neighbors being those with the highest similarity measures. Global effects are removed, and interpolation weights for the neighbors are computed simultaneously. The interpolation weights are used to estimate a rating for the item based on the neighboring ratings. Suitably, ratings are estimated for all items in a predetermined dataset that have not yet been rated by the user, and recommendations are made to the user by selecting a predetermined number of items in the dataset having the highest estimated ratings.03-26-2009
20090049038Location Based News and Search Engine - A search engine and/or news aggregator considers temporal qualities of electronic documents, a location of a searcher/reviewer, and a situs associated with content of the document to determine how/if they should be presented to users.02-19-2009
20090083247AUTOMATICALLY MAKING CHANGES IN A DOCUMENT IN A CONTENT MANAGEMENT SYSTEM BASED ON A CHANGE BY A USER TO OTHER CONTENT IN THE DOCUMENT - A content management system provides a way to detect a change to one part of a document, and to generate a corresponding change in a different part of the same document. Dynamic inclusion rules define conditions that, when satisfied, allow automatically changing a link in a document to a new link when corresponding data in the document is added or changed. If a change corresponds to a defined dynamic inclusion rule, a corresponding query in the rule is evaluated according to the changes in the document. When there is enough information to run the query, the query is automatically executed in a background process. If there is a single link that satisfies the query, the document may be updated with the new link. If multiple links satisfy the query, the top ranked query result may be automatically selected, or the user may select which link should be included in the document.03-26-2009
20090083253Efficient Evaluation of Hierarchical Cubes By Non-Blocking Rollups and Skipping Levels - Techniques are described herein for efficiently evaluating database queries that include hierarchical cube computations. During second and subsequent evaluation phases (if any), a database server does not re-determine groups (nor re-aggregate within such groups) that have already been determined in a previous evaluation phase. Instead, according to a technique described herein, whenever an evaluation phase subsequent to the first evaluation phase is performed, the database server immediately outputs or otherwise returns certain groups and aggregate results that were determined based on certain grouping column sets that were generated in the previous evaluation phase. The database server does not aggregate within these certain groups when performing aggregation in the current evaluation phase, thereby avoiding the duplication of work already performed during previous evaluation phases.03-26-2009
20090198668APPARATUS AND METHOD FOR DISPLAYING DOCUMENTS RELEVANT TO THE CONTENT OF A WEBSITE - A computer readable storage medium includes executable instructions to identify a user of a website, retrieve one or more keywords describing content on the website, and search for reports corresponding to the one or more keywords. The reports are filtered based on data access permissions associated with the user. A highly ranked report is displayed on the website.08-06-2009
20090198667Generating Search Result Summaries - Embodiments are configured to provide a summary of information associated with one or more search results. In an embodiment, a system includes a summary generator that can be configured to provide a summary of information including one or more snippets associated with a search term or search terms. The system includes a ranking component that can be used to rank snippets and the ranked snippets can be used when generating a summary that includes one or more ranked snippets. In one embodiment, the system can be configured to include one or more filters that can be used to filter snippets and the filtered snippets can be used when generating a summary. Other embodiments are available.08-06-2009
20090198679SYSTEMS, METHODS AND SOFTWARE FOR EVALUATING USER QUERIES - The present inventor devised, among other things, an information retrieval system that determines whether a query is ambiguous or not and based on this determination either continues or aborts a search process. One query evaluation extracts word pairs from an input query and uses features of the extracted word pairs, for example the number of word pairs and their frequencies within a document collection, to determine if the query is ambiguous or not. Another evaluation measures topical convergence, using query related caselaw headnotes that are associated with topics in a legal taxonomy. And yet another checks topical convergence through the lens of full caselaw documents and secondary legal documents, such as law review articles, specifically the minimum number of case law and secondary legal documents that are necessary to span a set of top ranked topics identified using the query.08-06-2009
20090049036Systems and methods for keyword selection in a web-based social network - A system and method for selecting a subset of keywords from a set of master keywords found in user profiles in a social network is disclosed. The method includes selecting a first and second group of user profiles including one or more keywords and computing the number of occurrences of each of the master keywords in the first and second group of profiles. A value may be computed for each of the master keywords based on a comparison of the number of occurrences in the first group of profiles and the number of occurrences in the second group of profiles. The computed value may be used for selecting the subset of keywords from the master keywords and/or ranking the master keywords.02-19-2009
20090049035System and method for indexing type-annotated web documents - Methods and apparatus generate an index for use in a document retrieval system where the index is organized by type and keyword. Redundancy in the index is reduced by organizing type entries in a hierarchy of internal and leaf nodes. Determining whether to generate an inverted list for a type is based on the position of the type in the hierarchy; generally inverted lists are generated only for types corresponding to leaf nodes. Redundancy is further reduced by re-using inverted lists generated for keywords for types when there is an overlap between keywords and types. Search performance using the document retrieval index is improved by adding entries corresponding to combinations of keywords and types. The intersections of inverted lists associated with the keywords and types comprising the combinations are determined and added to the index for use in search operations. Determining whether to add an entry for a keyword-type combination is made on a cost-benefit analysis dependent, at least in part, on the proximity of the keyword to type in documents containing the combination.02-19-2009
20090049032METHOD AND SYSTEM FOR INTENT QUERIES AND RESULTS - A search engine compares entered search terms to an index of terms signifying a specific or local intent. If an entered term matches a term in the index, then the search engine identifies and outputs information corresponding to the specific or local intent. Terms to include in the index of terms can be identified by monitoring the searching behavior of a set of users.02-19-2009
20090049031Method And System For Database Searching - A method of searching a second database comprising A) receiving a summary document generated by a first database, the summary document comprising a list of returned first database subject keys, representing the returned first database subjects, the list further including at least one identifier associated with the returned first database subjects; B) reading the summary document and generating one or more second database query options for searching for second database subjects that have relationships to the returned first database subjects corresponding to the at least one identifier; C) receiving a second database query in accordance with said one or more second database query options; D) receiving said returned first database subjects; E) using said returned first database subjects, searching said second database in accordance with said second database query options.02-19-2009
20090055375Bundle Generation - First topics related to a content page, such as a web page, are identified. Thereafter, second topics related to a first content element, such as advertisements, and a second content element, such as media files, are identified based on the first topics. Common topics are identified that are common to the first and second topics. Based on the common topics, first and second content elements are identified and combined in a bundle that is transmitted to a user requesting the content page.02-26-2009
20090055385Media-Based Recommendations - A computer-implemented method includes receiving information expressing a user's interest in one or more media programs, obtaining information indicative of popularity for a plurality of media programs responsive to the received information by individuals other than the user, and transmitting one or more recommendations of media programs for display to the user, from the plurality of media programs that relate to the received information.02-26-2009
20080301131MODIFICATION OF A SAVED DATABASE QUERY BASED ON A CHANGE IN THE MEANING OF A QUERY VALUE OVER TIME - An apparatus and method modify a saved query based on a change in a query value meaning that changes over time. In preferred embodiments a graphical query interface displays an option to adjust query values of a saved database query. A query adjustment mechanism then adjusts the value of the query to compensate for the change in the meaning of the query value since the query was created such that the adjusted query will have the same basic meaning as when the query was originally created. Preferred embodiments allow the user to specify whether to adjust the query to the current date or to a specified date in the past.12-04-2008
20090193009VIEWING TIME OF SEARCH RESULT CONTENT FOR RELEVANCY - Amounts of time that search result content is displayed for viewing can be collected and used for relevancy ranking. Selection of a first of a plurality of search results is detected. The plurality of search results is received in response to submission of a set of one or more search terms. An amount of time content of the first search result is displayed for viewing is determined. The content is loaded in response to the selection of the first search result. An indication of the amount of time is supplied as input for ranking relevancy of the first search result with respect to the set of one or more search terms.07-30-2009
20090063478System for Compiling Word Usage Frequencies - A system for assisting a user who is learning a language to prioritize words to be learned in order of usage frequency is disclosed. A frequency determination program running on a computer determines the frequency of usage of each word at a list of locations provided by the user. Different algorithms to identify what constitutes a word are employed depending upon the language of the source data. The total number of words at each location and their usage frequency found during the user session, along with a total number of words and their usage frequency for all user sessions performed regardless of location, are calculated and made available to the user. The user can view usage frequencies for words from a single location, a group of locations, or all user sessions performed.03-05-2009
20090063472EMPHASIZING SEARCH RESULTS ACCORDING TO CONCEPTUAL MEANING - Computer-readable media, computerized methods, and computer systems for conducting semantic processes to present search results that include highlighted regions which are relevant to a conceptual meaning of a query are provided. Initially, content of document(s) is accessed and semantic representations are derived by distilling linguistic representations from the content. These semantic representations may be stored at a semantic index. Also, a proposition is derived from the query by parsing search terms of the query, and distilling the proposition from the search terms. Typically, the proposition is a logical representation of the conceptual meaning of the query. The proposition is compared against the semantic representations at the semantic index to identify a matching set. Regions of the content within the document, from which the matching set of semantic representations are derived, are targeted. Accordingly, highlighting may be applied to the targeted regions when presenting or displaying the search results.03-05-2009
20090063449Integrating Sponsored Media with User-Generated Content - A variety of computer-based services that permit users to edit, compose, upload, or otherwise generate content also provide for the integration of sponsored media into presentations along with user-generated content. An exemplary service generates text based on user input, provides tags based on the text to a sponsored media repository, receives a sponsored media data structure in return, and formats sponsored media from the data structure for display to the user.03-05-2009
20090063475Tool for personalized search - The invention provides for customized display of search results to Users. The invention further provides for customization of associated advertisements. The invention further provides for a dynamic personal knowledge base that is kept private. The invention provides an Intelligent Web Proxy that re-sorts search results based on the contents of the personal knowledge base (PKB) and creates a display customized to User preferences. The invention also automatically tracks and updates User activity, and “learns” User preferences.03-05-2009
20090063474System and Method for Information Retrieval - A method of performing a search of an online directory, comprising receiving an identifier entered as part of a website address; extracting the identifier entered as part of the website address; searching one or more databases associated with the online directory for instances of the identifier; and displaying information extracted from the one or more databases if a match was found for the identifier.03-05-2009
20090063473INDEXING ROLE HIERARCHIES FOR WORDS IN A SEARCH INDEX - Methods, systems and computer readable media for finding documents in a data store that match a natural language query submitted by a user are provided. The documents and queries are matched by determining that words within the query have the same relationship to each other as the same words in the document. Documents are semantically analyzed and words in the document are indexed along with the role the word plays in a sentence. The initial semantic role may be generalized using a role hierarchy and stored in the index along with the original role. A similar analysis may be used with the search query to find words used in the same role in both the query and the document.03-05-2009
20090063468SYSTEM AND METHOD FOR CAREER WEBSITE OPTIMIZATION - A method of managing career opportunities is provided. In some embodiments, a method for providing career and job listing websites and optimizing the career and job listing websites for search engine optimization is provided. In various embodiments, the method comprises creating a career website that mirrors a company's website, extracting job information from the company, and optimizing the job information for maximum search engine placement.03-05-2009
20090063463Ranking of User-Generated Game Play Advice - Management of user-generated game play advice is disclosed. The present invention allows for management of game play advice that is complete and up-to-date regardless of when a particular interactive gaming title is released. Game play advice is pervasive and easily accessible to game players in addition to being accurate and credible such that game players can trust or rely upon the rendered advice.03-05-2009
20090063461USER QUERY MINING FOR ADVERTISING MATCHING - Systems and methods to determine relevant keywords from a user's search query sessions are disclosed. The described method includes identifying search session logs of a user and segmenting the search session logs into one or more search sessions. After the segmentation, the search sessions are analyzed to compose a list of semantically relevant keyword sets including at least a first keyword set and a second keyword set. The described method further includes determining a semantic relevance between the first and second keyword sets according to the frequency at which the first and second keyword sets are reported in the query results and displaying one or more highly semantically relevant keyword sets after being filtered by a threshold.03-05-2009
20090063460PRESENTING RESULT ITEMS BASED UPON USER BEHAVIOR - Methods, systems, and computer storage media having computer-executable instructions embodied thereon that, when executed, perform methods for identifying and presenting the “best” answer to a given search query as it relates to a particular user based upon that user's behavior are provided. Upon receipt of a search query and determination of the search result items satisfying the query, it is determined whether the user has executed the same or a substantially similar search in the past and, if so, if there is a particular one of the search result items that s/he has a tendency to select when the search result items are presented. If a particular result is frequently selected, that result is prominently presented (e.g., highlighted, displayed with a border, displayed in a different font than other results, or the like) among the search result items making it easier for the user to quickly identify the desired result.03-05-2009
20090063458METHOD AND SYSTEM FOR MINIMIZING SORTING - A method for minimizing the sorting of data comprises retrieving a database having an index of entries arranged according to first, second, and third data entries. The method additionally comprises partitioning the index of entries into a first partially-ordered list, wherein the first partially-ordered list comprises information arranged in the form of the first, second, and third data entries. The entries in the first partially-ordered list share the same first data entry. Furthermore, the method comprises partitioning the index of entries into a second partially-ordered list, wherein the second partially-ordered list comprises information arranged in the form of the first, second, and third data entries. The first data entry within the second partially-ordered list is not the same as the first data entry in the first partially-ordered list. Finally, the method comprises querying the first partially-ordered list without querying the second partially-ordered list according to a set of query instructions.03-05-2009
20090063455Bipartite Graph Reinforcement Modeling to Annotate Web Images - Systems and methods for bipartite graph reinforcement modeling to annotate web images are described. In one aspect the systems and methods implement bipartite graph reinforcement modeling operations to identify a set of annotations that are relevant to a Web image. The systems and methods annotate the Web image with the identified annotations. The systems and methods then index the annotated Web image. Responsive to receiving an image search query from a user, wherein the image search query comprises information relevant to at least a subset of the identified annotations, the image search engine service presents the annotated Web image to the user.03-05-2009
20090063451SEARCH ENGINE USING WORLD MAP WITH WHOIS DATABASE SEARCH RESTRICTION - According to the present invention, unwanted search results can be eliminated in most Internet search operations, reducing the high volume of Internet traffic and making the search operation highly efficient. The present invention proposes a two-step approach. The first step achieves high relevance of the search results through a region-restricted search operation, which searches only within a selected geographical region. The second step adds a further degree of relevance by correlating the contact address with a reliable reference address or the legitimate contact address, eliminating the crap and squatter sites from the search result list. The region-restricted search operation thus minimizes the search time and the huge volume of Internet traffic that is likely to impair overall Internet performance.03-05-2009
20090063448Aggregated Search Results for Local and Remote Services - A search system may include searches performed on remotely hosted services that may be indexed and queried by an aggregated search tool. The search tool may aggregate desktop searches and internet searches with searches of remotely hosted services into a single set of results. Remotely hosted services may include databases and other services that are hosted over the Internet but may be privately available to a user. Examples of remotely hosted services may include shared directories, customer resource management systems, project management tools, accounting systems, and other remote services. In some embodiments, a search index created from the remote service may be stored locally or on a server.03-05-2009
20090063454VORTEX SEARCHING - Determining intersection points of parameter patterns. Parameter patterns are specified in a query. A method includes identifying a first parameter pattern from the query as occurring less often in the index than one other parameter pattern in the query. The data store is searched until a present location of the data store has been identified as including the first parameter pattern. Then the data store is searched for a location of another parameter pattern. If the present location is identified as including the other parameter pattern, then an indication is provided identifying an intersection. Otherwise, the method includes continuing searching remaining portions of the data store to find a location of the other parameter pattern at a new present location. At least one of the acts of searching above includes eliminating at least a portion of records of the data store from searching without being searched prior to being eliminated.03-05-2009
20090049041Ranking content items related to an event - Ranking content items is disclosed. A user input is received from each of one or more users indicating an opinion of the user with respect to a content item included in a plurality of content items. Based at least in part on a number of users from whom user input has been received, a degree is determined to which a ranking of the content item relative to one or more other content items in the plurality of content items is determined by user input.02-19-2009
20090055383DYNAMIC MEDIA INTERACTION USING TIME-BASED METADATA - Systems and methods are provided for linking time-based metadata to media content so that as the metadata changes in synchronicity with media content during play, information associated with the media content can be outreached in the context of the media presentation. More particularly, according to one embodiment of the present invention, a media player device is provided that renders media content and retrieves and displays appropriate metadata information associated with the media content at an appropriate time to an ancillary metadata viewer device during play of a media resource.02-26-2009
20090055373System and method for refining search terms - A system and method for refining a string of search terms used in a search of an electronic database, by extracting phrases from a first result set of the search, selecting relevant phrases from among the extracted phrases, and adding the selected phrases to the original search terms.02-26-2009
20090055388Method and system for selecting search engines for accessing information - A method and system for access to information using search engines is provided. A search engine is selected for executing a query based on search engine characteristic information and the query. The characteristic information for each search engine includes information representing searching capabilities of each search engine. Selecting a search engine further involves determining a similarity between the query and the characteristic information for each search engine, and selecting a search engine based on the similarities such that a search engine with the highest similarity may be selected for executing the query and returning search results.02-26-2009
20090055384SHARED INFLUENCE SEARCH - In one embodiment, a search query is received from a user. Then a designated expert for the search query is determined. Search results based at least in part upon previous actions taken by the expert relevant to the search query are then identified. These results may then be returned to the user.02-26-2009
20090055382Automatic Peer Group Formation for Benchmarking - A method of automatically generating peer groups of entities includes receiving data for a plurality of characteristic parameters about a number of entities and defining a number of peer groups, k, to be generated. A minimum number of entities, m, to be assigned to each peer group is defined, and k initial cluster center values are defined around which to group the entities according to the data for each entity's characteristic parameters. Each entity is assigned to a peer group associated with a particular initial cluster center value, and it is ensured that the number of entities assigned to each peer group is greater than the minimum number, m.02-26-2009
20090055378SYSTEMS AND METHODS FOR PROVIDING IMPROVED ACCESS TO PHARMACOVIGILANCE DATA - A system and method for browsing a pharmacovigilance database with a graphical representation that shows relationships between medical terms may include providing access to a plurality of medical terminologies and mapping medical terms of the plurality of terminologies to a searchable database by using a semantic network to relate the medical terms of the different terminologies. The system and method may further include providing a graphical user interface that enables graphical navigation of the plurality of terminologies, enables display of a mapping between a first medical term from a first medical terminology to a second medical term from a second medical terminology, and enables coding of pharmacovigilance reports using medical terms of the second terminology based on a description provided using medical terms of the first terminology.02-26-2009
20090055379Systems and Methods for Locating Contact Information - Systems and methods for locating contact information are shown and described. The method can include receiving an instruction to locate a portion of contact information stored in the one or more directories of the computing device and initiating a plurality of directory handler routines at the computing device. Each directory handler routine can be associated with a specific one of the directories of the computing device. The method also includes traversing one or more of the directories resident at the computing device with the associated directory handler routine and locating the contact information stored in the one or more directories. Also, the method includes aggregating the located contact information and displaying the aggregated contact information at the computing device.02-26-2009
20090055376SYSTEM AND METHOD FOR IDENTIFYING SIMILAR MEDIA OBJECTS - The systems and methods described create a mathematical representation of each of the media objects for which user ratings are known. The mathematical representations take into account the subjective rating value assigned by a user to the respective media object and the user that assigned the rating value. The media object with the mathematical representation closest to that of the seed media object is then selected as the most similar media object to the seed media object. In an embodiment, the mathematical representation is a vector representation in which each user is a different dimension and each user's rating value is the magnitude of the vector in that dimension. Similarity between two songs is determined by identifying the closest vectors to that of the seed song. Closeness may be determined by subtracting or by calculating the dot product of each of the vectors with that of the seed media object.02-26-2009
20100036833SYSTEM AND METHOD FOR TYPE-AHEAD ADDRESS LOOKUP EMPLOYING HISTORICALLY WEIGHTED ADDRESS PLACEMENT - The subject application is directed to a system and method for type-ahead address lookup employing historically weighted address placement. A prompt is generated on a display for commencement of a new search operation and search data of text entries is received via a user interface. Entries are stored in an associated database, each entry having at least one searchable text field. At least a first character of a new search received via the user interface is tested against the entries relative to the searchable field. A display is generated corresponding to a subset of the entries based upon a testing output. Selection data is received corresponding to a selected entry from the displayed subset and weighting data is generated corresponding to received selection data. Displayed entries are ordered corresponding to the subset of database entries upon subsequent re-entry of the at least a first character during a subsequent search operation.02-11-2010
20090100035Generating a User-Specific Search Index of Content Within a Virtual Environment - Embodiments of the invention provide techniques for searching for virtual objects of an immersive virtual environment based on user interactions within the virtual environment. Generally, embodiments provide an attribute index storing data describing attributes of virtual objects, and an interaction index storing data describing user interactions with virtual objects. Search queries may be evaluated using both the attribute index and the interaction index. Thus, virtual objects may be searched in terms of object attributes as well as user interactions with the virtual objects.04-16-2009
20090083254Method and system for providing improved answers - Disclosed is a method and system for ranking answers supplied by user authors in an online database. A first author enters a first answer under a question. The answer is ranked #03-26-2009
20090083252WEB-BASED COMPETITIONS USING DYNAMIC PREFERENCE BALLOTS - In one example, a method for ranking items such as contest entries is provided. An exemplary method includes displaying sequential subsets of entries from a plurality of entries for a first user to vote on, e.g., making a selection of their preference of one over the other. The method further includes generating a first preference ballot of displayed entries based on selections by the first user, and ranking the plurality of entries based upon the first preference ballot and at least a second preference ballot received from another user. The ranking may be determined based on the first and second preference ballot by a Condorcet algorithm. Additionally, display of the entries may be determined based on previous selections associated with the contest entries, e.g., based on the state of the contest and/or the history of particular contest entries.03-26-2009
20090083249METHOD FOR INTELLIGENT CONSUMER EARCONS - A method for utilizing earcons, includes: forming a database of earcons; forming a user profile and preferences database; monitoring user audio content; monitoring the user environment; playing a series of earcons from the database of earcons on a user's communication device; wherein the series of earcons are chosen from the database of earcons based on the user profile and preferences database; and wherein the playing of individual earcons from the series of earcons is based on the monitored user audio content and environment.03-26-2009
20090055392ORDERING OF SEARCH RESULTS BASED ON LANGUAGE AND/OR COUNTRY OF THE SEARCH RESULTS - A system and method for providing preferred language and/or country ordering of search results is described. A search query describing potentially retrievable information provided in a plurality of search result languages and/or countries is received. A search is executed by evaluating the search query against information characteristics maintained in a searchable data repository. At least one preferred language and/or country applicable to search results generated is dynamically determined responsive to the executed search. At least some of the search results are ordered in consideration of the at least one preferred language and/or country.02-26-2009
20100023509PROTECTING INFORMATION IN SEARCH QUERIES - A method of protecting information in search queries uses a search apparatus with a user interface that is configured for connection to a computer network that comprises a plurality of search engines on a plurality of servers. The method includes receiving a search query comprising a plurality of keywords; dividing the search query into a number of sub-queries, each sub-query comprising at least one of the keywords; and submitting the sub-queries to different search engines.01-28-2010
20090070310ONLINE ADVERTISING RELEVANCE VERIFICATION - Online relevance verification is performed to provide relevant advertisements to search queries received at a search engine. Relevance of an advertisement for a received search query is determined by comparing the content of a landing page associated with the advertisement against search results for the search query. Relevance may then be used to filter irrelevant advertisements from consideration and/or may be used in ranking advertisements during an auction process in conjunction with monetization factors. Selected advertisements may then be returned in response to the search query.03-12-2009
20090204596SEMANTIC COMPATIBILITY CHECKING FOR AUTOMATIC CORRECTION AND DISCOVERY OF NAMED ENTITIES - A computer implemented system and method for processing text are disclosed. Partially processed text, in which named entities have been extracted by a standard named entity system, is processed to identify attributive relations between a named entity or proper noun and a corresponding attribute. A concept for the attribute is identified and, in the case of a named entity, compared with the named entity's context, enabling a confirmation or conflict between the two to be determined. In the case of a proper name, the attribute's context can be associated with the proper name, allowing the proper name to be recognized as a new named entity.08-13-2009
20090204607DOCUMENT MANAGEMENT METHOD, DOCUMENT MANAGEMENT APPARATUS, INFORMATION PROCESSING APPARATUS, AND DOCUMENT MANAGEMENT SYSTEM - When registering a document, pieces of index candidate information to be assigned to the document to be registered are created and output based on user characteristic information acquired from login information of a user and document information acquired from the document to be registered. Index information selected by the user from the pieces of output index candidate information is received. The received index information is registered in association with the document acquired by a document information acquiring unit. When browsing a document, the user characteristic information acquired from the login information of the user is compared with user characteristic information set for the registered document in association with the index information. A document having user characteristic information whose predetermined items match items of the set user characteristic information is extracted as a document associated with the user.08-13-2009
20090100045DEVICE AND METHOD FOR ADAPTIVE SERVICE SELECTION, QUERY SYSTEM AND METHOD - The present invention relates to a device for adaptive service selection comprising a semantic analyzing means which analyzes a query from a user semantically, an adaptive service selecting means which generates a new service mapping rule so as to obtain a selected service, when the semantically-analyzed query does not match with a rule in a service mapping rule base, and a retrieving means which retrieves and obtains an answer according to the selected service. The present invention also relates to a method for adaptive service selection, a system and method for adaptive service selection as well as a query system and method thereof. With the system and method of the present invention, a new service mapping rule can be generated and added automatically when a user query is not included in a service mapping rule base. It is thus possible to improve the accuracy of natural language based service selection and provide the user with a selected service as well as the corresponding query answer.04-16-2009
20090100040LAWFUL INTERCEPTION OF BROADBAND DATA TRAFFIC - Methods, systems, and computer-readable media provide for lawfully intercepting broadband data traffic. According to one method, a request to retrieve a network address associated with a login identifier is received. An Authentication, Authorization and Accounting (AAA) server is queried based on the login identifier to retrieve the network address associated with the login identifier. Relevant data traffic and AAA information associated with the relevant data traffic are filtered at a network element. The relevant data traffic and the AAA information are forwarded to a law enforcement agency (LEA) system.04-16-2009
20090100044Action management system and action management method - A system and a method of managing the action on an electronic document and a paper document are disclosed. The action management system includes a procedure definition information data base for holding the procedure definition specifying the condition for a string of actions constituting the procedure, a procedure matching part for extracting the string of actions coincident with the condition for the procedure definition from the actions managed by the action management system, and a hypothesis information data base for holding the procedure obtained by the procedure matching part and the hypothesis of the action constituting the procedure. The action management system extracts a typical action and a candidate for a non-typical action related to the typical action.04-16-2009
20090100043System And Method For Providing Orientation Into Digital Information - A system and method for providing orientation into digital information is provided. A plurality of evergreen indexes for subject areas are maintained. The evergreen indexes include digital information and are each organized by topics that include a topic model matched to the digital information. A user interest within the digital information is determined. The topic models for the evergreen indexes are evaluated against the user interest and those topics models that best match the user interest are identified. Access to the digital information is provided via at least one of the topic models in at least one of the evergreen indexes.04-16-2009
20090100042SYSTEM AND METHOD FOR ENHANCING SEARCH RELEVANCY USING SEMANTIC KEYS - A method, computer-usable medium, and a computer system for searching for webpages are disclosed. Embodiments of the present invention provide a convenient and efficient mechanism for filtering results from a keyword search using semantic keys and semantic sub-keys, thereby enabling an increased number of irrelevant results to be filtered from a keyword search. The search query may be parsed to determine the focus of the query, where the focus may be used to determine at least one semantic key for the search query. Each semantic key may be associated with at least one semantic sub-key, where the semantic keys and/or the semantic sub-keys may be used to filter the results of the keyword search. As such, broader keyword searches may be performed to include a larger number of relevant results, where the filtering mechanisms of the present invention may then filter an increased number of irrelevant results.04-16-2009
20090100039Extensible mechanism for grouping search results - Systems, methods, and other embodiments associated with grouping automated search results are described. One embodiment includes a computer-readable medium storing computer-executable instructions operable to perform a method that includes identifying items to group. The method also includes selectively grouping a first item and a second item upon determining that a comparison of metadata attributes indicates that the first item and the second item are to be treated as members of a group.04-16-2009
20080263031METHOD AND APPARATUS FOR CREATING SEARCHES IN PEER-TO-PEER NETWORKS - One embodiment of the present method and apparatus for creating searches in peer-to-peer networks includes forming clusters comprising data from a user's media library and formulating at least one search request message in accordance with the clusters. Formation of the clusters may be guided at least in part by data attributes that the user indicates are important. In this way, the user's media library may be “mined” for information that will aid in creating searches for data that the user may be interested in, but may not necessarily know how to search for or may not necessarily know exists.10-23-2008
20080263030METHOD AND APPARATUS FOR MANAGING PEER-TO-PEER SEARCH RESULTS - One embodiment of the present method and apparatus for processing a search request message received over a network includes computing a threshold value in accordance with the search request message and returning at least one search result to a user in response to the search request message, if a rank of the at least one search result at least meets the threshold value.10-23-2008
20080263029ADAPTIVE ARCHIVE DATA MANAGEMENT - In one embodiment, input is received from a user defining a classification and an analytic for the classification. Multiple classifications and analytics may be defined by a user. A definition of relevance parameters is determined that characterize the classification and a set of analytics measures associated with the analytic. The definition may be for the classification. Unstructured data and structured data are analyzed based on the definition of the relevance parameters to determine relevant data in the unstructured data and the structured data. The relevant data being data that is determined to be relevant to the classification defined by the user. An index of the terms from the relevant data is determined. The index is useable by an analytics tool to provide results for queries of the unstructured data and structured data. The query may be used within the classification such that targeted results are provided using the index and the relevant data to the classification. Thus, queries from different classifications may be performed efficiently using data determined to be relevant to the classification.10-23-2008
20080263028Report Search Method, Report Search System, and Reviewing Apparatus - An object of the present invention is to provide a report search method, and a reviewing apparatus by which a measure against abnormalities such as a defect of a sample may be quickly obtained by searching desired information in a report recording past information.10-23-2008
20080263027PORTABLE DATA STORAGE APPARATUS AND METHOD OF ALLOWING USER TO SELECT DIGITAL DATA USING THE PORTABLE DATA STORAGE APPARATUS - A data storage apparatus includes a data storage unit which stores digital data; a metadata storage unit which stores metadata regarding a plurality of categories for a plurality of pieces of digital data including the digital data stored in the data storage unit; a search criterion input unit which receives at least one category which is a search criterion, from among the plurality of categories; a list sorting unit which sorts a list of the plurality of pieces of digital data on the basis of item values of the at least one search category; a list display unit which displays the list of digital data; and a selection input unit which receives one piece of digital data selected by a user from among the list of digital data. Accordingly, the user can easily and quickly search for his or her desired data.10-23-2008
20080263026Techniques for detecting duplicate web pages - Techniques are disclosed for detecting web pages with duplicate content. In one embodiment, a set of shingles is computed for each page of a group of pages. An aggregate set of shingles is determined based on the sets of shingles computed for the group of pages. A first subset from the aggregate set of shingles is determined by selecting, from the aggregate set, shingles whose frequencies in the aggregate set exceed a specified threshold. A modified set of shingles is generated for each page of the group of pages by removing, from the set of shingles for that page, any shingle included in the first subset. One or more duplicate pages in the group of pages are determined based at least in part on the modified sets of shingles generated for the group of pages.10-23-2008
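The steps in entry 20080263026 above can be sketched roughly as follows. This is an illustrative reading, not the disclosed method: word-level shingles of length `k`, counting frequency as the number of pages containing a shingle, and comparing the modified sets with Jaccard similarity are all assumptions, as are the function names and the parameters `max_pages` and `min_similarity`.

```python
from collections import Counter


def shingles(text, k=3):
    """Return the set of k-word shingles in text (assumed word-level)."""
    words = text.lower().split()
    return {" ".join(words[i:i + k]) for i in range(len(words) - k + 1)}


def find_duplicates(pages, k=3, max_pages=2, min_similarity=0.8):
    """pages maps a page name to its text; returns sorted duplicate pairs."""
    sets = {name: shingles(text, k) for name, text in pages.items()}
    # Aggregate set: count how many pages each shingle appears in.
    freq = Counter(s for sh in sets.values() for s in sh)
    # First subset: shingles whose frequency exceeds the threshold.
    common = {s for s, count in freq.items() if count > max_pages}
    # Modified sets: each page's shingles minus the over-frequent ones.
    modified = {name: sh - common for name, sh in sets.items()}
    names = sorted(modified)
    duplicates = []
    for i, a in enumerate(names):
        for b in names[i + 1:]:
            union = modified[a] | modified[b]
            # Assumed comparison: Jaccard similarity of the modified sets.
            if union and len(modified[a] & modified[b]) / len(union) >= min_similarity:
                duplicates.append((a, b))
    return duplicates


footer = "copyright example corp all rights reserved"
pages = {
    "a": "the quick brown fox jumps over the lazy dog " + footer,
    "b": "the quick brown fox jumps over the lazy dog " + footer,
    "c": "stock prices rose sharply in early trading today " + footer,
    "d": "heavy rain is expected across the region tonight " + footer,
}
result = find_duplicates(pages)
```

Removing the over-frequent shingles keeps shared boilerplate, such as the footer here, from inflating the similarity between otherwise unrelated pages.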
20080263025USE OF NATURAL SEARCH CLICK EVENTS TO AUGMENT ONLINE CAMPAIGNS - A method is described for augmenting sponsored search results in a search engine, which includes extracting attribute data from a plurality of natural searches for a search term linked to a plurality of uniform resource locators (URLs), analyzing the attribute data of one or more attributes for clickers and non-clickers to determine at least one greatest distinguishing factor between the clickers and non-clickers, and integrating the at least one greatest distinguishing factor into a matching algorithm used by the search engine to rank order and display a plurality of the most relevant ads corresponding to the plurality of URLs in response to a search for the term. The method may also integrate the at least one greatest distinguishing factor into a marketer algorithm to enable a marketer of a URL to strategically choose a search term, along with the at least one greatest distinguishing factor, on which to bid.10-23-2008
20080263024ELECTRONIC DEVICE WITH A RANKING OF APPLICATIONS BASED ON LOCATION AND METHOD OF USING THE SAME - The present invention provides for a portable electronics device and a method of using the same. In one embodiment, the portable electronics device is comprised of (1) a position locator for determining an approximate location of the device; (2) an application correlator for correlating the approximate location with an application; and (3) a ranking generator for ranking the application relative to the approximate location.10-23-2008
20080263023INDEXING AND SEARCH QUERY PROCESSING - A method for processing a search query according to one embodiment includes receiving a search query containing terms; looking up at least some of the terms in a search index for identifying sections of documents containing the at least some of the terms; generating a content score for each of the documents based at least in part on a number of keywords found in the sections of each document; looking up at least some of the terms in the search index for attempting to match one or more of the terms to context information in the search index, the context information being associated with at least one of the documents; generating a context score based at least in part on the matching of terms to the context information; generating a document score for each of the documents based at least in part on the content score and the context score; and outputting an indicator of at least one of the documents, or portion thereof, for the at least one of the documents having a higher document score relative to other of the documents.10-23-2008
20080263021Methods of object search and recognition - The proposed technical solution allows processing of machine-readable forms of unfixed format. An auxiliary brief description may be optionally specified to determine the spatial orientation of the image. A method of searching for elements of a document comprises the following main operations in addition to the operations of preliminary image processing: selecting the varieties of structural description from several available variants, determining the orientation of the image, selecting the text objects, where the text must be recognized, and determining the minimal required volume of recognition, recognizing the text objects, searching for elements of the form. Searching for elements of the form comprises the following actions: selecting a searched element in the structural description, gaining the algorithm of search constraints from the structural description, searching for the element, testing the obtained variants.10-23-2008
20080263020Content providing system, content providing apparatus and method, content distribution server, and content receiving terminal - A content providing system includes a content distribution server and a content receiving terminal connected to each other through a communication channel. The content receiving terminal includes an operation input unit specifying one point on a line, a time information output unit outputting the specified point as time information, a request sender sending a content providing request including the output time information to the content distribution server, and a provider providing at least one content item to a user. The content distribution server includes a content storage unit in which a plurality of content items are stored in association with at least the corresponding time information, a search unit searching the content storage unit for at least one content item according to a search condition based on the time information, and a content distributor distributing at least one content item to the content receiving terminal.10-23-2008
20090204604METHOD AND DEVICE FOR ONLINE DYNAMIC SEMANTIC VIDEO COMPRESSION AND VIDEO INDEXING - A technique for semantic video compression is shown in block (08-13-2009
20090198688METHODS AND SYSTEMS FOR DYNAMICALLY REARRANGING SEARCH RESULTS INTO HIERARCHICALLY ORGANIZED CONCEPT CLUSTERS - Methods of and systems for dynamically rearranging search results into hierarchically organized concept clusters are provided. A method of searching for and presenting content items as an arrangement of conceptual clusters to facilitate further search and navigation on a display-constrained device includes providing a set of content items and receiving incremental input to incrementally identify search terms for content items. Content items are selected and grouped into sets based on how the incremental input matches various metadata associated with the content items. The selected content items are grouped into explicit conceptual clusters and user-implied conceptual clusters based on metadata in common to the selected content items. The clustered content items are presented according to the conceptual clusters into which they are grouped.08-06-2009
20090198682METHOD AND SYSTEM FOR RESTRICTING ACCESS RIGHTS ON USER PROFILE INFORMATION USING A NEW NOTION OF PEER - The present invention relates to the field of Network portals and in particular to a method and system for restricting access rights on user profile information using a new notion of peer groups, wherein a given user's peer group is defined to be the set of users containing all the members of all the user's communities, wherein the individual communities are defined within the web portal wherein on said web portal a plurality of composite applications are implemented, wherein each composite application (08-06-2009
20090198680DOCUMENT MANAGEMENT METHOD, DOCUMENT MANAGEMENT APPARATUS, AND DOCUMENT MANAGEMENT SYSTEM - User characteristic information acquired from the login information of a user is stored as attribute information associated with the document information of a registered document. In accordance with login of the user, a document associated with the acquired user characteristic information is acquired based on the acquired user characteristic information and the stored user characteristic information. A display content to display pieces of information for identifying the acquired document is created. As the attribute information of the document, a weight representing the relevance between the document and each of a plurality of items included in the user characteristic information is stored in association with each other. A display content to classify, based on the weight of each item, the pieces of information for identifying the acquired document and display the information is created.08-06-2009
20090198675METHODS AND SYSTEMS FOR USING COMMUNITY DEFINED FACETS OR FACET VALUES IN COMPUTER NETWORKS - A database search method and system utilize user community defined facets and facet values for refining searches. The system provides access to a database having a plurality of records in respective categories of information. Each record has one or more facets to the respective category of information. The system enables user input of a search term formed of a first parameter indicative of at least one category of information of the database. In response to the user input, the invention system displays (a) a set of search results, including records from the database of the at least one category of information, and (b) a listing of facets and/or facet values of the records in the search results. The listing of facets and/or facet values serve as suggested additional parameters for further refining the search terms or guiding user navigation of the database. In response to user selection of a facet value from the listing, the system refines the search term resulting in a refined search term formed of the first parameter plus the user selected facet and/or facet value. A search of the database is rerun using the refined search term. The facets and facet values are defined by a computer network community of users over time and through use of the network community portal. Another embodiment is an advertising engine that displays targeted advertisements to the user based on refined search. Another embodiment is a method that utilizes refined search to help the user with navigation of a site (e.g., website or other computer network site) as a component of a GUI.08-06-2009
20090210410STORAGE METHOD AND SEARCH METHOD FOR MARK EVENT ON TWO-DIMENSIONAL SPACE - A storage method and a search method for mark events on two-dimensional space are provided. First, an event and a corresponding coordinate thereof are retrieved. Next, calculation on the coordinate of the event is performed to generate an index representing a bucket position in a storage device. Next, whether or not there is any existing search tree stored in the bucket position is judged, and then the event is inserted into a linked list of a node of the search tree stored in the bucket position according to a judgment result. Besides, when a range on the two-dimensional space is designated, corresponding nodes in the search tree are rapidly accessed according to the index obtained by a hash function, and further by application of pointers pointing to the bucket position having the search tree stored therein and by real-time return of search result, the search speed is high.08-20-2009
20090210417SEARCH ENGINE FEEDBACK FOR DEVELOPING RELIABLE WHOIS DATABASE REFERENCE FOR RESTRICTED SEARCH OPERATION - A system and method monitors and weeds out illegitimate/illegal websites during search engine indexing and domain name registration. The whois database generated during domain name registration is used as a reference database for correlation with a database generated by the search crawler on a search engine server. A whois analyzer from the search engine server extracts a set of URLs into a database called the uncorrelated URL database. The uncorrelated URL database contains those URLs from both the aggregate whois database and reverse index database after removing common URLs. The uncorrelated URLs are contacted and advised by the whois administrator to take necessary action to be listed in the whois database and properly be indexed during search engine crawling. This process ensures that every URL is properly registered and identified on the Internet thus eliminating the success of illegal/unwanted websites.08-20-2009
20090210413K-NEAREST NEIGHBOR SEARCH METHOD, K-NEAREST NEIGHBOR SEARCH PROGRAM, AND K-NEAREST NEIGHBOR SEARCH DEVICE - Provided is a k-nearest neighbor search method of searching for a query number k of nearest points to an arbitrary point in a DBMS for creating a spatial index from multidimensional points, comprising setting search conditions, judging which of a lowest branch and an intermediate branch of the spatial index a nearest region to the query point is, calculating, when the nearest region is judged to be the lowest branch, a distance between the query point and a child region of the nearest region, storing information of a divided region which has become a calculation target, calculating, when the nearest region is judged to be the intermediate branch, a distance between the query point and a point included in the nearest region, storing information of the point which has become a calculation target, finishing search processing when the search conditions are satisfied, and obtaining a search result from the DBMS.08-20-2009
20090012952APPARATUS AND METHOD FOR LOCATING A TARGET ITEM IN A LIST - Process for locating target item in a list including a plurality of items in sequence from 1 to m, including: (a) defining a range within the list including the target item, including: i) identifying a first item at location “n”, where 1≦n≦m; ii) identifying a last item at location “p” where n≦p≦m; (b) identifying and displaying an item at location D=((p−n)/2+n); (c) determining location of target relative to “D”. If the target is at or adjacent “D”, terminate the process and display the target item; if the target is between “n” and “D”, resetting “p” to “D”, repeat steps (b) and (c); if the target item is between “D” and “p”, resetting “n” to “D”, repeat steps (b) and (c); until the target item is at or adjacent the location “D”.01-08-2009
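The range-halving process in entry 20090012952 above resembles a binary search with a neighbor check. A minimal sketch follows, assuming the list is sorted and that "at or adjacent" means positions D-1, D, or D+1; the function name `locate`, the use of integer division for D, and the comparison standing in for the user's judgment are assumptions of this example.

```python
def locate(items, target):
    """Return the 1-based position of target in sorted items, or None."""
    n, p = 1, len(items)              # range bounds, 1-based as in the abstract
    while n <= p:
        d = (p - n) // 2 + n          # D = (p - n)/2 + n, with integer division
        # Step (c): stop if the target is at or adjacent to D.
        for pos in (d - 1, d, d + 1):
            if 1 <= pos <= len(items) and items[pos - 1] == target:
                return pos
        if p - n <= 1:                # range exhausted without a match
            return None
        if target < items[d - 1]:
            p = d                     # target lies between n and D
        else:
            n = d                     # target lies between D and p
    return None


fruit = ["apple", "banana", "cherry", "date", "fig", "grape", "kiwi"]
position = locate(fruit, "grape")
```

In the abstract the comparison against the displayed item is made interactively; here a simple ordering comparison takes its place so the loop can run unattended.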
20090006384SYSTEM AND METHOD FOR MEASURING THE QUALITY OF DOCUMENT SETS - Systems and methods are described that calculate the interestingness of a set of one or more records in a database, either absolutely (i.e., compared to an overall collection of records) or relative to some other set of records. In one embodiment, the measure is a relative entropy value that has been normalized. Various applications of the measure are described in the context of an information retrieval system. These applications include, for example, guiding query interpretation, guiding view selection and summarization, intelligent ranges, event detection, concept triggers and interpreting user actions, hierarchy discovery, and adaptive data mining.01-01-2009
20090100038Information Analysis System - An information analysis system is provided, which includes a data loading unit to retrieve data from a document, a storage unit to store the data, a correlation analysis unit to compute at least one correlation index to represent the correlation between the data stored at the storage unit, and a mapping unit to show the correlation between the data on a map based on the correlation index. As a result, technical trends or prospect technology are analyzed.04-16-2009
20090049040SYSTEM AND METHOD FOR SEMANTIC ASSET SEARCH IN A METADATA REPOSITORY - Embodiments of the invention are generally related to semantic search and service metadata repositories, particularly with regards to methods and systems for performing a semantic asset search in a service metadata repository. One embodiment includes identifying service metadata assets with similar metadata, relationships, and categorizations to the service metadata assets with the most relevant keywords and identifying service metadata assets that have been used in conjunction with the one or more selected service metadata assets.02-19-2009
20090150388NLP-based content recommender - Methods, techniques, and systems for using natural language processing to recommend related content to an associated text segment or document. Example embodiments provide a NLP-based content recommender (“NCR”) which uses NLP-based search techniques, potentially in conjunction with context or other related information, to locate and provide content related to entities that are recognized in the associated material. NCRs may be embedded as widgets, for example on Web pages to assist users in their perusal and search for information, provided by means of browser plug-ins or other application plug-ins, provided in libraries or in standalone environments, or otherwise integrated into other code, programs, or devices. This abstract is provided to comply with rules requiring an abstract, and it is submitted with the intention that it will not be used to interpret or limit the scope or meaning of the claims.06-11-2009
20090012953Method and system for continuous, dynamic, adaptive searching based on a continuously evolving personal region of interest - Embodiments of the present invention are directed to flexible, user-adapted, continuous searching, on behalf of a particular user, for points of interest relevant to the user's current location within a specifically computed personal region of interest. In a general case, the personal region of interest is computed as a function of the user's level of disposition towards the searched-for points of interest. The level of disposition towards the searched-for points of interest may, in turn, be based on two or more of the user's location, the current date and time, a history of the user's interaction with the POI-searching system, including user-initiated searches and user selections from displayed search results, a user profile developed for, and continuously updated on behalf of, the user, and a current context for the search, as specified by a search query or by other context-specifying means. The personal region of interest generally defines an abstract area, volume, or hypervolume within which method and system embodiments of the present invention search for points of interest.01-08-2009
20090006381INFORMATION SEARCH DEVICE, INFORMATION SEARCH METHOD, AND INFORMATION SEARCH PROGRAM - A systematic problem search unit searches information about a systematic problem that is a common problem to a plurality of projects, using a conditional expression for searching the information about the systematic problem based on one of or a plurality of a count by which the information about the systematic problem is stored in the retrospect storage unit, a count by which a trial improvement plan linked to the systematic problem is stored in the retrospect storage unit, and information indicating whether the stored improvement plan is stored in the retrospect storage unit, as a conditional expression indicating a condition for extracting the information about the systematic problem. A systematic problem output unit outputs a search result in the systematic problem search procedure to each of or one of a predetermined storage unit and a predetermined output unit.01-01-2009
20090222433METHODS AND SYSTEMS FOR SEARCHING DATA BASED ON ENTITIES RELATED TO THE DATA - Systems and methods classify, organize, and retrieve data from a variety of applications based on entities associated with the data. A data classification module is configured to retrieve stored information from a repository. The data classification module is configured to receive a request to retrieve the stored information. The data classification module is configured to search the repository based on the request. Based on the search, the data classification module is configured to retrieve stored information from the repository. The data classification module is configured to provide the retrieved information to a requester of the information. For example, the data classification module can be configured to provide the retrieved information in a series of interactive cascading menus.09-03-2009
20090222429Service identification in legacy source code using structured and unstructured analyses - Identifying service candidates in legacy source code, including a source code analyzer performing structured and unstructured analyses of computer software source code procedures, a repository storing results of the analyses, a target profile analyzer analyzing a target service description of a Service Oriented Architecture and formulating a query therefrom, a search module querying the repository to identify source code elements that match the target service description, and combining any matches within a predefined distance from each other within the source code, a ranking engine ranking the combined matches in accordance with predefined heuristics, and a procedure aggregator aggregating the combined matches by their location in propinquity to the procedures, comparing interface definitions defined for the service description to entry and exit points of the procedures to identify candidate procedures having similar input and output parameters, and producing a ranked list of candidate procedures that map into the target element.09-03-2009
20090204610DEEP WEB MINER - Systems, computer implemented methods and computer program products are provided for selectively capturing and/or evaluating information including content and metadata from across a network such as the “world wide web” (WWW), or more generally, the Internet. A deep web mining tool may be utilized to exploit the deep web by understanding forms, search engines and results pages. Moreover, the deep web mining tool may be utilized to extract and exploit structured and unstructured content and metadata from web sites and documents, generate queries, capture and re-link web sites, crawl through web sites and non-HTML files and perform other aspects of obtaining and/or evaluating information.08-13-2009
20090204608PRODUCT PLACEMENT ENGINE AND METHOD - A product placement engine and method for automatically identifying products for association with a document, the engine including a parser, an analysis module adapted to determine word scores and to adjust the word scores of the words by predetermined weightings, a keyword constructor module adapted to construct a keyword query search string using words having the highest word scores, a search engine adapted to search a products database having product records to identify products satisfying the keyword query search string and assign product scores, and a post processing module adapted to identify word matches in each of the product records and the document and update the product score.08-13-2009
20090204603MULTI-CHANNEL CONTENT MODELING SYSTEM - A service delivery platform receives a request for a catalogue. The system obtains subscriber-specific multi-media catalogue entries based on profile information stored with the service delivery platform. The system sends the subscriber-specific catalogue entries along with service details of the subscription back to the subscriber.08-13-2009
20090204606FILE MANAGEMENT SYSTEM, FILE MANAGEMENT METHOD, AND STORAGE MEDIUM - A file management apparatus which makes it possible to cache a file registered in a file server from an initial stage of registration thereof. When a file is newly registered in the file server, the file server extracts the feature elements of the file, and searches for a registered file having a high degree of similarity to the file being registered in response to the current registration request. The search is performed based on the feature elements and an access log. Then, the file server searches for a domain from which access has been made to the file at least a predetermined number of times, and copies and registers the newly registered file also in a cache server of the domain.08-13-2009
20090006386SYSTEM AND METHOD FOR MEASURING THE QUALITY OF DOCUMENT SETS - Systems and methods are described that calculate the interestingness of a set of one or more records in a database, either absolutely (i.e., compared to an overall collection of records) or relative to some other set of records. In one embodiment, the measure is a relative entropy value that has been normalized. Various applications of the measure are described in the context of an information retrieval system. These applications include, for example, guiding query interpretation, guiding view selection and summarization, intelligent ranges, event detection, concept triggers and interpreting user actions, hierarchy discovery, and adaptive data mining.01-01-2009
20090006365IDENTIFICATION OF SIMILAR QUERIES BASED ON OVERALL AND PARTIAL SIMILARITY OF TIME SERIES - Techniques for identifying similar queries based on their overall similarity and partial similarity of time series of frequencies of the queries are provided. To identify queries that are similar to a target query, the query analysis system generates, for each query, an overall similarity score for that query and the target query based on the time series of the query and the target query. The query analysis system also generates, for each query, partial similarity scores for the query and the target query based on various time sub-series of the overall time series of the queries. The query analysis system then identifies queries as being similar to the target query based on the overall similarity scores and the partial similarity scores of the queries.01-01-2009
20090006358SEARCH RESULTS - A technique for the creation of synthesized results from multi-query searches to provide more relevant information to the user in a more useful format and to discard or reduce in relevancy information that is not so useful. It can determine which queries belong to the search based on parameters in the queries or results. It also provides mechanisms for supporting exploratory searches including: saving/restoring search context; search-specific query history; a “keepers” bin for storing useful results; elimination of redundant results; re-ranking of common search results; integration of searching with navigation; pivoting on search results; collaboration among multiple searchers; user-generated content; generation of hypotheses; re-executing queries and executing standing queries; multi-monitor searching and automatic preparation of search summaries.01-01-2009
20090106241SEARCH CRITERIA CONTROL SYSTEM AND METHOD - A method and system is provided for controlling search criteria when searching databases using active controls. In one aspect, a search criteria control bar (SCCB) displays results of a search by identifying category selections and keywords. Category selections may be identified by a unique delimiter and any keywords may also be identified by another unique identifier. A user may optionally narrow a search by selecting any active category or active keyword(s) that may be identified in the results summary by simply clicking on the appropriate choice. This may cause only those pages associated with the selected keyword or category to be displayed. Conversely, a user may alter a search by eliminating a keyword or category from the results by a one-click action. Further, the search and results may be limited by user preferences. In this manner, a user may be able to intuitively control searches with more refinement and efficiency.04-23-2009
20090106238Contextual Searching of Electronic Records and Visual Rule Construction - A web-based system for visual construction of logical rules includes a server, a network, and a client operatively connected to the server via the network. The server includes a database and a search engine. The client includes a web-based visual rule building application including selectable windows for displaying and visually editing terms, logical operators, and logical rules for storage in the database. The logical rules are generated by visually selecting at least one of the terms and logical operators from the windows. The server may further include a search engine configured to perform at least one of a direct search or a contextual search for an entered query string in records stored in the database and the client may include a visual interface for displaying results of the searches. The search results generated by the search engine may be stored as terms in the database for subsequent rule generation.04-23-2009
20090106237SYSTEM AND METHOD FOR DYNAMICALLY CUSTOMIZING WEB PAGE CONTENT - A method for providing customized web page content. The method includes receiving information from a referring uniform resource locator (URL) including information relating to a first search performed by a user, processing specific terms in the referring URL to produce a customized web page based upon the terms in the referring URL, and presenting the customized web page to the user.04-23-2009
20090106235Document Length as a Static Relevance Feature for Ranking Search Results - Embodiments are configured to provide information based on a user query. In an embodiment, a system includes a search component having a ranking component that can be used to rank search results as part of a query response. In one embodiment, the ranking component includes a ranking algorithm that can use the length of documents returned in response to a search query to rank search results.04-23-2009
20090106230Query dependent link-based ranking - Query dependent ranking uses weighted edges in a stochastic approach for link structure analysis (SALSA) technique. Functions describing the weights of edges into and out of a vertex are determined to define transition probability functions. The transition probability functions are used to compute authority scores for each uniform resource locator (URL) u in a base set to rank a result set to a received query.04-23-2009
20090106240SUPPLIER IDENTIFICATION AND LOCATOR SYSTEM AND METHOD - A supplier identification and locator system that allows a user to identify a supplier of goods or services over the Internet; the system includes at least one directory Web site having a domain name that is at least partially descriptive of a class of goods or services. The directory Web site has a plurality of links that access suppliers' Web sites; a supplier descriptive portion located substantially adjacent to the link; a descriptive title portion substantially corresponding to the class of goods or services described in the domain name; a rollover window that displays information about at least one supplier; and an input receiving area where a user inputs data and ranked search results are displayed.04-23-2009
20090106236Method for scoring products, services, institutions, and other items - The present invention relates to a method of scoring an item, including reviewing a first editorial evaluation of the item; assigning a first editorial evaluation score to the item based on the first editorial evaluation; reviewing a second editorial evaluation of the item; assigning a second editorial evaluation score to the item based on the second editorial evaluation; and calculating a score for the item based on the first editorial evaluation score and the second editorial evaluation score. The item score may also be calculated based upon component scores for the item or quantitative ratings.04-23-2009
20090106232BOOSTING A RANKER FOR IMPROVED RANKING ACCURACY - A system described herein includes a trainer component that receives an estimated gradient of cost that corresponds to a first ranker component with respect to at least one training point and at least one query. The trainer component builds a second ranker component based at least in part upon the received estimated gradient. The system further includes a combiner component that linearly combines the first ranker component and the second ranker component.04-23-2009
20090106234APPARATUS AND METHODS FOR WEB MARKETING TOOLS AND DIGITAL ARCHIVES - WEB PORTAL ADVERTISING ARTS - This invention relates to the creation of a software application to: facilitate the creation, representation and publication of digital objects; in particular, methods and apparatus that improve digital resource retrieval on the part of end users and to provide a new system for the web based marketing of digital assets and the online distribution of metadata enriched advertising.04-23-2009
20090106231Query dependant link-based ranking using authority scores - Query dependent ranking uses an authority score. A base set is determined as the union of a result set to a received query, an inlinking-set, and an outlinked-set. The inlinking-set is determined by sampling a predetermined number of uniform resource locators (URLs) linking to each result. The outlinked-set is determined by sampling a predetermined number of URLs linked to by each result. A neighborhood graph consists of the vertices of the base set and the edges between the vertices in the base set. An authority score for each URL in the base set is computed using a Stochastic Approach to Link Structure Analysis (SALSA) technique. The authority scores are used to rank the result set.04-23-2009
20090106222Listwise Ranking - Procedures for learning and ranking items in a listwise manner are discussed. A listwise methodology may consider a ranked list, of individual items, as a specific permutation of the items being ranked. In implementations, a listwise loss function may be used in ranking items. A listwise loss function may be a metric which reflects the departure or disorder from an exemplary ranking for one or more sample listwise rankings used in learning. In this manner, the loss function may approximate the exemplary ranking for the plurality of items being ranked.04-23-2009
20090106229Linear combination of rankers - Described herein is a system that includes a receiver component that receives first scores for training points and second scores for the training points, wherein the first scores are individually assigned to the training points by a first ranker component and the second scores are individually assigned to the training points by a second ranker component. The apparatus further includes a determiner component in communication with the receiver component that automatically outputs a value for a parameter α based at least in part upon the first scores and the second scores, wherein α is used to linearly combine the first ranker component and the second ranker component.04-23-2009
20090106228METHOD AND APPARATUS FOR PROVIDING A USER TRAFFIC WEIGHTED SEARCH - A method and apparatus for providing a user traffic weighted search in a network are disclosed. For example, the method receives a query from a customer and determines whether the customer has opted-in for a service for traffic data monitoring. The method then provides one or more search results to the customer in response to the query, where the one or more search results are prioritized in accordance with collected user usage data if the customer has opted-in for the service for traffic data monitoring.04-23-2009
20090106227Lubrication Program Management System and Methods - A system for scheduling a plurality of selected maintenance tasks. The system comprises one or more storage media and a processor. The one or more storage media store data indicative of a plurality of maintenance points, a plurality of task templates, and a plurality of maintenance task definitions as associations between maintenance points and task templates. At least one maintenance point has a plurality of maintenance point parameters and is associated with at least one task template having a plurality of task parameters, such that upon accessing at least one of the maintenance task definitions, such maintenance task definition is dynamically generated from the plurality of maintenance point parameters of the at least one maintenance point and from the plurality of task parameters of the at least one task template. The processor selectively applies one or more queries to the stored data to generate an assignment including one or more selected maintenance tasks. The one or more queries have a plurality of filter criteria and a plurality of logical relationships defined between the filter criteria to selectively include maintenance task definitions matching the one or more queries and exclude maintenance task definitions not matching the one or more queries. The system further comprises at least one means for outputting the generated assignment.04-23-2009
20090106226SEARCH SHORTCUT PULLQUOTES - A system for assisting users in obtaining objective information about consumer products is provided. The system includes pullquotes within a listing of ordered search results, where the pullquotes are extracted from independent objective consumer reviews.04-23-2009
20090106223ENTERPRISE RELEVANCY RANKING USING A NEURAL NETWORK - A neural network is used to process a set of ranking features in order to determine the relevancy ranking for a set of documents or other items. The neural network calculates a predicted relevancy score for each document and the documents can then be ordered by that score. Alternate embodiments apply a set of data transformations to the ranking features before they are input to the neural network. Training can be used to adapt both the neural network and certain of the data transformations to target environments.04-23-2009
20090106221Ranking and Providing Search Results Based In Part On A Number Of Click-Through Features - Embodiments are configured to provide information based on a user query. In an embodiment, a system includes a search component having a ranking component that can be used to rank search results as part of a query response. In one embodiment, the ranking component includes a ranking algorithm that can use one or more click-through features to rank search results which may be returned in response to a query. Other embodiments are available.04-23-2009
20090204602APPARATUS AND METHODS FOR PRESENTING LINKING ABSTRACTS FOR SEARCH RESULTS - Disclosed are apparatus and methods for providing linking abstracts for a plurality of search results. In certain embodiments, an abstract of a listed search result is revised to include links to locations within the associated search result document that are proximate to one or more abstract portions. When the user selects a particular linkable abstract portion within a particular listed search result, the user is then provided with the corresponding location within the particular search result document. That is, the linked abstract portion is caused to be presented to the user.08-13-2009
20090204594Recommendation System for Assisting Mashup Developers at Build-Time - A recommendation system exploits a repository of mashups to provide design-time assistance to the user through relevant suggestions as to what outputs can be generated along with the best plans to generate those outputs. An output ranker ranks the outputs of the system based on their popularity scores, and a planner uses metric planning algorithms and a configurable utility function. The system takes into account popularity and semantic similarity when recommending services and sources.08-13-2009
20090204600MOBILE RECOMMENDATION AND RESERVATION SYSTEM - Apparatus and methods are described for assisting a requester to identify and reserve a resource that meets the need of the requester's request. For example, a request can be augmented using augmentation data, such as current position and requester preferences, and resources are located and ranked according to their match against the augmented request. A person who is the requester may engage in another attention demanding task, such as driving a car, while resources well matched to preferences are located. Preference data may be determined from social network data, a preference file, and/or other sources.08-13-2009
20090198687PLAN SOLVER - Systems and methods for supply chain management and identification of feasible plans. Identification of feasible plans includes simultaneous breadth and depth satisfaction of demands. Demands are satisfied using multiple sources of supply, consideration of substitute items, generation of supply, and/or reallocation of supply previously pegged for satisfaction of a lower priority demand. Reallocation optionally includes consideration of items and demands associated with multiple level codes.08-06-2009
20090198684System and Method for Determining Semantically Related Terms - The present disclosure is directed to systems and methods for determining semantically related terms. Generally, one or more seed terms are received from a user. A system searches a first index comprising a plurality of terms and one or more webpages associated with each term of the plurality of terms to determine a plurality of webpages associated with the seed terms. The system then searches a second index comprising a plurality of webpages and one or more terms associated with each webpage of the plurality of webpages to determine a plurality of potential terms associated with the plurality of webpages associated with the seed terms. At least one term of the plurality of potential terms is suggested to a user.08-06-2009
20090198683FAST ADAPTIVE DOCUMENT FILTERING - Data structures, stored on various types of computer-readable media, include information related to user profiles and/or to various documents. The information included in these data structures is arranged and stored in a manner that allows for rapid user profile updating to be performed as new or changed documents are processed in a document filtering system.08-06-2009
20090198681REAL PROPERTY EVALUATION AND SCORING METHOD AND SYSTEM - A method for evaluating a parcel of real estate includes recording a location and intended use of the parcel, generating a geocoded graphic of an area including the parcel, and defining a trade area. Boundaries of the trade area are demarcated on the graphic, and databases are accessed to determine characteristics of the trade area. The characteristics are processed into an objective score. A customized report is generated including the score and an analysis of the characteristics, and is displayed on the computer in a downloadable format. A server for generating the report includes a computer-readable medium, a processor, and an algorithm. The algorithm records a location and intended use of the parcel on the computer-readable medium, determines a trade area, accesses data sources to determine a set of trade area characteristics, and processes the characteristics using the processor to calculate an objective score based on the intended use.08-06-2009
20090198678SYSTEMS, METHODS, AND SOFTWARE FOR ENTITY RELATIONSHIP RESOLUTION - To facilitate access to public records, the present inventors devised, among other things, an entity resolution system. The exemplary system includes a master records database of 300 million entities, which is partitioned into multiple distinct portions. The exemplary system extracts entity information from input public records and constructs one or more blocking queries against specific portions of the master records database to identify one or more sets of candidate records. Feature vectors are defined for the candidate records and machine learning techniques, such as Support Vector Machine, are used to determine which of the candidate records from the master records database match the input public records. Candidate records that match are logically associated with public records, enabling ready access via direct or indirect queries.08-06-2009
20090198677Document Comparison Method And Apparatus - A document comparison and identification method comprises the steps of: identifying (S08-06-2009
20090198673Forum Mining for Suspicious Link Spam Sites Detection - An anti-spam technique for protecting search engine ranking is based on mining search engine optimization (SEO) forums. The anti-spam technique collects webpages such as SEO forum posts from a list of suspect spam websites, and extracts suspicious link exchange URLs and corresponding link information from the collected webpages. A search engine ranking penalty is then applied to the suspicious link exchange URLs. The penalty is at least partially determined by the link information associated with the respective suspicious link exchange URL. To detect more suspicious link exchange URLs, the technique may propagate one or more levels from a seed set of suspicious link exchange URLs generated by mining SEO forums.08-06-2009
20090198671SYSTEM AND METHOD FOR GENERATING SUBPHRASE QUERIES - A system for generating subphrase queries. The system includes a sequence label modeling engine and a regression modeling engine. The sequence label modeling engine generates a plurality of subphrase queries by indexing through each token in a search phrase and labeling each token based on an association to other tokens in the search phrase. The regression modeling engine scores each subphrase query at least partially on the association according to a scoring model. The regression modeling engine identifies the subphrase query with the highest score which may then be used for identifying a sponsored search list or a web search item.08-06-2009
20090198669CONFIGURATION-BASED SEARCH - A system that tunes search results is presented. During operation, the system receives content to be searched. The system then iteratively performs the following operations until search results meet specified criteria. The system generates an index of the content based on a set of configuration parameters. Next, the system performs a search against the index to produce the search results. The system then determines whether the search results meet the specified criteria. If the search results do not meet the specified criteria, the system modifies one or more of: the set of configuration parameters; and the content. If the search results meet the specified criteria, the system saves the set of configuration parameters into a configuration file which can be used to generate the index for the content.08-06-2009
20090248683Method and Apparatus for Enhancing Electronic Reading by Identifying Relationships Between Sections of Electronic Text - An apparatus, method and article of manufacture of the present invention detects the presence of references to the same concept in separate sections of text, and, with no input required from the reader, presents the reader with information concerning the detected references to the concept. The information provided may comprise information related to the location of the reference to the concept in other sections of text, and the reader also is provided the ability to move from one reference to a concept directly to another reference to the same concept.10-01-2009
20090248677Methods for generating a personalized list of documents associated with a search query - A method for generating, by a monitoring program, a list of relevant documents, related to a search query, comprising tracking activities of a first user pursuant to retrieval of a first list of primary documents resulting from a first search query; assigning a user interest score to a secondary document whose identifier is referenced within the contents of a primary or another secondary document; adding said identifier and score to a list of relevant documents wherein said list is associated with said first query and said first user; persisting said list of relevant documents to store.10-01-2009
20090248676CONTENT MANAGEMENT DEVICE, CONTENT MANAGEMENT SYSTEM, AND CONTENT MANAGEMENT METHOD - A content management device for managing acquired content in a searchable manner, includes a storage device that stores a search database in which a plurality of keywords and content are registered; a registration unit that registers new content in the search database; and a search unit that searches content registered in the search database. The registration unit performs operations including: extracting a plurality of keyword candidates associated with the new content from the search database; displaying the extracted keyword candidates; and registering a keyword candidate designated from among the displayed keyword candidates in the search database, as a search keyword in association with the new content. The search unit performs operations including: displaying a plurality of search keywords registered in the search database; extracting content associated with a search keyword designated from among the displayed search keywords from the search database; and displaying the extracted content.10-01-2009
20090248668Learning Ranking Functions Incorporating Isotonic Regression For Information Retrieval And Ranking - Embodiments of the present invention provide for methods, systems and computer program products for learning ranking functions to determine the ranking of one or more content items that are responsive to a query. The present invention includes generating one or more training sets comprising one or more content item-query pairs and determining one or more contradicting pairs in a given training set. An optimization function to minimize the number of contradicting pairs in the training set is formulated and modified by incorporating a grade difference between one or more content items corresponding to the query in the training set and applied to each query in the training set. A ranking function is determined based on the application of regression trees on the queries of the training set minimized by the optimization function and stored for application to content item-query pairs not contained in the one or more training sets.10-01-2009
20090248671INFORMATION CLASSIFICATION SYSTEM, INFORMATION PROCESSING APPARATUS, INFORMATION CLASSIFICATION METHOD AND PROGRAM - An information classification system includes a server that includes a knowledge base that receives classification information to be classified, conducts language analysis of the classification information to acquire a plurality of keywords and classify the plurality of keywords into elements made up of a classification target word and a related word that modifies the classification target word, and conduct a search with the related word being for a judgment condition for decision of the classification while separating the classification target word from the related word, so as to assign a classification identification value to the information; a classification candidate extraction section that extracts the classification identification value that the knowledge base assigns to generate an automatic classification result; and a classification update section that receives the automatic classification result, displays a GUI for classification confirmation allowing confirmation of correctness of the automatic classification result, and corrects registered items in the knowledge base with a correction value received through the GUI for classification confirmation while referring to log data that is a processing history about automatic classification for the language analysis and the element classification.10-01-2009
20090248662Ranking Advertisements with Pseudo-Relevance Feedback and Translation Models - Methods, computer products, and systems for selecting advertisements in response to an internet query are provided. The method provides for receiving an internet query that includes query terms, retrieving and then ranking a first set of advertisements in response to the internet query using a query likelihood model. The method then selects sampling words using pseudo-relevance feedback and translation models, the internet query, and the first set of ad materials obtained using the query likelihood model. The sampling words are chosen from a distribution of words from the words in the first set of ad materials, and the pseudo-relevance feedback model is used to select a word (w) in the distribution of words based on a probability that word w generates query term q(p(q|w)). The translation model is used to calculate the probability p(q|w) based on a translation probability that w translates into q(t(q|w)). The method also includes retrieving and ranking a second set of ad materials using an expanded query formed by adding the selected sampling words to the original internet query. The second set of ad materials is then presented to the user. The use of translation models enhances the topicality of the results because the distribution words selected are related to the terms in the original query as indicated by their translation probabilities.10-01-2009
20090248656Search Engine Relevance Tuning Based on Instant Messaging (Influence Search Results Using IMS) - To provide up-to-date search results containing Internet addresses that have become extremely popular very recently, search engines fine-tune search result rankings using communications sent by users of real-time messaging systems to each other. Instant messaging systems are one type of real-time messaging systems. Search engines use a URL found in instant messages to promote the ranking of Internet addresses and to refresh abstracts and caches. Similar demographics between the search engine user and senders of instant messages might be a requisite for promotion. The number of hops taken by a URL among instant messaging users might determine the extent of the promotion. To prevent unfair manipulation of search results, a URL should hop a threshold number of times. Call centers also promote the rankings of knowledge articles presented to call center operators based on how often keywords related to each knowledge article are detected in a conversation with a caller.10-01-2009
20090210416SEARCH ENGINE USING WORLD MAP WITH WHOIS DATABASE SEARCH RESTRICTIONS - A search operation can provide geographically restricted and verified information to a user. A two-step approach is used to perform these searches. The first step is to obtain high relevance search results by searching only in a specific region defined for a search operation. The second step further improves the quality of the search results by performing contact address correlation. If the search server finds a reliable reference address in the search results, then these search results can be presented to the user, whereby search results that are not correlating well with legitimate and registered addresses for the site are removed from the search result lists. Therefore, the region-restricted search does searching in a selected geographical region and only presents legitimate web pages or search results to a user. Thus, the region-restricted search operation improves quality and may minimize search time and reduce a huge volume of non-valued Internet traffic, which is likely to impair the overall performance and experience on the Internet.08-20-2009
20090210415MEDIASET GENERATION SYSTEM - Disclosed are various embodiments of systems and methods for generating composite mediasets from mediasets, each comprising media items, associated with a plurality of users. In some embodiments, individual and/or group recommendations are provided for creating a group playlist by aggregating user taste data for a plurality of users in a group. In other embodiments, systems and methods are provided which allow for sharing and playing of a group playlist by users in a group, each of which has a media playback device. Each media item, such as a song, is played from one of the individual user devices for the benefit of all users in the vicinity at the time. Music thus can be “shared” without transferring files potentially in violation of copyrights.08-20-2009
20090210414Bit string searching apparatus, searching method, and program - Bit string searching apparatus using a coupled node tree with a root node and a node pair stored in adjacent areas that is formed by a branch node and a leaf node, branch nodes, or leaf nodes; the branch node including a discrimination bit position in the search key and information indicating a position of a primary node that is one node of a node pair; the leaf node including an index key formed by a bit string; from the root node of an arbitrary subtree of the coupled node tree, linking is repeated based on the search key's bit value at the discrimination bit position and information indicating a position of a primary node until a leaf node is reached; an index key stored in the leaf node is obtained as a search result key of the subtree by means of the search key.08-20-2009
20090210412Method for searching and indexing data and a system for implementing same - A system and method for processing a plurality of data to identify and search words contained within the plurality of data, wherein prior knowledge of the data format is unknown, is provided. The method includes identifying words within the data, wherein identifying includes processing the data to identify words, prior to searching. The method also includes storing the words in a predetermined manner and searching the words, wherein searching includes searching the words responsive to at least one search term to identify match results and processing the match results to at least one of save the match results to a file and display the match results.08-20-2009
20090210409INCREASING ONLINE SEARCH ENGINE RANKINGS USING CLICK THROUGH DATA - A method for providing keywords for a web page so as to increase online search engine rankings of the web page is provided. The method includes detecting click-throughs to the web page from a link in a search result list of an online search engine. The method further includes collecting data for each click-through, including: a) at least one keyword entered into an online search engine by a user to produce the search result list from which the click-through originated and b) a position value. The method further includes assigning a score to each keyword based on a number of words in the keyword and position values associated with each keyword. The method further includes providing keywords with a score that meets a predefined threshold as a suggestion for improving search engine rankings of the web page.08-20-2009
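The keyword scoring step in the entry above can be sketched as follows. The abstract says only that the score depends on the number of words in the keyword and on the associated position values; the concrete formula here (word count divided by result position, summed over click-throughs) is an assumption chosen for illustration, and the function name and input layout are hypothetical.

```python
from collections import defaultdict


def suggest_keywords(click_throughs, threshold):
    """Score keywords from click-through records and keep those meeting a threshold.

    `click_throughs` is a list of (keyword, position) tuples, where position
    is the 1-based rank of the clicked link in the search result list.
    Assumed scoring rule (not from the abstract): each click-through adds
    word_count / position, so longer phrases and higher-ranked clicks count more.
    """
    scores = defaultdict(float)
    for keyword, position in click_throughs:
        word_count = len(keyword.split())
        scores[keyword] += word_count / position
    # Return suggestions in a stable, alphabetical order.
    return sorted(k for k, s in scores.items() if s >= threshold)


clicks = [("red shoes", 1), ("red shoes", 2), ("shoes", 5)]
suggestions = suggest_keywords(clicks, threshold=1.0)
```

With these sample records, "red shoes" accumulates 2/1 + 2/2 and "shoes" accumulates 1/5, so only the first meets the threshold of 1.0.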
20090210411Information Retrieving System - A user speech analyzing component poses, to a user, question sentences for respective ones of a plurality of attributes, and analyzes an attribute value for each of the attributes from an answer sentence from the user to the question sentence. A user data holding component, as a result of analysis, holds user data that allows the plurality of attributes, and respective user attribute values for the attributes to correspond to one another. A matching component, when an acquisition ratio of the attribute values from the user with respect to all of the attributes is a predetermined value or greater, selects at least one target data candidate that matches each of the attributes and each of the attribute values of the user data, from a plurality of target data. A dialogue control component outputs each of the target data candidates selected, to the user's side.08-20-2009
20080275868Graphic User Interface for the Construction of Complex Search Queries - A web application and a method for creating complex query strings for conducting searches through at least one database comprising structured documents that are structured in content-fields.11-06-2008
20090222431SYSTEM AND/OR METHOD FOR PERSONALIZATION OF SEARCHES - The subject matter disclosed herein relates to a system and/or method for providing enhanced content search results based on metrics indicating user affinity for an information site such as a web site. Information on user visits to a particular web site may be accumulated, for example, in connection with a beacon or other tracker placed on the publisher web site. The enhanced content may be provided by the publisher web site or may be generated otherwise.09-03-2009
20090222443Method and system for matching organizations based on profile and criteria - A means of matching diverse organizations with relevant partners such as retailers with suppliers, manufacturers with distributors, and universities and government agencies with service providers. Provides a bilateral match where the needs of 2 matched organizations are met based on specific organizational profile and desired partner criteria.09-03-2009
20090222442USER-DIRECTED NAVIGATION OF MULTIMEDIA SEARCH RESULTS - A method and apparatus for timed tagging of content is featured. The method and apparatus can include the steps of, or structure for, obtaining at least one keyword tag associated with discrete media content; generating a timed segment index of discrete media content, the timed segment index identifying content segments of the discrete media content and corresponding timing boundaries of the content segments; searching the timed segment index for a match to the at least one keyword tag, the match corresponding to at least one of the content segments identified in the segment index; and generating a timed tag index that includes the at least one keyword tag and the timing boundaries corresponding to the at least one content segment of the discrete media content containing the match.09-03-2009
20090222440SEARCH ENGINE FOR CARRYING OUT A LOCATION-DEPENDENT SEARCH - The invention relates to a search engine for carrying out a search for internet pages, for which a geographic origin criterion input by the user as a search item is fulfilled. The search engine comprises: a device for searching a multitude of internet pages; a device for extracting geographic data from the searched pages, the extracted data describing the geographic origin of the page or of the page provider; a device for forming a database in which geographic data extracted from these internet pages are assigned to a multitude of searched internet pages; an input interface for inputting a search inquiry by the user, the input interface enabling the user to input a geographic origin criterion in addition to other search items; searching the database and outputting those internet pages for which the geographic origin criterion and the additional search items are fulfilled by comparing with the contents of the internet pages and with the geographic data assigned thereto.09-03-2009
20090222436PROBLEM ISOLATION THROUGH WEIGHTED SEARCH OF KNOWLEDGE BASES - A computer program product for problem isolation through a weighted search of knowledge bases includes computer useable program code that generates an aggregate relevance index which ranks the search results. The aggregate relevance index is calculated using a measure of relevance of each of said pertinent documents across all keyword searches. A method for problem isolation through the weighted search of knowledge bases comprises searching knowledge bases using extracted keywords to identify pertinent documents contained within said knowledge databases; and generating a global rank associated with each of the pertinent documents, the global rank being calculated using a measure of the relevance of each of the pertinent documents across all keyword searches and a measure of the relevance of each of the keywords to the records as a whole.09-03-2009
20090222435LOCALLY COMPUTABLE SPAM DETECTION FEATURES AND ROBUST PAGERANK - The claimed subject matter provides a system and/or a method that facilitates reducing spam in search results. An interface can obtain web graph information that represents a web of pages. A spam detection component can determine one or more features based at least in part on the web graph information. The one or more features can provide indications that a particular page of the web graph is spam. In addition, a robust rank component is provided that limits the amount of contribution a single page can provide to the target page.09-03-2009
20090222434INCLUSION OF METADATA IN INDEXED COMPOSITE DOCUMENT - Embodiments of the invention provide systems and methods for searching business objects. According to one embodiment, a method of searching one or more business objects can comprise receiving a set of search criteria and identifying attributes of the business object that match the search criteria by searching an indexed composite document representing the business object based on the search criteria. The indexed composite document can comprise an indication of a value of one or more attributes of the business object and metadata associated with at least one of the values. Searching the indexed composite document can comprise performing a keyword search on the metadata of the composite document based on the search criteria. An indication of the identified attributes of the business object can be returned ordered by relevance to the search criteria.09-03-2009
20090222432Geo Tagging and Automatic Generation of Metadata for Photos and Videos - Photo/video is geo tagged with GPS coordinates corresponding to the place of capture of said photo/video. ‘Geo-information’ metadata corresponding to GPS coordinates is automatically generated and attached to corresponding photo/video. The ‘geo-information’ metadata comprises date & time of capture and geo information metadata such as local weather, local attractions, local events etc. at the time of capture of corresponding photo/video. According to another aspect, a search engine is provided with means to crawl through one or more databases comprising ‘geo-information’ metadata attached to photos/videos and generate a result comprising photos/videos with ‘geo-information’ metadata corresponding/relevant to query input. According to another aspect, the present invention discloses apparatus, means and methods to attach one or more local advertisements to photos/videos and display the advertisement in conjunction with the corresponding photo/video on communication devices.09-03-2009
20090222430Apparatus and Method for Content Recommendation - A content recommender executes a method wherein a set of attributes are provided for a plurality of content items. A number of recommendation parameters are defined as a function of attribute values for a subset of attributes and for each content item, recommendation values are determined based on the definitions. A multi-dimensional clustering is applied to the recommendation values for the content items to generate a plurality of content clusters. Each dimension of the clustering corresponds to a recommendation parameter. The content recommender then selects a set of content clusters from the content clusters and a recommendation set of content items is generated by selecting at least one content item from each selected content cluster. The invention may allow improved recommendations to be generated and may in particular allow recommendations to be generated which reflect a number of different and possibly conflicting considerations.09-03-2009
20090254533Methods, Systems, and Articles of Manufacture for Distribution of Search Metadata - Embodiments of the invention are generally related to metadata describing users accessing a network and network content. Each user may have a user profile comprising a list of user tags describing the user. Each item of network content may include a list of content tags describing the item. When a user selects an item of network content, one or more tags from the user profile may be added to the list of content tags for the item. In some embodiments, one or more tags from the list of content tags may be added to the user profile. Therefore, over time and access by multiple users, a comprehensive list of tags describing user profiles and network content may be developed.10-08-2009
20080270381Enterprise-Wide Information Management System for Enhancing Search Queries to Improve Search Result Quality - Described are a system and method of performing an electronic search for information objects stored in a plurality of data stores. A look-up is performed of a metadata model for instances of metadata that satisfy a criterion related to a received text string. A catalog of catalog items is provided. Each catalog item is linked to one or more instances of metadata in the metadata model and is uniquely associated with an information object stored in the data stores. The catalog is searched in real time to find one or more catalog items that are linked to one or more instances of metadata found in the look-up of the metadata model. Each information object associated with a catalog item found in the search of the catalog is listed in real time.10-30-2008
20090234838SYSTEM, METHOD, AND/OR APPARATUS FOR SUBSET DISCOVERY - Embodiments of methods, apparatuses, devices and/or systems associated with subset discovery are disclosed.09-17-2009
20090234840Information Processing Method, Information Processing System, And Server - A community establishing site is connected to an information terminal via a network. When the user enters a search keyword into the information terminal, an entered-information acquiring unit acquires the search keyword through a web server, and registers it together with user information in a user information master. A community establishing unit extracts users who have entered the same search keyword, from the plurality of users registered in the user information master, and records them in a community information master, thereby establishing a community. An information extracting unit transmits the user information of other users who belong to the same community to the information terminal through the web server.09-17-2009
20090100036Methods and Systems for Classifying Search Results to Determine Page Elements - This invention relates to determining page elements to display in response to a search. A method embodiment of this invention determines a page element based on a search result. The method includes: (04-16-2009
20090254548INFORMATION PROCESSING APPARATUS AND METHOD, PROGRAM, RECORDING MEDIUM, RECOMMENDATION APPARATUS AND METHOD, AND INFORMATION PROCESSING SYSTEM - An information processing apparatus is provided whereby digital content can be viewed more comfortably and conveniently. A category information transmitter transmits, to a server, category information expressing one or more content categories associated with a recording medium loaded into the apparatus. A recommendation list presentation unit then receives and presents a recommendation list showing a server-generated list of content associated with the category information. The user selects content from the recommendation list. A selection information transmitter accepts the selection of content and then transmits, to the server, information specifying the selected content.10-08-2009
20090254547RETRIEVING APPARATUS, RETRIEVING METHOD, AND COMPUTER-READABLE RECORDING MEDIUM STORING RETRIEVING PROGRAM - A retrieving server 10-08-2009
20090254538METHODS, SYSTEMS, AND COMPUTER PROGRAM PRODUCTS FOR SOCIAL BASED ASSISTANCE IN A SOURCE CODE CONTROL SYSTEM - A method, system, and computer program product for social based assistance in a source code control system are provided. The method includes selecting a segment of source code and parsing the selected segment of source code to identify one or more syntax terms. The method also includes searching source files for the one or more syntax terms to locate matching results, where the source files are managed by the source code control system. The method further includes scoring the matching results of the searching as a function of developer activity associated with the matching results. The method additionally includes identifying one or more developers with the highest degree of matching based on the scoring.10-08-2009
20090222437CROSS-LINGUAL SEARCH RE-RANKING - Cross-lingual search re-ranking is performed during a cross-lingual search in which a search query of a first language is used to retrieve two sets of documents, a first set in the first language, and a second set in a second language. The two sets of documents are each first ranked by the search engine separately. Cross-lingual search re-ranking then aims to provide a uniform re-ranking of both sets of documents combined. Cross-lingual search re-ranking uses a unified ranking function to compute the ranking order of each document of the first set and the second set of documents. The unified ranking function is constructed using generative probabilities based on multiple features, and can be learned by optimizing weight parameters using a training corpus. Ranking SVM algorithms may be used for the optimization.09-03-2009
20090222438Method, system, and apparatus for location-aware search - Performing location-aware search involves intercepting a network request targeted for an Internet-based search engine. The network request includes a location-dependent query containing a location term, and the location term cannot be used by the search engine to positively determine a target location. A location descriptor that can be used by the search engine to positively determine a target location is determined via a location database. The location database may include a location sensor such as GPS. The network request is modified to replace the location term with the location descriptor, and the modified network request is sent to the search engine.09-03-2009
20090248673METHOD OF SORTING WEB PAGES, SEARCH TERMINAL AND CLIENT TERMINAL - A method of sorting web pages includes the steps of acquiring a plurality of forbidden keywords, receiving information of a list of web pages provided by a search engine, separating the web pages into valid web pages and invalid web pages according to forbidden keywords, rearranging the information of the valid web pages and the invalid web pages, and outputting the rearranged information of the valid web pages and the invalid web pages. A related search terminal and a client terminal are also provided.10-01-2009
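The sorting steps in the entry above (separate pages by forbidden keywords, rearrange, output) can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the patented method: pages are represented as dictionaries with hypothetical `title` and `snippet` fields, matching is a case-insensitive substring test, and "rearranging" is taken to mean listing valid pages first while preserving the search engine's order within each group.

```python
def sort_pages(pages, forbidden_keywords):
    """Move pages that mention a forbidden keyword to the end of the list.

    `pages` is a list of dicts with hypothetical 'title' and 'snippet' keys,
    in the order returned by the search engine. The relative order inside
    the valid group and inside the invalid group is preserved.
    """
    forbidden = [k.lower() for k in forbidden_keywords]
    valid, invalid = [], []
    for page in pages:
        text = (page["title"] + " " + page["snippet"]).lower()
        # A page is invalid if any forbidden keyword occurs in its text.
        if any(k in text for k in forbidden):
            invalid.append(page)
        else:
            valid.append(page)
    return valid + invalid


results = [
    {"title": "Cheap pills online", "snippet": "Buy now"},
    {"title": "Health guide", "snippet": "Advice on sleep"},
    {"title": "Casino bonus", "snippet": "Free spins"},
    {"title": "Sleep research", "snippet": "New study"},
]
ordered = sort_pages(results, ["pills", "casino"])
ordered_titles = [p["title"] for p in ordered]
```

Keeping the invalid pages at the end, instead of dropping them, follows the abstract's statement that both the valid and the invalid pages are output after rearrangement.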
20090248684METHOD AND APPARATUS FOR SEARCHING METADATA - Methods and apparatuses for searching metadata are described herein. In one embodiment, an example of a process for search metadata includes, but is not limited to, in response to a search query for metadata stored in one or more of metadata stores, the search query is partitioned into multiple search query segments. Thereafter, searches corresponding to the search query segments are performed, where each search is performed independently within the one or more metadata stores. Other methods and apparatuses are also described.10-01-2009
20090248682SYSTEM AND METHOD FOR PERSONALIZED SEARCH - A system and method is disclosed for profiling a subject's search engine keywords and results based on relevancy feedback. Because the system is based on the search behavior of the user, the profiling is language independent and balances the specificity of search terms against the profiled interests of the user. The system can also synthesize new keyword combinations to assist the user in refining the search or acquiring related content. The system has application in text mining, personalization, behavioral search, search engine optimization, and content acquisition, to name but a few applications.10-01-2009
20090248680System and Method for Sharing Anonymous User Profiles with a Third Party - The invention provides a system and method for sharing anonymous user profiles with a third party. In one aspect of the invention, the system shares user profiles with content servers on a mobile data network so that they may select content responsive to the user's profile. The system provides a store of user profiles for associating profile information with either a source IP address or mobile phone number, where the profile includes information on the user and the user's network usage. The system detects a user's transaction request and inspects it for either an IP address or phone number, which it uses to retrieve the appropriate profile. The system subsequently applies predetermined opt-out policies to determine how much of the user profile may be provided in response to the profile request. The system then returns the profile information such that the user's identity is masked.10-01-2009
20090248678INFORMATION RECOMMENDATION DEVICE AND INFORMATION RECOMMENDATION METHOD - A document set, and history documents including documents, etc., browsed by a user are input. The document set and the history documents are each analyzed to obtain characteristic vectors. A plurality of topic clusters and a plurality of sub-topic clusters are obtained by clustering the document set. A transition structure showing transitions of topics among the sub-topic clusters is generated, and a characteristic attribute is extracted from each topic cluster and each sub-topic cluster. A cluster-of-interest is extracted in comparison among characteristic vectors of the history documents and a characteristic vector of each document included in the document set, a sub-topic cluster having transition relations with the cluster-of-interest is obtained on the basis of a transition structure owned by the cluster-of-interest, and a document included in the sub-topic cluster is extracted as a recommended document to be presented together with the characteristic attribute.10-01-2009
20090248674SEARCH KEYWORD IMPROVEMENT APPARATUS, SERVER AND METHOD - A search keyword improvement apparatus includes a unit extracting a word as an additional keyword candidate from a new document, number of times of appearance of the word in the new document being greater than number of times of appearance of the word in each of a first documents except for the new document, if the new document and a new search target identification information item which is used to search the new document are accumulated, a unit generating a first search query based on an input keyword, a second search target associated with the input keyword, and one of the additional keywords, and generating a second search query, a unit moving the additional keyword candidate and the third search target identification information item, if the desired search result is selected from a third search result list corresponding to the second search query.10-01-2009
20090248670Content search engine - Search constraint specific searching for content from a mobile device is disclosed. Following a mobile device generated request for content, a content server provides for the search of content on a network service or personal computer. The search for content may occur directly through the content server or via a connector application. An index engine parses and lists structured and unstructured content, which may be responsive to the search request. The content server or a proxy then provides a sub-set of the search results, that subset corresponding to both the mobile device generated request for content and a search constraint such as mobile device capabilities or network service provider limitations.10-01-2009
20090248666INFORMATION RETRIEVAL USING DYNAMIC GUIDED NAVIGATION - An apparatus and method for providing relevant search result and query terms are disclosed herein. Natural language processing of the documents and previous search session history are used to dynamically determine document relevance, queries relevant to search categories prior to start of a search session, and query to query correlations.10-01-2009
20090248665MEDIA OBJECT QUERY SUBMISSION AND RESPONSE - Methods and systems for submitting media object queries and receiving suggested answers for the media object queries. In one aspect, a method includes receiving from a first user a first media object and a first query relating to content in the first media object, presenting the first media object and the first query to multiple second users, receiving a suggested answer to the first query from each of two or more second users of the multiple second users, where at least two of the suggested answers are distinct, ranking the suggested answers, and presenting one or more of the ranked suggested answers to the first user.10-01-2009
20090248660BUNDLING OF QUERY-RELATED CONTEXT FOR SPONSORED SEARCH - A sponsored search auction system is configured to receive bids for queries from advertisers. Each bid on a particular query relates to a particular context, such as an age, sex, or location of a user that may submit the query to a search engine. A valuation is provided for each available context of the query by each advertiser. The bids are processed to generate one or more context bundles. The context bundles are groupings of contexts. Not necessarily all contexts are bundled. The bundled and unbundled (if present) contexts may be sold to the advertisers as bundled. Selling of contexts in bundled form may enable increased revenue to be generated as compared to auction systems that sell each context separately or sell contexts bundled into a single group.10-01-2009
20090248658USING EMBEDDED METADATA TO IMPROVE SEARCH RESULT PRESENTATION - The present invention is directed towards systems and methods for using metadata to improve search result presentation. The method according to one embodiment of the present invention comprises receiving a search query from a user and parsing the search query and retrieving a ranked list of search results. The method then extracts metadata from the search results and casts the extracted metadata into an object model. A template is then applied to the cast extracted metadata and a search results page is generated comprising the ranked list of search results and the templated metadata.10-01-2009
20090248659SYSTEM AND METHOD FOR MAINTENANCE OF QUESTIONS AND ANSWERS THROUGH COLLABORATIVE AND COMMUNITY EDITING - Systems, methods, and computer program products are disclosed for asking and searching for the answer to given questions, retrieving answers to such questions, as well as presenting such answers in a user-generated content style framework on a search engine. The system of the present invention comprises a question processor, operative to determine whether a question entered by a user has been previously answered, an answer data store storing answers to previously asked questions stored therein, an answer repository storing questions not yet answered by another user, and an editor tool operative to format an answer stored in the answer data store into user-generated content style and migrate the answer to a user-generated content style web page.10-01-2009
20090248655Method and Apparatus for Providing Sponsored Search Ads for an Esoteric Web Search Query - A method and apparatus are included for providing sponsored search ads for an esoteric Web search query. In one example, the method includes receiving search results and a request for placement of sponsored ads onto a search results page, retrieving bidder term vectors associated with every search result URL of the search results, calculating for every bidder term a weight associated with a given search result set, sorting bidder terms by their weight that is associated with a given search result set, and returning sponsored search ads for bidder terms with higher weights.10-01-2009
20090240690Systems and methods for historical information management - A networked computer system is provided for collecting and displaying historical content comprising a plurality of digital objects associated with a historical period or event. The computer system is comprised of one or more networked servers for processing the historical content. The servers are configured to access a database of host historical content input by an operator and user historical content input by a user, and display the historical content. One or more software applications running on the servers facilitate collection, integration and display of historical content. The software applications provide a template accessible to a user via a computer in communication with the network. The template is configured to allow the user to input the user historical content and relate a portion of the host historical content to create a user website integrating the user historical content with a portion of the host historical content.09-24-2009
20090240677Personalizing Sponsored Search Advertising Layout using User Behavior History - Embodiments of the invention relate to methods of presenting personalized search results pages to users, and to search engine systems and servers configured to implement such methods. For example, a method of presenting such a page to a user of a search engine includes steps of computing an engagement index of the user based on the distribution in time of that user's interactions with the search engine then presenting, in response to a query by the user, a personalized search results page to the user.09-24-2009
20090150376Mutual-Rank Similarity-Space for Navigating, Visualising and Clustering in Image Databases - A method of representing a group of data items comprises, for each of a plurality of data items in the group, determining the similarity between said data item and each of a plurality of other data items in the group, assigning a rank to each pair on the basis of similarity, wherein the ranked similarity values for each of said plurality of data items are associated to reflect the overall relative similarities of data items in the group.06-11-2009
20090150374SYSTEM, METHOD AND PROGRAM PRODUCT FOR DETECTING SQL QUERIES INJECTED INTO DATA FIELDS OF REQUESTS MADE TO APPLICATIONS - System, method and program product for detecting a malicious SQL query in a parameter value field of a request. The parameter value field is searched for query operands, characters and/or symbols and combinations of query operands, characters and/or symbols indicative of malicious SQL injection. A respective score assigned to each of the query operands, characters and/or symbols or combinations of query operands, characters and/or symbols found in the parameter value field is added to yield a total score for at least two of the query operands, characters and/or symbols or combinations of query operands, characters and/or symbols found in the parameter value field. Responsive to the total score exceeding a threshold, the request is blocked.06-11-2009
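The scoring scheme in the abstract above (assign a score to each suspicious token found in a parameter value, add the scores, block when the total exceeds a threshold) can be sketched roughly as follows. This is a minimal illustration, not the patented method: the pattern list, the weights, the threshold value, and the names `SUSPICIOUS_PATTERNS`, `BLOCK_THRESHOLD`, `injection_score`, and `should_block` are all assumptions made for the example.

```python
import re

# Hypothetical patterns and weights; the abstract does not publish any values.
SUSPICIOUS_PATTERNS = {
    r"\bunion\b": 3,
    r"\bselect\b": 2,
    r"\bor\b": 1,
    r"--": 2,
    r"'": 1,
    r";": 1,
}

# Assumed threshold; a real deployment would tune this.
BLOCK_THRESHOLD = 3


def injection_score(value: str) -> int:
    """Sum the weights of every suspicious pattern found in the value."""
    lowered = value.lower()
    return sum(
        weight
        for pattern, weight in SUSPICIOUS_PATTERNS.items()
        if re.search(pattern, lowered)
    )


def should_block(value: str) -> bool:
    """Block the request when the total score exceeds the threshold."""
    return injection_score(value) > BLOCK_THRESHOLD
```

With these assumed weights, an ordinary value containing a single apostrophe stays below the threshold, while a value that combines a quote, a boolean operator, and a comment marker is blocked.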
20090150371METHODS AND APPARATUS FOR COMPUTING GRAPH SIMILARITY VIA SIGNATURE SIMILARITY - This disclosure describes systems and methods for identifying and correcting anomalies in web graphs. A web graph is transformed into a set of weighted features. The set of weighted features are then transformed into a signature via a SimHash algorithm. The signature is compared to the signature of one or more other web graphs in order to determine similarity between web graphs. Actions are then carried out to remove anomalous web graphs and modify parameters governing web mapping in order to decrease the likelihood of future anomalous web graphs being built.06-11-2009
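The signature step in the abstract above (weighted features reduced to a SimHash signature, then signatures compared) can be sketched as follows. This is a generic SimHash under stated assumptions, not the implementation in the application: the use of SHA-256 as the per-feature hash, the 64-bit width, and the names `simhash` and `similarity` are choices made for the example.

```python
import hashlib


def simhash(weighted_features: dict[str, float], bits: int = 64) -> int:
    """Fold weighted features into a single bits-wide signature."""
    totals = [0.0] * bits
    for feature, weight in weighted_features.items():
        # Assumed per-feature hash: the leading bytes of SHA-256.
        digest = hashlib.sha256(feature.encode("utf-8")).digest()
        hashed = int.from_bytes(digest[: bits // 8], "big")
        for i in range(bits):
            # A set bit votes +weight for position i, a clear bit votes -weight.
            totals[i] += weight if (hashed >> i) & 1 else -weight
    # The signature keeps a 1 wherever the weighted vote is positive.
    return sum(1 << i for i in range(bits) if totals[i] > 0)


def similarity(sig_a: int, sig_b: int, bits: int = 64) -> float:
    """Fraction of bit positions on which two signatures agree."""
    return 1.0 - bin(sig_a ^ sig_b).count("1") / bits
```

Two graphs with identical weighted features produce identical signatures, so their similarity is 1.0. Graphs that share most of their heavily weighted features tend to differ in only a few bit positions.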
20090259654INFORMATION PROCESSING APPARATUS, CONTROL METHOD THEREOF, AND STORAGE MEDIUM - An information processing apparatus includes: a read unit adapted to read first content information in which a characteristic of first content is written; an extraction unit adapted to extract rule information in which a characteristic that is common among one or more items of second content is written by analyzing second content information in which characteristics of the second content are written; and an update unit adapted to update the first content information based on the first content information and the rule information.10-15-2009
20090259652INFORMATION SEARCHING APPARATUS, INFORMATION SEARCHING METHOD, AND COMPUTER PRODUCT - An information searching apparatus retrieves a sub graph matching an inquiry graph from a graph to be searched. The apparatus includes an extracting unit that extracts, from among clusters of nodes in the graph to be searched, plural cluster pairs that each include a first cluster and a second cluster including a node linked by a link to a node in the first cluster and a calculating unit that calculates a bonding strength for each of the cluster pairs. The apparatus further includes a determining unit that determines, among the cluster pairs and based on the bonding strength of each of the cluster pairs, a cluster pair to be merged; a merging unit that merges the cluster pair; and a searching unit that searches the merged clusters for a sub graph matching the inquiry graph. An output unit outputs a search result of the searching unit.10-15-2009
20090259650SYSTEM AND METHOD FOR IDENTIFICATION OF NEAR DUPLICATE USER-GENERATED CONTENT - A computer-implemented system and method for identification of near duplicate user-generated content in a networked system are disclosed. The apparatus in an example embodiment includes a data receiver to receive a first instance of user-generated content; a tokenizer to tokenize the first instance into a set of words, create a set of portions from the tokenized first instance, and assign weight to each portion of the set of portions; a magnitude calculator to calculate a magnitude for the first instance based on the weight of each portion; a resemblance score calculator to search a data store for a second instance with at least one portion in common with the first instance and calculate a resemblance score between the first instance and the second instance; and an account linker to link accounts associated with each of the first instance and the second instance.10-15-2009
20080319984SYSTEM AND METHOD FOR REMOTELY GATHERING INFORMATION OVER A COMPUTER NETWORK - A method of finding relevant content on one or more target storage devices includes the step of receiving an instruction from an instruction queue specifying a content description to be searched for on the target digital storage medium. Content on the target digital storage medium specified by the content description is then searched for. A report of the search is created and transferred to one or more predetermined users after the report is created. A discovery hold is implemented by preserving content found in the search.12-25-2008
20080319981KNOWLEDGE PORTAL FOR ACCESSING, ANALYZING AND STANDARDIZING DATA - A method and system is provided to access one or more historical incident databases, for example, CDC, CPSC, DTI, AAPCC and the like, for standardizing the potentially differing categories and coding among the databases. The standardizing includes recoding of the categories by providing a unified set of categories reflective of similar categories found among the one or more databases, if any. Submission of search queries allows users to obtain unified data across the databases so that incident history statistics for one or more products tracked by commonly available databases may be easily acquired. The resulting reports and statistics may be used by various entities to understand historical incidents from multiple perspectives including, for example, injury and fatality statistics as a function of age group, type of injury, time periods, diagnosis, injury outcome, severity, and the like. Data may be presented in standardized formats or in any of the native database formats.12-25-2008
20080319980METHODS AND SYSTEM FOR INTELLIGENT NAVIGATION AND CACHING FOR LINKED ENVIRONMENTS - A real-time, content-based document navigation and caching tool for use within linked-document environments. The system includes a combination of rooted spidering, ranking and displaying to provide the user with a capability to perform intelligent navigation of the document collection. The system allows users to intelligently browse their local information environments, finding the documents that are most relevant to their information needs but also respecting the locality of the user's current location within the hyperlinked environment.12-25-2008
20080319975Exploratory Search Technique - A technique for the creation of synthesized results from multi-query searches to provide more relevant information to the user in a more useful format and to discard or reduce in relevancy information that is not so useful. It allows a user to define the boundaries of the exploratory search before it starts or retroactively define which queries belong to the search. It can imply which queries belong to the search based on parameters in the queries or results. It also provides mechanisms for supporting exploratory searches including: saving/restoring search context; search-specific query history; a “keepers” bin for storing useful results; elimination of redundant results; re-ranking of common search results; integration of searching with navigation; pivoting on search results; collaboration among multiple searchers; user-generated content; generation of hypotheses; re-executing queries and executing standing queries; multi-monitor searching and automatic preparation of search summaries. User interfaces for conducting multi-query searches are also provided.12-25-2008
20090177646Plug-In for Health Monitoring System - A monitoring and management system may use a plugin mechanism to add or update an interface to a managed service or device. The plugin may have capability to interface with the managed service or device, as well as an interface to a status database that may be populated by the managed service or device as well as other services or devices. The plugin may have rules that may be used to determine a status for the monitored service or device based on the statuses of several services or devices, and may also have rules that define a multi level query into the database to determine those services and devices.07-09-2009
20090254549METHODS AND APPARATUSES FOR SEARCHING CONTENT - Embodiments of methods and apparatuses for searching contents, including structured search are described herein. Embodiments of the present invention use tree structures (or more generally, graph structures), layout structures, and/or content category information to capture within search results relevant content that would otherwise be missed, to reduce the incidence of false positives within search results, and to improve the accuracy of rankings within search results. Embodiments of the present invention further use tree structures (or more generally, graph structures), layout structures, and/or content category information to extend search results to include sub-document constituents. Embodiments of the present invention also support the use of distribution properties as criteria for ranking search results. And embodiments of the present invention support search based on structural proximity, search expressions with recursively embedded operators, predicates, and/or quantifiers, and applications to selection of advertisements.10-08-2009
20090144259USING REPUTATION MEASURES TO IMPROVE SEARCH RELEVANCE - A system and method for determining relevancy for dynamic data sets is disclosed. A specific embodiment for use in an internet marketplace is presented wherein the relevancy for a descriptive factor associated with an item is increased when a user selects that item. To prevent abuse of the relevancy determination system, various embodiments incorporate abuse prevention measures. In one embodiment, a user's selection of the user's own items does not affect the relevancy system. In one embodiment, only a first selection of a particular item by a user will affect the relevancy system and any additional selections of that item will have no effect. In another embodiment, the size of the changes made to the relevancy system due to the selections of a particular user is correlated to that user's reputation score.06-04-2009
20090254544RANKING ITEMS - A method of ranking items includes displaying a set of categories. Each category has a set of weights for a user to choose. Each item is associated with the set of categories. The method also includes displaying a search result based on the weights chosen by the user. The search result includes a ranking of the items.10-08-2009
20090254546PERSONALIZED SCREENING OF CONTEXTUALLY RELEVANT CONTENT - A system and method store and locate objects within a business organization. Objects created by individuals are stored in association with a context for which they are created. Security can be used to prevent unauthorized individuals from accessing objects without permission.10-08-2009
20090254545Method and System for Scoring Domain Names - Methods and systems for scoring domain names are provided. A domain name may be scored based on a set of criteria, and a sub-score assigned to each criteria. The sub-scores may be used to generate a domain name score and identify ways of increasing the score of the domain name. A domain name score may provide an indication of the value or usefulness of the domain name.10-08-2009
20090254543System and method for matching search requests and relevant data - A system and methods for matching between search requests and relevant data (web pages, online documents, essays, online text in general, images, video, footage etc.). The system comprises three components that can work separately or together and can be integrated with other search engine methods in order to further improve the relevancy of search results. The system can find similarity between different documents and measure the distance (in similarity) between documents. The three components are: Context based understanding, comprising putting the documents in the context of aspects of the human knowledge external to the documents, Partial Sentence analysis and 100 percentage points to keyword/tag sets.10-08-2009
20090254540METHOD AND APPARATUS FOR AUTOMATED TAG GENERATION FOR DIGITAL CONTENT - A method and apparatus for automatically generating tags for digital content are provided. The method is adapted to be run on a computer, which is an example of the type of apparatus which may generate the tags. The generated tags describe the digital content, and may be used as topics for the content to organize, retrieve, and process the content. The tag generation begins by accessing content from a content collection unit and a tags candidate tag database unit, which are then processed using techniques from computational linguistics in a multi-pass process that generates sets of tags, then refines and normalizes them. Finally, scores are generated and stored along with the tags.10-08-2009
20090254536METHOD AND SYSTEM FOR PROCESSING SEARCH REQUESTS - Methods and system for processing search requests are described. In one embodiment, a term of a search request may be received. A determination of whether the term is a meta-keyword may be made. One or more linguistically transformed keywords associated with the meta-keyword may be obtained. A search may be run on at least one of the one or more linguistically transformed keywords to obtain a result of the search.10-08-2009
20090254534Methods, Systems, and Articles of Manufacture for Managing Search Metadata - Embodiments of the invention are generally related to metadata describing users accessing a network and network content. Each user may have a user profile comprising a list of user tags describing the user. Each item of network content may include a list of content tags describing the item. Each user tag and content tag may have an associated weight value. When a user selects an item of network content, weights of one or more user tags of the user profile and one or more content tags may be adjusted based on the selection. In some embodiments, the tags may be removed based on the weight values so that only tags relevant to the user profile and network content remain.10-08-2009
20090063450APPARATUS AND METHOD FOR SELECTING AN AUTHOR OF MISSING CONTENT IN A CONTENT MANAGEMENT SYSTEM - A content management system (CMS) includes metadata for each element in the repository. When an element has missing content that needs to be created, the repository is queried to identify elements which most closely match the metadata of the missing content. The metadata for these identified elements is analyzed to determine the authors for these elements which most closely match the element that needs to be authored. The authors are then ranked according to an author selection policy that may specify any suitable criteria for ranking authors, including author selection criteria, author ranking criteria, author filtering criteria, and author backup criteria. The result is a ranked list of one or more authors that are deemed the best choices of authors to author the missing content. The user may then request one of the authors in the ranked list to create the missing content.03-05-2009
20090248653Construction and use of a database - A method for constructing a database, comprises permitting a plurality of users to enter individual-associated data bits (IDBs) into a computerized system, each of the IDBs comprising at least one personal identifier relating to the user and relationship data comprising data on one or more related individuals and the nature of relationship; and processing the entered IDBs to generate an individual-identifier data set (IDS), one for each identified individual, being either one of the users or one of the related individuals and construct a database comprising IDSs of identified individuals.10-01-2009
20090259646Method for Calculating Score for Search Query - A method and system for automatically calculating, regarding an input search query, a score for evaluating a new query or URL which is a candidate for recommendation information according to a user's search intention. To this end, a recommendation server 10-15-2009
20090254537IMAGE SEARCH APPARATUS AND IMAGE SEARCH METHOD - An image search apparatus has:10-08-2009
20090259655DATA CREATING APPARATUS AND DATA CREATING METHOD - A data creating apparatus extracts meta data about a topic from a document, the meta data including at least one linguistic expression about a behavior, a plurality of the linguistic expressions having a first modification relation. The data creating apparatus converts the linguistic expressions included in the behavioral meta data into each class, based on a behavior ontology that is expressed by a graph where the linguistic expression about a behavior is an instance and a concept of the instance is a class to create behavior map data that represents each of the classes converted and also representing a second modification relation among the classes as a link.10-15-2009
20090259656DATA SEARCH DEVICE, DATA SEARCH METHOD, AND RECORDING MEDIUM - A data search device to extract relevant data matching a specified requirement from multiple pieces of data to be searched stored in a database. The data search device includes a specified requirement data acquisition unit to acquire specified requirement data including the specified requirement, a data extraction unit to extract the relevant data based on the specified requirement data, an extracted data counter to count a number of pieces of the relevant data for each piece of classification data provided for the data to be searched, a display data generation unit to generate data to display the number of pieces of the relevant data counted for each piece of classification data on a coordinate space based on the classification data, and a positional data storage unit to store positional data including coordinates for specifying a position in the coordinate space and the classification data associated with the coordinates.10-15-2009
20090259653INFORMATION PROCESSING APPARATUS, METHOD, AND PROGRAM - An information processing apparatus searches search target frame images in video data to find a frame image matching a search query frame image. An extractor extracts characteristic quantities expressing the characteristics of respective images. A reliability judge judges the reliability of the values of each characteristic quantity in a characteristic quantity group extracted from the search query frame. If certain characteristic quantity values in the search query frame are judged to be of low reliability, the converter converts those values into predetermined values. Values are similarly converted for all search target frames. A comparer compares the converted characteristic quantities in the search query frame to the converted characteristic quantities in all search target frames. On the basis of the comparisons, a decision unit then chooses a search solution frame that matches the search query frame. In so doing, search processing robustness with respect to data variance is improved.10-15-2009
20090259651SEARCH RESULTS RANKING USING EDITING DISTANCE AND DOCUMENT INFORMATION - Architecture for extracting document information from documents received as search results based on a query string, and computing an edit distance between the data string and the query string. The edit distance is employed in determining relevance of the document as part of result ranking by detecting near-matches of a whole query or part of the query. The edit distance evaluates how close the query string is to a given data stream that includes document information such as TAUC (title, anchor text, URL, clicks) information, etc. The architecture includes the index-time splitting of compound terms in the URL to allow the more effective discovery of query terms. Additionally, index-time filtering of anchor text is utilized to find the top N anchors of one or more of the document results. The TAUC information can be input to a neural network (e.g., 2-layer) to improve relevance metrics for ranking the search results.10-15-2009
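The core measure in the abstract above, an edit distance between the query string and a document's data string, can be sketched as follows. This is a plain Levenshtein distance over word tokens used as the only ranking signal, which is an assumption made for illustration. The abstract also describes TAUC inputs and a neural network, and neither is modelled here. The names `edit_distance` and `rank_by_edit_distance` are hypothetical.

```python
def edit_distance(source, target) -> int:
    """Levenshtein distance between two sequences (strings or token lists)."""
    previous = list(range(len(target) + 1))
    for i, s_item in enumerate(source, start=1):
        current = [i]
        for j, t_item in enumerate(target, start=1):
            cost = 0 if s_item == t_item else 1
            current.append(
                min(
                    previous[j] + 1,         # delete from source
                    current[j - 1] + 1,      # insert into source
                    previous[j - 1] + cost,  # substitute or match
                )
            )
        previous = current
    return previous[-1]


def rank_by_edit_distance(query: str, titles: list[str]) -> list[str]:
    """Order titles by word-level edit distance to the query, closest first."""
    query_tokens = query.lower().split()
    return sorted(
        titles,
        key=lambda title: edit_distance(query_tokens, title.lower().split()),
    )
```

Comparing token lists instead of raw characters lets a title that drops or adds a whole query word count as a near-match at a cost of one edit.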
20090259647FUZZY KEYWORD SEARCHING - A fuzzy, or ambiguous, keyword searching process and systems for implementing the fuzzy keyword searching process are provided. In general, one or more keyword search terms are first identified for a search. Next, a user is enabled to adjust a logical fuzziness, or logical ambiguity, for each of the one or more keyword search terms. As used herein, logical fuzziness of a keyword search term refers to the extent to which associated keywords are considered for the search. In one embodiment, the user may also be enabled to view and adjust keyword associations for each of the keyword search terms. A search is then performed based on the one or more keyword search terms and the logical fuzziness of the one or more keyword search terms, and results of the search are presented to the user.10-15-2009
20090265339METHOD AND SYSTEM FOR FACILITATING RULE-BASED DOCUMENT CONTENT MINING - A system for facilitating rule-based content mining to extract content from structured or unstructured data receives a file that contains structured or unstructured data, or a mixture of both. The system then generates a processable extensible markup language (pXML) file based on the received file. The system further extracts content from the pXML file based on one or more rules and generates a semantic XML file based on a specified format.10-22-2009
20090271396METHOD AND APPARATUS FOR MEDIA CONTENT PROVISION - Disclosed is a method of providing relevant media content to a user, comprising: storing static data relating to the user's personal profile; providing a choice of media items to the user and allowing the user to select at least one media item from the choice for inclusion in a single media entity to be provided to a media device of the user; selecting at least one relevant media item from a set of additional media items in dependence upon at least some of the static data and at least some of any metadata associated with the or each media item selected by the user; concatenating the at least one user-selected media item and the at least one selected relevant media item to form the single media entity; and sending the single media entity to the user's media device.10-29-2009
20090271395MEDIA FILE SEARCHING SYSTEM AND METHOD FOR A MOBILE PHONE - A media file searching system for a mobile phone is disclosed. The system comprises: a capturing module configured for capturing a section of rhythm sung by a user; a character calculating module configured for calculating a characteristic parameter of the section of the rhythm by using a Levinson-Durbin recursion arithmetic; a relevancy calculating module configured for calculating a relevancy of the calculated characteristic parameter with each characteristic parameter of each of the media files stored in the mobile phone by using a relevancy arithmetic; the relevancy calculating module further configured for searching a matched media file whose characteristic parameter has the highest relevancy with the characteristic parameters of the section of the rhythm; and a media player for playing the searched media file. A corresponding method is also disclosed.10-29-2009
20080306942On the Role of Market Economics in Ranking Search Results - The invention presents a method of and service for searching computerized networks, such as the internet, that first performs a search based on a user query to produce results that are ranked. The results comprise references to entities (addresses on the network, such as web sites). Before reporting the results to the user, the invention provides that the search entity contacts the entities listed in the search results to determine whether entities listed in the search results desire to change their rank when compared to other entities listed in the results. If some entities do desire to change their rank, the invention charges fees to entities that increase their rank and credits (pays fees) to entities that decrease their rank. A portion of the amount charged to entities that increase their rank can be paid to the entity performing the search (helping to support the high quality search engines), and a portion will go to the entities that voluntarily decrease their rank within the search results (helping to support high-content web sites).12-11-2008
20080306941SYSTEM FOR AUTOMATICALLY EXTRACTING BY-LINE INFORMATION - A by-line extraction system detects a set of potential headlines from a title meta-tag of a crawled document, selects a candidate headline from the set of potential headlines, and extracts the by-line information from the document using the location of the selected candidate headline. The system constructs the set of potential headlines based on the title meta-tag. The system selects a candidate headline by evaluating the set of potential headlines in order of the lengths of the potential headlines. The system extracts the by-line information from the document by using the location of the selected candidate headline to extract a string representing a date, a name, or a source located within a minimum distance from the location of the potential headline.12-11-2008
20080306940IMAGE DISPLAY DEVICE AND METHOD - An image display device is provided. The device includes a display screen, a storage unit, and a processing unit. The storage unit stores images and selection interface information. The selection interface information includes a plurality of catalogues each of which collects one or more images. The processing unit includes a selection interface display module, a selection control module, and an image search module. The selection interface display module controls the display screen to display a selection interface that provides the catalogues for selection according to the selection interface information. The selection control module generates a selection signal in response to a selection operation on the selection interface. The selection signal indicates which catalogue on the selection interface is selected. The image search module searches the storage unit for the images according to the selection signal. An image display method is also provided.12-11-2008
20080306939USE OF FIXED-WIDTH FIELD ARRAY WITH INVERTED INDEX - A computer based system can comprise an inverted index for a number of documents and a fixed-width field array storing data associated with the documents. The system provides results data using information obtained from the fixed-width field array.12-11-2008
20080306933DISPLAY OF SEARCH-ENGINE RESULTS AND LIST - Displaying a list of search-engine results in the same web-browser window as a viewing frame that is configured to display one of the results is described herein. A user's web search is performed on a search engine, and results are returned to a client computing device. The results are listed in a web-browser window that is configured to simultaneously display any result selected by a user.12-11-2008
20080306938ELECTRONIC PUBLICATION SYSTEM - A system and method for modifying publication data in a publication system are described. An example embodiment includes receiving proposed publication data and accessing a success measurement associated with past publications within a publication system. The success measurement may indicate a measurement of success associated with the past publications. An example system and method may generate modification data to be used to modify the proposed publication data. The modification data may be based on the success measurement and proposed publication data.12-11-2008
20080306934USING LINK STRUCTURE FOR SUGGESTING RELATED QUERIES - An approach is provided for determining related queries for a given search query based on the linking structure of electronic documents within a document set. Document titles are used to represent potential search queries and links between the electronic documents are used to determine relationships between the potential search queries. As such, the document set may be represented as a directed graph in which document titles (which represent potential search queries) are nodes and links are edges between the nodes. When a particular search query is received, a corresponding node is identified and related queries are determined by identifying other nodes having connections with that node.12-11-2008
20080306932SYSTEMS AND METHODS FOR A RATING SYSTEM - An embodiment relates generally to a method of searching. The method includes providing for a knowledgebase item and associating a review for the knowledgebase item. The method also includes associating a rating for the knowledgebase item and developing a ranking associated with the knowledgebase item based on at least one of the review and the rating. The method further includes displaying the knowledgebase item based on the ranking in subsequent searches that include the knowledgebase item.12-11-2008
20080306931Event Weighting Method and System - A system and method to facilitate automatic weighting of events in a network and targeting of advertising information to users within the network based on assigned event weights are described. Multiple events associated with a user are retrieved from a data storage module. Each event is further analyzed to extract one or more event features. A weight parameter value is further calculated for each retrieved event. Each event is further assigned to a predetermined category based on the calculated weight parameter value. Finally, each event and the associated weight parameter value are stored within the data storage module in connection with the predetermined category.12-11-2008
20080306930Automatic Content Organization Based On Content Item Association - An association engine for organizing content items in a logical database is provided. First description data including dimension data for a first identified content item in the database is extracted (S12-11-2008
20090094232Refining A Search Space In Response To User Input - In one embodiment, a search space of a corpus is searched to yield results. The corpus comprises documents associated with keywords, where each document is associated with at least one keyword indicating at least one theme of the document. One or more keywords are determined to be irrelevant keywords. The search space is refined according to the irrelevant keywords.04-09-2009
20090094230RELATED INFORMATION PROVIDING APPARATUS AND PROVIDING METHOD - Related information of a content is provided quickly.04-09-2009
20090094229METHOD AND APPARATUS FOR EXPLOITING 'TRACE' FUNCTION TO SUPPORT DATABASE INTEGRATION - A data processing method manipulates the internal command and communications controls of a database server to redirect certain data, for the purpose of developing a data integration cross reference for equivalent or ostensibly-equivalent fields. At least two databases can be cross referenced wherein the databases have fields that are similar or identical. Alternatively, an extension table can be developed to align a database to a standardized reference, such as a set of XML field labels. A ‘trace’ function of at least one monitored database server accumulates a log of database events and associated data field contents for memory access steps that involve creation or alteration of a field or field value, or that trigger an operation (e.g., create new, edit, delete, trigger event, etc.). The log of events from the monitored server is communicated or made accessible in real time to a middleware interface program, a remote server, or to another process, wherein correlation of the logged data from the monitored server versus reference values is used tentatively to assign field equivalence.04-09-2009
20090094228METHODS FOR CONTROL OF DIGITAL SHREDDING OF MEDIA - According to the disclosure, a unique and novel archiving system that allows the digital shredding of archived data is disclosed. Embodiments of the archiving system include removable disk drives that store data, which may be erased such that the data is considered destroyed but that allows the removable disk drive to be reused. The archiving system can determine which data should be erased. Then, the data is digitally shredded such that the removed data cannot be retrieved or deciphered. In alternative embodiments, a protection may be placed on the data required to be kept because the data is associated with a legal suit. This “legal hold” prevents the data from being digitally shredded. As such, the archiving system can provide a system that can dispose of data on a file-by-file or granular level without physically destroying the media upon which the data is stored.04-09-2009
20090094227Adaptive e-procurement find assistant using algorithmic intelligence and organic knowledge capture - The present invention is a real-time intelligent find-assistant that suggests useful search terms to a buyer shopping an electronic catalog, as they progress in their online journey to find a particular item for potential purchase. In contrast to current approaches, the present invention uses an adaptive algorithm to extract and rank-order possible search terms from candidate vendor catalog pages, based upon a measure of relevance, or utility, derived from the proximity of possible non-generic terms in the vendor catalog to the terms already selected by the user in their current search. Also in contrast to current approaches, the invention captures and stores buyers' entire history of choices over time (even years), and uses this knowledge to make increasingly useful suggestions to a buyer as to what item descriptions to look for, as they progress through their search, thus allowing an organization to leverage expert purchasing behaviors for use by novice buyers.04-09-2009
20090094226Apparatus and methods for performing a rule matching - Apparatus and methods for performing a rule matching are disclosed. In one embodiment, an apparatus for performing a rule matching includes a content matching module and a first rule matching module. The content matching module searches the data stream for contents. The contents are organized into rules including a simple rule with a single content and a complex rule with multiple contents. The first rule matching module is coupled to the content matching module for determining whether the rules are matched by the data stream according to a searching result of the content matching module. To this end, the first rule matching module updates status registers according to the searching result and each status register can indicate whether one of the rules is matched by the data stream.04-09-2009
20090282032TOPIC DISTILLATION VIA SUBSITE RETRIEVAL - A method and system for generating a search result for a query of hierarchically organized documents based on retrieval of subtrees that are key resources for topic distillation is provided. The retrieval system may identify documents relevant to a query using conventional searching techniques. The retrieval system then calculates a subtree feature for subtrees that have an identified document as their root. After the retrieval system calculates the subtree feature for the subtrees, the retrieval system may generate a subtree relevance score for each subtree based on its subtree feature. The retrieval system may then order the identified documents based on their corresponding subtree relevances.11-12-2009
20090055377Collaborative Media Recommendation and Sharing Technique - A media recommendation and sharing technique that employs agents on media players/devices to expand the scope of media sharing scenarios. The technique assists a user in discovering media items, such as, for example, music, recordings, play lists, pictures, video games, on nearby media players or devices (devices which are capable of receiving, storing and playing media) which are interesting to the user. The collaborative media recommendation and sharing technique contemporaneously determines a user's media preferences based on media stored on a pair of media devices and recommends media for potential sharing based on these determined user preferences.02-26-2009
20090292692Information Search Method and Information Processing Apparatus - According to one embodiment, an information processing apparatus includes an information acquisition processing module, a scheduling module and a control module. The information acquisition processing module performs an information acquisition process of acquiring information corresponding to an input keyword via the Internet by transmitting the keyword to a predetermined server apparatus on the Internet via the data communication module. The scheduling module acquires a date and time which is relevant to the input keyword, and records the date and time together with the input keyword. And then, the control module causes the information acquisition processing module to timely re-perform an information acquisition process using the keyword recorded by the scheduling module together with the date and time based on the date and time recorded by the scheduling module.11-26-2009
20090292690Method and System for Automatic Event Administration and Viewing - This is a method and system for automated calendar event creation from unstructured text, with assisted administration and viewing.11-26-2009
20080215576FUSION AND VISUALIZATION FOR MULTIPLE ANOMALY DETECTION SYSTEMS - The present invention is a method for detecting anomalies against normal profiles and for fusing and visualizing the results from multiple anomaly detection systems in a quantifying and unifying user interface. The knowledge patterns discovered from historical data serve as the normal profiles, or baselines or references (hereinafter, called “normal profiles”). The method assesses a piece of information against a collection of the normal profiles and decides how anomalous it is. The normal profiles are calculated from historical data sources, and stored in a collection of mining models. Multiple anomaly detection systems generate a collection of mining models using multiple data sources. When a piece of information is newly observed, the method measures the degree of correlation between the observed information and the normal profiles. The analysis is expressed and visualized through anomaly scores and critical event notifications that are triggered by fusion rules, thus allowing a user to see multiple levels of complexity and detail in a single view.09-04-2008
20080215569Ad Placement Method with Frequency Component - A method of generating a search result list in response to a search request from a client using a computer network is described. The search request is received, and a set of network addresses associated with search terms that match the search request is identified, each network address being associated with a modifiable bid value and a position value initially set to the bid value. The set is ordered according to the respective position values of the matching network addresses, and divided into a display set sent to the client and an excluded set. The position value of the network addresses in the display set is reset to its respective bid amount, and the position values of the network addresses in the excluded set are incremented by an increment factor I determined using the bid amount of the network address having the lowest position value of the display set.09-04-2008
20080215566METHOD FOR USING ONE-DIMENSIONAL DYNAMICS IN ASSESSING THE SIMILARITY OF SETS OF DATA - A method for finding sets of data (SDDs) for presentation in one-dimension, which are similar to a target SDD, is invented. The method leverages a new category of signatures, called equivalence signatures, to characterize the SDDs and is applicable to all types of data with special interpretation for data, such as text, binaries and audio, that may be presented in one-dimension. The equivalence signature is computed as the functional for the kinetic energy of a point particle whose path is specified by the values of the digital data. These signatures have the salient feature that, at worst, they change in a bounded manner when small changes are made to the SDDs and when used to find SDDs that are similar to a target SDD, they allow for a significant reduction in the number of SDDs to be compared with the target. This is an improvement over the state of the art wherein the computationally expensive process of performing a complete search against the entire corpus must be applied.09-04-2008
20080215565SEARCHING HETEROGENEOUS INTERRELATED ENTITIES - Systems and methods for searching heterogeneous interrelated entities for a heterogeneous entities search query are disclosed herein. A user may enter the heterogeneous entities search query. The search retrieves and returns multiple types of heterogeneous entities. The retrieved heterogeneous interrelated entities are searched in a unified matrix that represents relationships between one or more heterogeneous entities. The retrieved heterogeneous interrelated entities may have one or more entity types. The set of retrieved interrelated entities may also be ranked based on the similarity between each entity and the search query. Feedback may also be incorporated into the system to improve search accuracy.09-04-2008
20080215562System and Method for Improved Name Matching Using Regularized Name Forms - A system and method for improved name matching using regularized name forms is presented. A regularization rule engine uses culture-specific regularization rules to iteratively convert candidate names and query names to a canonical form, which are regularized candidate names and regularized query names, respectively. The regularization rules are context-sensitive or context-free rules that pertain to a name's originating culture. Subsequently, a name search engine compares the regularized query name with the regularized candidate names and identifies the regularized candidate names that meet a particular regularization matching threshold. In turn, name search engine selects the candidate names that correspond to the identified regularized candidate names and provides the selected candidate names to a user.09-04-2008
20090171943MODIFYING RELEVANCE RANKING OF SEARCH RESULT ITEMS - Systems, computer-implemented methods, and computer-readable media for modifying the rank of search result items returned by a search engine are provided. A search engine determines a plurality of search result items that satisfy a user query and the order in which the search result items are to be presented to a user. A rank modifier determines whether any modification should be made to the rank of each search result item identified by the search engine. The rank of search result items identified as potential spam may be demoted while the rank of search result items identified to be in the language of the search query, having a high click-through rate, or as containing adjacent search terms from the search query may be promoted. The search result items are presented according to the modified ranking to the querying user.07-02-2009
20090198686Method and System for Indexing Information about Entities with Respect to Hierarchies - Systems and methods for indexing, associating or compositing data records and hierarchies from various information sources are disclosed. Embodiments of the present invention may provide the ability to link data records and thus to link data records to known hierarchies of data records. More specifically, embodiments of the present invention may provide the capability to associate data records in varying information sources and to thereby associate an incoming data record with existing data records or existing data hierarchies such that an incoming data record may not only be associated with an existing data record comprising information about the same entity but may additionally be associated with other members of the data hierarchy in the same manner as the existing data record. In addition to associating an incoming data record with an existing data record and incorporating the incoming data record into an existing data hierarchy, embodiments of the present invention may provide the capability of reconciling an incoming data hierarchy to which an incoming data record belongs with an existing data hierarchy such that the two data hierarchies may be composited.08-06-2009
20090198685ANNOTATION SYSTEM FOR CREATING AND RETRIEVING MEDIA AND METHODS RELATING TO SAME - The invention described herein is generally directed to a method and apparatus for creating and retrieving audio data. In one implementation the invention comprises an annotation system configured to record, store, and retrieve media. The annotation system contains a set of client-processing devices configured to capture media for subsequent playback. Each client-processing device typically contains a record button to initiate the capture and is configured upon performing the capture operation to trigger an association of a unique ID with the media. The client-processing devices are further configured to upload the media and a unique ID to a server for purposes of storage. The server obtains the media and unique ID for subsequent retrieval and provides the media and the unique ID to at least one client-processing device from the set of client processing devices.08-06-2009
20090077072USER ENTERTAINMENT AND ENGAGEMENT ENHANCEMENTS TO SEARCH SYSTEM - According to one aspect of the present invention, a method of actively engaging a user of a search system can include receiving from the user a search query for a search of a corpus of information and providing the user with search results for the search. The user can also be prompted to participate in a search-related activity wherein at least one aspect of the search-related activity is dependent on a context of the search. User input for performing the search-related activity can be accepted and an activity response can be provided to the user.03-19-2009
20090077069Calculating Valence Of Expressions Within Documents For Searching A Document Index - Tools and techniques related to calculating valence of expressions within documents. These tools may provide methods that include receiving input documents for processing, and extracting expressions from the documents for valence analysis, with scope relationships occurring between terms contained in the expressions. The methods may calculate valences of the expressions, based on the scope relationships between terms in the expressions.03-19-2009
20090077068CONTENT AND QUALITY ASSESSMENT METHOD AND APPARATUS FOR QUALITY SEARCHING - A computer-based process retrieves information organized in documents containing text and/or coded representations of text. The process involves obtaining and labeling a selected set of documents based on content quality, and extracting and representing features from each document in the selected set. The extracted and selected features are modified, and models are constructed using parametric learning algorithms. The constructed models are capable of assigning a label to each document. The model parameters are instantiated using a first subset of the selected set of documents. Parameters are chosen by validating the corresponding model against at least a second subset of the full document set. The constructed models also are capable of assigning labels to similar documents outside a selected subset not previously given to the process of model construction.03-19-2009
20090077067Information processing apparatus, method, and program - An information processing apparatus which may include acquisition means for acquiring meta-data of a content; morphological analyzing means for performing morphological analysis on text information included in the meta-data of the content; comparison means for comparing a morphological analysis result of the morphological analyzing means and a plurality of list patterns of predetermined performer names; and when there is a list pattern of predetermined performer names having matched at least one part or more out of the morphological analysis result on the basis of the comparison result of the comparison means, first extraction means for extracting a performer name with the list pattern of the matched predetermined performer name.03-19-2009
20090077065Method and system for information searching based on user interest awareness - A method and system are provided for information searching based on user interest awareness. Information that represents user interest is obtained. One or more key terms are obtained from the user interest information. Then, a given query is enhanced based on one or more of the key terms for generating an enhanced query for searching.03-19-2009
20090077064Methods, systems, and products for recommending social communities - Methods, devices, and products are disclosed for recommending a social community. A “social community” may be any individual(s), clubs, and/or organizations that have expressed some affinity to terms or subject matter, such as a media identifier. The media identifier identifies some media that is scheduled for recording. A community database is queried for the media identifier, and the community database associates social communities to media identifiers. The social community associated with the media identifier is retrieved. The social community is then sent to a user.03-19-2009
20090077063Dynamic member match-making system and method thereof - A dynamic member match-making system and a method thereof for solving the problem that members cannot make friends with each other accurately are provided. The dynamic member match-making system collects the members' dynamic data in a time period through a dynamic information module, and then matches the members through a preset weight and selects an appropriate member to make friends, so that the members may make friends with each other more accurately and the friend-making quality is enhanced.03-19-2009
20090077062System and Method of a Knowledge Management and Networking Environment - Systems and methods of a knowledge management networking are disclosed here. In one aspect, embodiments of the present disclosure include a method, which may be implemented on a system, of hosting a web-space having a plurality of objects, the plurality of objects to include one or more of, representations of a set of users, a set of web-items, and a set of nets; wherein a net of the set of nets is a subset of the web-space comprising a sub-plurality of the plurality of objects. One embodiment can include, tracking an explicit relationship between a first set of at least two objects of the set of objects; the explicit relationship to be pre-determined by a user of the set of users, identifying an implicit relationship between a second set of at least two objects of the set of objects; the implicit relationship to be identified based on a semantic relationship between the at least two objects, and determining a default set of privacy rules governing access between the at least two objects based on one or more of the identified explicit relationship and the implicit relationship.03-19-2009
20090077059METHOD AND APPARATUS FOR LINKAGE OF QUANTITATIVE AND QUALITATIVE TEXTUAL, AUDIO, VISUAL AND OTHER INFORMATION SEARCHES TO METRIC DISPLAYS - The improved metric display displays links to qualitative information, and indicates if the qualitative information may affect the current or future value of a quantitative metric. The improved metric display displays a quantitative metric, one or more links to ranked qualitative information related to the metric by key words or phrases, and an indicator of the qualitative information's potential effect on the current or future value of the quantitative metric.03-19-2009
20090077058FAST LOCAL RECOMMENDER QUERIES VIA MODIFIED SPATIAL DATA STRUCTURE QUERYING - One embodiment of the present invention provides a system that can recommend leisure activities to a user. During operation, the system receives one or more activity types. Next it receives a bound in terms of a nearness metric such as travel distance, travel time, or travel cost. Next, it receives location information associated with a computing device of the user. The system then uses the location information to identify a cell stored in a spatial database. The system then returns a set of leisure activities that match the activity types and that are within the bound relative to the cell. The spatial database includes leisure activity data that is segmented based on physical position such as latitude and longitude. Moreover, the cells of the spatial database are linked based on the nearness metric.03-19-2009
20090077055Personalized Plant Asset Data Representation and Search System - A process control system uses an asset data and search expert to collect data, or status information, pertaining to assets of a process plant from various sources or functional areas of the plant. The collected information may then be accessed by a user through a user interface routine displaying a graphical user interface to that user's computer. The user may browse through status information on various assets, identifying them by device, unit, process, area, alert status, health, performance, or other data types. The asset data and search expert tracks user interaction with such plant data by, for example, tracking the types of search fields a user most frequently searches with or the type of information a user more frequently browses for. The expert automatically profiles this tracked information to develop user preferences that are later used in personalizing the reporting of asset data, personalizing searching for asset data, and personalizing the results of such searches. The expert may also automatically identify asset data that correlates with other asset data to present correlated asset data when the primary asset data is selected for viewing.03-19-2009
20090077053Method For Searching For, Recognizing And Locating A Term In Ink, And A Corresponding Device, Program And Language - Method for searching for at least one term, consisting of at least one character, in at least one set (03-19-2009
20090171932SYSTEM AND METHOD FOR ANNOTATION AND RANKING OF REVIEWS PERSONALIZED TO PRIOR USER EXPERIENCE - The present invention is directed towards methods and computer readable media for annotating and ranking user reviews on social review systems with inferred analytics. A reference framework is provided by creating context according to previous activity, bias, or background information of a given reviewer. The method of the present invention comprises receiving a first query identifying a given content item, generating a collection of content items based on one or more identical objective attributes associated with the given content item, identifying one or more subjective attributes associated with a given item in the collection of items, and providing a reference framework to interpret the subjective attributes associated with each item in the collection.07-02-2009
20090043758INFORMATION PROCESSING APPARATUS, INFORMATION PROCESSING METHOD, AND INFORMATION PROCESSING PROGRAM - In an information processing apparatus adapted to determine information, to be presented to a user, as to one or more contents, a calculation unit calculates the similarity between a first search axis, produced on the basis of information associated with the user, for use as a reference on the basis of which to present contents and a second search axis, produced on the basis of information associated with one of other users, for use as a reference on the basis of which to present contents, and a display control unit controls displaying of information associated with contents with reference to first and second search axes detected as being similar to each other.02-12-2009
20090043754SYSTEMS AND METHODS FOR PROVIDING ENHANCED CONTENT PORTABILITY IN A WORD PAGE MODULE - A computer implemented method for porting a visual object from a word page to another website is disclosed. The other website is accessible to the word page via Internet. The method includes enabling user selection of the visual object for transferring to the other website, and enabling user identification of the other website. The method also includes determining a communication interface of the other website. The communication interface defines one or more of acceptable content format, data types, size, and metadata. The method further includes transferring the visual object to the other website via the communication interface upon receiving instructions to port the selected visual object.02-12-2009
20090043750Query Optimization in a Parallel Computer System with Multiple Networks - An apparatus and method for a database query optimizer to optimize a query that uses multiple networks. The database query optimizer optimizes a query that uses multiple networks to satisfy the query by splitting the query execution to use multiple networks. Thus, the query optimizer rewrites or optimizes a query to execute on multiple nodes or networks to more efficiently execute the query and reduce network traffic on a network. The query optimizer uses plan cache statistics to determine whether to use multiple networks to optimize the query.02-12-2009
20090292697METHOD AND SYSTEM FOR LEXICAL MAPPING BETWEEN DOCUMENT SETS HAVING A COMMON TOPIC - Terms (e.g., words) used in an expert domain that correspond to terms in a naïve domain are detected when there are no vocabulary pairs or document pairs available for the expert and naive domains. Documents known to be descriptions of identical topics and written in the expert and naive domains are collected by searching the Internet. The frequencies of terms that occur in these documents are counted. The counts are used to calculate correspondences between the vocabularies of the expert and naive language expressions.11-26-2009
20100057719System And Method For Generating Training Data For Function Approximation Of An Unknown Process Such As A Search Engine Ranking Algorithm - A system and method for generating training data for a machine learning system. A training data generator server sends at least one keyword to a search engine. The training data generator server receives at least a first and a second page from the search engine in response to the keyword, the first page having a first rank, the second page having a second rank, the first and second rank being based on the keyword. The training data generator server assigns a first label to the first page based on the first rank; and assigns a second label to the second page based on the second rank. The first page, second page, first label and second label are forwarded to a machine learning server.03-04-2010
20100042611Location-based search mash-up engine, web site, and application programming interface - This invention is a location-based search mash-up engine, web site, and application programming interface (API) for use by web browsers or programs running on wireless mobile devices with web or internet access and location finding capability such as GPS and cell tower or Wi-Fi hotspot triangulation. The invention also provides the capability of presenting sponsored listings or coupons with the search results. The logic to choose these sponsored business listings or coupons will be based on keywords associated with a specific location, defined by latitude & longitude and surrounding radius defined by meters or fractions of a mile (¼ mile, ½ mile, etc.) or kilometer.02-18-2010
20090259657NETWORK PEER-TO-PEER GOODS AND SERVICES DELIVERY SYSTEM AND METHOD FOR RANKING PEERS BY DEGREES OF ASSOCIATION - Methods and systems for providing a list of peers that satisfy a query are disclosed. A plurality of peer profiles are obtained from a data repository, each having a degree of association with the query originating peer. A first and second group of peers, respectively having a first and second degree of association with the query originating peer are determined. A third group of peers that satisfy the query are determined. A fourth group of peers is determined from the third group based on an assigned score to each of the peers in the first and second group, the assigned score reflecting a degree of association between the query originating peer and each of the peers in the first and second group. Results are ranked based on the assigned score.10-15-2009
20080294630QUERY STATISTICS PROVIDER - A system to provide search query information. The system receives a request for search query information, identifies a set of search queries from a search query log that includes search queries submitted to a search service over a predetermined length of time, and provides the set of search queries. Each of the set of search queries is associated with at least a predetermined number of unique identifiers. Each of the set of search queries is matched to the request for search query information by a combination of exact matches, expanded matches, and broad matches.11-27-2008
20090282034METHODS TO CREATE A USER PROFILE AND TO SPECIFY A SUGGESTION FOR A NEXT SELECTION OF A USER - A user profile and/or the suggestions computed based thereon are obtained taking a special set of user features into account. The user features are defined to represent a typical general behaviour of an individual user in respect to the application where the user profile is used. In other words, for each application where a user profile is used a special set of user features are defined which are able to represent a typical general behaviour of an individual user. Based on these user features the weights in the list of word-weight pairs or weighted keywords which represents the user profile are computed or influenced during the creation of the user profile, and/or a multi-user profile is split during the creation of an individual user profile from a multi-user profile, and/or during specification of a suggestion a user history which is used to create the user profile, and/or the user profile, and/or the suggestion results are filtered.11-12-2009
20090282027Distributional Similarity Based Method and System for Determining Topical Relatedness of Domain Names - Systems, computer software and methods for calculating relatedness scores of domain names, which are indicative of relatedness of pairs of domain names requested by clients are described. The method includes receiving DNS traffic data, where the DNS traffic data includes at least domain names requested by the clients and identities of the clients requesting the domain names; generating, based on the identities of the clients, vectors including the requested domain names, where entries in the vectors correspond to client sessions in which the client has requested the domain names; reducing a dimensionality of the vectors by applying a dimensionality reduction method for generating reduced vectors; applying a similarity metric to the reduced vectors to calculate the relatedness scores; and storing the relatedness scores of the domain names.11-12-2009
20090327284INFORMATION SEARCH APPARATUS, AND INFORMATION SEARCH METHOD, AND COMPUTER PRODUCT - A computer-readable recording medium stores therein an information search program that causes a computer to search for text items described in a text file. The information search program causes the computer to execute receiving input of a search keyword; searching an index file for a writing keyword that includes the search keyword, the index file including writing keywords described, for respective entries, in an order identical to the order in which the text items are described in the text file; identifying an entry that corresponds to the writing keyword retrieved at the searching; and outputting the identified entry.12-31-2009
20090254535SEARCH ENGINE TO IMPROVE PRODUCT RECALL TRACEABILITY ACTIVITIES - The present invention provides an improved method of handling product recall activities through traceability. One embodiment of the present invention involves gathering data in a multi-layered database architecture containing supply, process, test, and customer layers. The data is supplied to a traceability module which contains a search engine to link and access the data. The search engine enables a search by part number, lot number, serial number, time stamp, and date/time frame. A traceability analysis is generated from the search, allowing failure analysis of the data. This analysis is performed through an event list over the entire supply, manufacturing, and customer data, facilitating backward and forward traceability of parts and components of the parts. This failure analysis further facilitates automatic response such as automatic warning and automatic recall to manufacturers and customers.10-08-2009
20090287675Extending OLAP Navigation Employing Analytic Workflows - Analytic workflows for performing data analysis and other related operations are stored in an analytic workflow library and provided to a user upon selection of data from a data store. A workflow manager may rank the workflows based on a number of ranking algorithms prior to presentation. User selected workflows are executed in conjunction with relevant external applications and the analysis result provided to the user through the user's client application used to select the data. Workflows and associated interfaces may be received from a variety of sources and integrated into the workflow framework for enhancing data analysis.11-19-2009
20090327285SEMANTIC RECONSTRUCTION - Determining a semantic relationship is disclosed. Source content is received. Cluster analysis is performed at least in part by using at least a portion of the source content. At least a portion of a result of the cluster analysis is used to determine the semantic relationship between two or more content elements comprising the source content.12-31-2009
20090327286METHODS AND SYSTEMS FOR IMPROVING A SEARCH RANKING USING LOCATION AWARENESS - Systems and methods improve search rankings for a search query by using location data associated with queries and documents related to the search query. In one aspect, a search query is received, a location score is determined, a topical score is determined, and an ordering of documents related to the search query is determined based, at least in part, on the location score and the topical score.12-31-2009
20090327272Method and System for Searching Multiple Data Types - A method may include receiving data. A format of the data may be detected. Search parameters may be extracted from the data based on the format of the data. The search parameters may be extracted in a manner dependent upon the detected format of the data. The search parameters may be compared to index parameters. Search results based on the comparison of the search parameters to the index parameters may be output.12-31-2009
20090327259AUTOMATIC CONCEPT CLUSTERING - A method of identifying thematic groups of nodes by analysis of a corpus of documents. The method uses a distance metric based on connectedness of nodes, which is derived from a co-occurrence measure. The invention is also embodied as a computer-implemented visualization tool that generates a display of nodes and thematic groupings. The invention is useful for ‘data mining’ a large corpus of documents, particularly textual documents, to extract relevant information.12-31-2009
20090259648AUTOMATED AVATAR CREATION AND INTERACTION IN A VIRTUAL WORLD - Automated avatar creation and interaction in a virtual world may include detecting if a user's avatar has entered a predefined proximity area in the virtual world. An automated avatar may be presented in response to the user's avatar entering the predetermined proximity area. The automated avatar is presented for autonomous interaction with the user's avatar and presentation of the automated avatar is based on specified criteria.10-15-2009
20090204595Method and apparatus for tracking a change in a collection of web documents - A method and an apparatus for tracking changes in a collection of web documents, for example, provided by a web site. The web documents are retrieved at a first assigned point in time and a second assigned point in time. Then a similarity measure for a combination of a retrieved web document at a first assigned point in time and a retrieved web document at a second assigned point in time is calculated for determining pairs of corresponding web documents. By comparing said calculated similarity measure of a pair of corresponding web documents with predetermined thresholds for the similarity measure a change in the content of the corresponding web document between the first assigned point in time and second assigned point in time is detected. Instead of referring to identifiers like URLs for web pages the content similarities of web pages are considered. The proposed strategy facilitates the work of marketing analysts.08-13-2009
20090248663ONLINE TARGET LOCATION DETECTION - Documents are provided that are geographically relevant to a user or a request. Information describing the online activity of a user is received and location identifiers are obtained. The plurality of location identifiers provide geographic location information defining the geographic intent of the information describing the online activity of the user, the location of the computing device utilized by the user, or the user's registered geographic location. Sets of predefined rules are then applied to the received information and the plurality of location identifiers to select at least one geographic location. Documents are then returned to the user that are geographically relevant based on the selected geographic location. The received information may include search queries, and the documents may include search results and/or advertisements.10-01-2009
20090248675METHOD AND SYSTEM FOR SUPPORTING DOCUMENT EVALUATION - A document evaluation support system narrows the search for related terms in a document, evaluates a search result, provides information of the evaluation, and further searches the search result for a related paragraph in the document so as to support evaluation and determination of the document. The system includes a document division section, a specified term search section, and a search result evaluation section. The sections further include a document attribute database, a document division determination rule database, a document division determination unit, a document division input unit, a divided document (paragraph) with heading database, a keyword database, a numeric database, a search method input unit, a search condition database, a specified result search unit, a search result database, a search result display unit, a weight database, a search result evaluation unit, an evaluation result database, and an evaluation result display unit.10-01-2009
20090248657WEB SEARCHING - Mislabeled URLs are identified and corrected based upon a click relevance ranking computed from user data comprising user click information. The click relevance ranking is formed by applying a set of relevance ordering rules to user log data aggregated by query and URL and by mapping the results of the relevance ordering rules into a linear ordering. For a given query, the aggregated user log data comprises a relative total number of impressions, a relative total number of clicks received and a rank associated with the query/URL pair at the time of the total number of impressions and total number of clicks received. The click relevance ranking is used to identify and correct mislabeled query/URL pairs of other rankings according to a number of disclosed methods.10-01-2009
20090240685APPARATUS AND METHOD FOR DISPLAYING SEARCH RESULTS USING TABS - A graphical user interface includes tabs representative of different classes of search results. The tabs are derived in response to the processing of a query. The different classes of search results group content by meaning, such that a query term with different meanings produces different classes of search results with different meanings.09-24-2009
20090177642METHOD AND SYSTEM FOR AUTOMATED DETECTION OF APPLICATION PERFORMANCE BOTTLENECKS - A system for detecting performance bottlenecks in a target application. In response to receiving hotspot selections from a user interface, bottleneck rules are extracted from a database. A hotspot is a region of source code that exceeds a time threshold to execute in the target application. Metrics needed to evaluate the bottleneck rules extracted from the database are identified. The identified metrics are computed. It is determined whether each bottleneck rule extracted from the database is evaluated to true using the computed metrics for hotspots in the target application. In response to determining that a bottleneck rule is evaluated to true using an appropriate computed metric corresponding to the bottleneck rule, a bottleneck description is created for the bottleneck rule. Then, the bottleneck description is sent to the user interface.07-09-2009
20090319520Method and System for Generating Analogous Fictional Data From Non-Fictional Data - A method and system for generating analogous fictional data from non-fictional data, is provided. One implementation involves recording non-fictional data, scoring the non-fictional data in terms of occurrence percentile, obtaining a set of user-configurations that represents a likeness range between non-fictional data and corresponding fictional data, based on the scores and the user-configurations, generating analogous fictional data from the non-fictional data, and comparing hash values for the fictional data with hash values for the non-fictional data to determine matches, and in case of matches, generating analogous fictional data from the non-fictional data based on the scores and incrementally lowered likeness range, whereby entire records of fictional data are generated based on entire records of non-fictional data, wherein the fictional data is consistent with the non-fictional data.12-24-2009
20090319507METHODS AND APPARATUSES FOR ADAPTING A RANKING FUNCTION OF A SEARCH ENGINE FOR USE WITH A SPECIFIC DOMAIN - Methods and apparatuses are provided for adapting hierarchical structure information associated with a first ranking function tuned for use in a first domain for use in a second domain.12-24-2009
20090319504Method and Apparatus for Providing Enhanced Search Results to a User of a Communication Device - A method and apparatus of a wireless communication system for providing enhanced search results to a user. The method includes monitoring data communication on a user device, determining at least one contextual datum from at least a portion of the monitored data based on a predetermined rule; and generating a search result by applying the determined contextual datum to a search application.12-24-2009
20090150379METHOD FOR PROVIDING MULTIMEDIA TO PROVIDE CONTENT RELATED TO KEYWORDS, AND MULTIMEDIA APPARATUS APPLYING THE SAME - A method for providing multimedia and a multimedia apparatus applying the same. The method for providing multimedia includes searching for content related to keywords and generating channels to provide the content found as a result of searching. Therefore, it is possible for a user to more conveniently use Internet multimedia content using a TV.06-11-2009
20080313179INFORMATION STORAGE AND RETRIEVAL - An information retrieval apparatus is described for searching a set of information items and displaying the results of the search, the information items each having a set of characterizing information features. The apparatus comprises a search processor to search information items in accordance with user-defined characterizing information features and identify information items with corresponding characterizing information features. A mapping processor generates a map of information items, similar information items mapping to similar positions in the array, from a set of information items identified in the search. The apparatus includes a graphical user interface with a user control for selecting information items, and the search processor refines the search to identify information items relating to the selected information item. As such the user is provided with a facility for refining a search, and searching and navigating large amounts of data are thereby made easier.12-18-2008
20080313171Cluster-Based Ranking with a Behavioral Web Graph - A computer implemented method for returning relevant nodes in a network search has steps for (a) creating a behavioral network graph having points relating pairs of network nodes with values at the points indicating probability that a user connected at one node of the pair will transition next to the other node of the pair; (b) determining node clusters based on relatively high probability of transition between nodes in the cluster; (c) entering search criteria for finding nodes, and noting and returning nodes that satisfy the search criteria; and (d) returning in addition nodes in one or more clusters associated with one or more nodes that satisfy the search criteria.12-18-2008
20080313164System and Method for Selecting Search Listing in an Internet Search Engine and Ordering the Search Listings - Provided is a keyword advertising system extracting search listings in response to a search request, the system comprising: an interface receiving bid price information corresponding to a keyword from an advertiser; a search information database storing search listings associated with the advertisers in association with each of the received bid price information; a ranking module generating a search result list by referring to the search information database, in response to a search request from a searcher; and a search results providing module providing the searcher with the generated search result list; wherein the ranking module generates the search result list by performing the steps of: identifying a keyword received from the searcher in association with the search request; selecting N of search listings from the at least one search result listing corresponding to the identified keyword, based on the bid price information; and ordering the selected search listings in order of click through rate.12-18-2008
20090182724Database Query Optimization Using Index Carryover to Subset an Index - A method, apparatus and program product use a first index associated with a field in a database table to identify a range of records in the database table that includes instances of a first key value in the field and use the identified range of records to subset a second index associated with another field in a database table. The database query identifies the first key value for the field in the database table and the second key value for the other field in the database table. By doing so, information from an index may be carried over and applied to another index to subset the other index, often reducing the quantity of entries that are searched in the other index and improving performance.07-16-2009
20090177654SYSTEM AND METHOD FOR LEVERAGING MEDIA VIA USER RATING DATA - A method and system for leveraging user media file rating data. In one aspect, the system comprises a component in communication with a plurality of media file related services, a rating storage, and a component for making the user rating data available to be used by plural services so that each respective service can use the rating data to tailor a user experience during interaction by the particular user with the respective service. In one aspect, the user preference rating is received from any of the media file related services. In one aspect the user preference rating information and the associated media file information are associated with a particular user regardless of the respective media file related service the preference rating was received from.07-09-2009
20090319519COMMUNICATION SYSTEM, COMMUNICATION DEVICE, AND COMPUTER PROGRAM - There is provided a communication system including a plurality of communication devices including first, second and third communication devices and performable of one-on-one communication with one another. The first communication device includes: a search request reception part receiving a search request for data from the second communication device; a determination part determining whether or not data relevant to the search request is retained; a search request transmission part transmitting, when the relevant data is not retained, the search request to the third communication device; a data reception part receiving the data relevant to the search request from the third communication device; a data transmission part transmitting, to the second communication device that has transmitted the search request, the data received by the data reception part. The first communication device relays the search request and the data from/to the second communication device and the third communication device.12-24-2009
20090319518METHOD AND SYSTEM FOR INFORMATION DISCOVERY AND TEXT ANALYSIS - A method for searching text sources including temporally-ordered data objects, such as a blog, is provided including the steps of: (i) providing access to text sources, each text source including temporally-ordered data objects; (ii) obtaining or generating a search query based on terms and time intervals; (iii) obtaining or generating time data associated with the data objects; (iv) identifying data objects based on the search query; and (v) generating popularity curves based on the frequency of data objects corresponding to one or more of the search terms in the one or more time intervals. A system and computer program for text source searching is also provided.12-24-2009
20090319515SYSTEM AND METHOD FOR MANAGING ENTITY KNOWLEDGEBASES - Systems and methods are presented for building comprehensive entity knowledgebases that can consolidate multiple linked references to the same entity. The resulting virtual repository can be efficiently queried. An incoming record is clustered into entities, which are collections of attributes. The system can determine the entity that most closely matches an incoming record. Coarse-grain representations (blocking) may be used initially to select a set of the most closely-matching entities, and then fine-grain representations (linkage) may be used. Coarse-grain and fine-grain match probabilities may be integrated to obtain integrated match probabilities between the record and each of the closest-matching entities. Entities are updated, including creating a new entity, merging two or more entities into one, dividing one entity, and making no change in the entities, after which the record is entered into the appropriate entity or entities. Embodiments support both free-form querying and document matching.12-24-2009
20090319513SIMILARITY CALCULATION DEVICE AND INFORMATION SEARCH DEVICE - [Problems] To accurately calculate similarity between media data and a query even if the media data or its meta data has an error.12-24-2009
20090319511DESIRABILITY VALUE USING SALE FORMAT RELATED FACTORS - Some example embodiments illustrate a system and method to sort a search result using sale format information. The system and method include providing a desirability index including multiple desirability values. Each desirability value may be associated with a keyword and indicate an accumulative frequency of the keyword being in an item listing selected throughout multiple user transactions. The system and method include identifying a search result including item listings in response to a query from a user device. The system and method include accessing, for each item listing, the desirability index and getting a desirability value for each keyword included in the item listing. The system and method include calculating a relevancy value using the desirability values for the keywords of a given item listing. The system and method further include sorting the item listings according to their relevancy values and returning the sorted item listings to the user device.12-24-2009
20090319508CONSISTENT PHRASE RELEVANCE MEASURES - Two methods for measuring keyword-document relevance are described. The methods receive a keyword and a document as input and output a probability value for the keyword. The first method is a similarity-based approach which uses techniques for measuring similarity between two short-text segments to measure relevance between the keyword and the document. The second method is a regression-based approach based on an assumption that if an out-of-document phrase (the keyword) is semantically similar to an in-document phrase, then relevance scores of the in and out-of document phrases should be close to each other.12-24-2009
20090319505TECHNIQUES FOR EXTRACTING AUTHORSHIP DATES OF DOCUMENTS - Various technologies and techniques are disclosed for calculating authorship dates for a document. A portion of a document in which to look for possible authorship dates is determined. The possible authorship dates are extracted from the portion of the document. A revised authorship date of the document is generated using a neural network. The revised authorship date is returned to an application or process that requested the date.12-24-2009
20090319503MATCHING QUERIES IN A NETWORK - A method of matching queries in a hybrid infrastructure/infrastructure-less network, the network comprising pluralities of first and second type communication devices respectively, the method comprising placing a first query by a user on one of the first type devices and forwarding the query via infrastructure based communication to one of the second type devices; forwarding, depending on a category of the first query, the first query from the one second type device to one or more first type devices via infrastructure based communication; and relaying the first query from each of one or more first type devices to one or more neighbouring first type devices via infrastructure-less communication.12-24-2009
20090164443DATABASE PERFORMANCE MINING - A system, method and program product for analyzing performance of a system comprised of a database and its related operating environment. A system is provided that includes: a set of monitoring tools for monitoring event data from a database application and from an operating environment running the database application; a performance data warehouse for storing the event data; a modeling system for generating a performance mining model of the database system based on the event data stored in the performance data warehouse; and a system for comparing a stream of current event data against the performance mining model to identify performance issues in the database system.06-25-2009
20090112843SYSTEM AND METHOD FOR PROVIDING DIFFERENTIATED SERVICE LEVELS FOR SEARCH INDEX - Programs, systems and methods for providing differentiated service levels for a search index are disclosed. Data object documents are processed by extracting terms and scoring each of the terms associated with each document according to criteria to indicate relative importance of the associated document. A plurality of posting lists are generated for each term each comprising entries identifying documents that include the term. The entries are allocated to the different posting lists for the given term depending upon the score for the term associated with particular document. The different posting lists, e.g. a high score and low score posting list, may then be stored as data objects managed according to their indicated importance. For example, the high score posting list data object may be stored in higher performance storage than the low score posting list data object. Scores may be regularly updated.04-30-2009
20090112854SYSTEM AND METHOD FOR RELATED INFORMATION SEARCH AND PRESENTATION FROM USER INTERFACE CONTENT - A method and computer program product for extracting primary information from the content in response to a user selecting indicia rendered on a display of the handheld device. The primary information includes entities mentioned within the content. Related information is obtained from one or more content sources based on the primary information. The content is annotated to link at least a portion of the content to at least a portion of the related information, thus defining annotated content.04-30-2009
20090112851DATABASE MANAGEMENT SYSTEM, DATABASE MANAGEMENT METHOD AND DATABASE MANAGEMENT PROGRAM - A meta-information storing section records a plurality of pieces of taxonomy data in ranks. A plurality of pieces of leaf meta-data respectively correspond to pieces of the lowest taxonomy data. A database records a plurality of pieces of real data which respectively correspond to pieces of leaf meta-data. A server control section searches for upper taxonomy data corresponding to the keyword included in a search request and acquires lower taxonomy data associated with the upper taxonomy data. The server control section is capable of repeatedly acquiring further lower taxonomy data until the lowest taxonomy data is specified, and as a result, acquiring leaf meta-data. The server control section samples real data corresponding to all of the leaf meta-data from the database and outputs it.04-30-2009
20090112856STORAGE MEDIUM INCLUDING METADATA AND REPRODUCTION APPARATUS AND METHOD THEREFOR - A storage medium including metadata, which provide an extended search function using a variety of search keywords on audio-visual data, and a reproduction apparatus and a reproduction method of reproducing the storage medium. The storage medium includes: audio-visual data; and metadata to provide an extended search function on the audio-visual data, wherein the metadata include a predefined search keyword and a search keyword which may be additionally defined by an author. Accordingly, by using a variety of search keywords additionally defined by an author as well as predefined search keywords, providing an extended search function is possible. In addition, by recording only portions of the metadata relative directly to supporting multiple languages in an additional text-based file, providing an extended search function using a plurality of languages is also possible.04-30-2009
20090112850Bioitem Searcher, Bioitem Search Terminal, Bioitem Search Method, and Program - According to one aspect of the present invention, a bio-item searching apparatus searches for a target bio-item with a keyword input by a user. In the bio-item searching apparatus, the storage device stores a bio-item literature set having a literature in which the bio-item name is described for each of bio-items. The control device searches each of the bio-item literature sets with the keyword to acquire the number (Nh) of literatures including the keyword for each of the bio-items, selects the bio-item in which the number-of-literatures Nh is 1 or larger as a candidate bio-item, creates, for each of the candidate bio-items, a number-of-literatures table constituted by any one or both of a) the number-of-literatures Nh, and b) the number of literatures each not including the keyword and including the bio-item name (the number of literatures in the bio-item literature set of the bio-item - Nh), calculates a correlation score between the bio-item and the keyword based on statistical calculation by using the number-of-literatures table for each of the candidate bio-items, and outputs the candidate bio-items to the output device based on the correlation score.04-30-2009
20090112853Ranking query processing method for stream data and stream data processing system having ranking query processing mechanism - A mechanism for managing ranking information using a sign of a stream tuple generated when stream data is inserted into, or deleted from, a window is provided. A mechanism for generating only the differential information of ranking calculation results, a mechanism for adding ranking information according to a request, an interface for generating and outputting all ranking information from the differential information, a mechanism for generating all ranking calculation results, and an interface for using these mechanisms are provided.04-30-2009
20090112847APPARATUS AND METHOD FOR ENHANCING A COMPOSITION WITH RELEVANT CONTENT POINTERS - A computer readable medium includes executable instructions to identify a pre-existing list of keyphrases, to compare a keyphrase in a composition to the pre-existing list of keyphrases to obtain at least one candidate content pointer, and to associate the keyphrase with a content pointer selected from the at least one candidate content pointer.04-30-2009
20090112849Selecting a second content based on a user's reaction to a first content of at least two instances of displayed content - Embodiments provide a device, apparatus, system, computer program product, and method. A provided electronic apparatus includes a display surface, a response sensor apparatus, a target-content selector circuit, a characterization circuit, a query circuit, and a chooser circuit.04-30-2009
20090150390DATA RETRIEVING APPARATUS, DATA RETRIEVING METHOD AND RECORDING MEDIUM - In a server apparatus including a document database for storing a plurality of documents, a retrieval log database for storing a retrieval history made when retrieving documents corresponding to an inputted retrieval condition from the document database, and an access log database for storing an access history made when browsing and printing documents, degrees of utilization of documents are calculated based on the respective retrieval history and access history, and documents are extracted from the document database based on the calculated degrees of utilization. When a request for an extraction result is received, the extraction result is presented to a PC that the user is using.06-11-2009
20090138457GROUPING AND WEIGHTING MEDIA CATEGORIES WITH TIME PERIODS - A method and system for scoring media items are provided. In general, a number of media categories are defined. Each of the media categories is defined by at least one criterion such as at least one genre, at least one artist, or the like, or any combination thereof. For each of the media categories, weights are assigned to a number of time periods. Thus, a weight assigned to a particular time period, such as a decade, may vary between media categories. In one embodiment, the criteria defining the media categories and the weights assigned to the time periods within each of the media categories are user-defined. Media items are then matched to the media categories and scored as a function of the weights assigned to the time periods for the matching media categories.05-28-2009
20090112842METHODS AND APPARATUS FOR WEB-BASED RESEARCH - A method and apparatus for facilitating web-based research among a community of users. Templates created by at least one user within the community may be modified by another user to facilitate further research on an item, such as a product and/or service, associated with the template. The modified template may be populated with data collected from at least one website and organized in such a manner so as to facilitate making a decision about the item.04-30-2009
20080288480EFFICIENT ONLINE COMPUTATION OF DIVERSE QUERY RESULTS - The system includes a query engine and an advertisement engine. The query engine is configured to receive a query from the user. The advertisement engine generates advertisement results corresponding to the query. The advertisement results are selected from entries in an advertisement database, where the entries include predicate values corresponding to a domain. The advertisement engine generates a diverse advertisement result that is a subset of the database entries that match the query. The diversity result varies at least one predicate by selecting entries for the list that include a proportional representation of each available predicate value in the database that matches the query.11-20-2008
20090112845SYSTEM AND METHOD FOR LANGUAGE SENSITIVE CONTEXTUAL SEARCHING - A method, system and computer-readable media for searching a database and returning relevant results are disclosed. The method includes the steps of receiving a user query in one language, searching a database based on the user query to obtain one or more results, processing the results according to a local linguistic context association with the user query, and presenting to the user the results with an identifier for each result in which a local linguistic context around a location of the user query is in a second language.04-30-2009
20090112852USER MANUAL SUPPORTING METHOD AND APPARATUS USING ERROR PATTERN ANALYSIS - A user manual supporting method for use in an electronic appliance includes converting a series of operations performed by the user to operate the electronic appliance into a pattern of user operation sequence, and checking if an error is present in the pattern of user operation sequence to retrieve a pattern of erroneous operation sequence corresponding to the pattern of user operation sequence having the error. Thereafter, a manual content associated with the pattern of erroneous operation sequence is extracted and the extracted manual content is provided to the user. The manual content includes text and/or graphics information for notifying the user of a missing operation in the pattern of user operation sequence, or for guiding a normal operation against the pattern of erroneous operation sequence.04-30-2009
20090112848Method and system for suggesting search queries on electronic devices - A method and system implementing a process for suggesting search queries on an electronic device is provided. The process involves displaying terms related to content accessed by a user for selection by the user, obtaining one or more key terms related to a user-selected term, and displaying the one or more key terms to the user as query suggestions corresponding to the selected term. Obtaining one or more key terms involves obtaining one or more key terms related to the selected term, based on local content information and/or external content information.04-30-2009
20090112846SYSTEM AND/OR METHOD FOR PROCESSING EVENTS - The subject matter disclosed herein relates to processing information regarding events. In one particular example, a stabbing query may be formulated in response to an event. One or more sets are associated with and/or mapped to nodes of a tree.04-30-2009
20090112857Methods and Systems for Improving a Search Ranking Using Related Queries - Systems and methods that improve search rankings for a search query by using data associated with queries related to the search query are described. In one aspect, a search query is received, a related query related to the search query is determined, an article (such as a web page) associated with the search query is determined, and a ranking score for the article based at least in part on data associated with the related query is determined. Several algorithms and types of data associated with related queries useful in carrying out such systems and methods are described.04-30-2009
20090112841DOCUMENT SEARCHING USING CONTEXTUAL INFORMATION LEVERAGE AND INSIGHTS - A method and system are disclosed that enable a user to search a large collection of structured and unstructured documents using semantic concepts that the system provides to them, to search the most relevant business activity first, and then using one of the business activities as the additional context to search for specific document or documents that are most relevant. One aspect of the invention provides a methodology to perform concept-based structured search over document collections to obtain search results as business activities and associated relevant documents using the business activity context. The document collections are obtained by aggregating documents corresponding to a business activity. The instances are extracted from the document collections together with any concept-relationship specific heuristics in that domain. Another aspect of the invention enables the enterprise user to enter concepts and instances that define the search parameters using a structured user interface.04-30-2009
20090112840Method For Selecting Electronic Advertisements Using Machine Translation Techniques - A system for selecting electronic advertisements from an advertisement pool to match the surrounding content is disclosed. To select advertisements, the system takes an approach to content match that takes advantage of machine translation technologies. The system of the present invention implements this goal by means of simple and efficient machine translation features that are extracted from the surrounding context to match with the pool of potential advertisements. Machine translation features are used as features for training a machine learning model. In one embodiment, a ranking SVM (Support Vector Machine) is trained to identify advertisements relevant to a particular context. The trained machine learning model can then be used to rank advertisements for a particular context by supplying the machine learning model with the machine translation feature measures for the advertisements and the surrounding context.04-30-2009
20080270385Method and Tool For Searching In Several Data Sources For a Selected Community of Users - The invention concerns a search method including a mode for defining at least one reading table (V) of contents of documents specific to a selected community of users (10-30-2008
20090106225IDENTIFICATION OF MEDICAL PRACTITIONERS WHO EMPHASIZE SPECIFIC MEDICAL CONDITIONS OR MEDICAL PROCEDURES IN THEIR PRACTICE - A scheme enables the identification of medical professionals having expertise with a particular medical condition or procedure. Areas of expertise are assigned to both conditions and procedures and medical professionals who treat the condition or perform the procedure. A description for treatment is received and used to identify a specific condition or procedure. Upon identification of the condition or procedure, the areas of expertise assigned to the condition or procedure are retrieved. Medical professionals who also have assigned one or more of the retrieved areas of expertise are then identified.04-23-2009
20080256056System for building a data structure representing a network of users and advertisers - A system is described for building a data structure representing a network of advertisers and users. The system may include a memory and a processor. The memory may be operatively connected to the processor and may store a historical dataset comprising a plurality of query items and advertisement items, a plurality of query-advertisement link items, a weight, a data structure and a condition. The processor may identify the historical dataset, and link the query items to the advertisement items to generate query-advertisement link items. The processor may determine the weight of each query-advertisement link item and may store the query-advertisement link items and the weight in the data structure if the query-advertisement link item satisfies the condition.10-16-2008
20080250011METHOD AND APPARATUS FOR QUERY EXPANSION BASED ON MULTIMODAL CROSS-VOCABULARY MAPPING - A computer implemented method, apparatus, and computer usable program code for multimodal cross-vocabulary mapping. A corpus of multimodal content is annotated simultaneously using annotations from a plurality of vocabularies to form a set of common annotations. Relationships between a first vocabulary associated with a first modality and a second vocabulary associated with a second modality are identified using the set of common annotations to form a multimodal vocabulary mapping. Items in the first vocabulary associated with the first modality are mapped to items in the second vocabulary associated with the second modality using the multimodal vocabulary mapping.10-09-2008
20080243812RANKING METHOD USING HYPERLINKS IN BLOGS - A method for static ranking of web documents is disclosed. Search engines are typically configured such that search results having a higher PageRank® score are listed first. A modified scoring technique is provided whereby the score includes a reset vector that is biased toward web pages linked to blogs. This requires identifying web pages as either blogs or non-blogs.10-02-2008
20080228757Identifying Co-associating Bioattributes - A bioinformatics method, software, database and system are presented in which combinations of pangenetic and non-pangenetic attributes that are most likely to co-associate with a query attribute (i.e., an attribute of interest) are identified from a database containing attribute combinations and corresponding statistical results that indicate the strength of association of the attribute combinations with the query attribute.09-18-2008
20080228756Compiling Co-associating Bioattributes - A bioinformatics method, software, database and system are presented in which attribute profiles of query-attribute-positive individuals and query-attribute-negative individuals are compared, and combinations of pangenetic and non-pangenetic attributes that occur at a higher frequency in the group of query-attribute-positive individuals are identified and stored to generate a compilation of bioattribute combinations that co-associate with the query attribute (i.e., an attribute of interest).09-18-2008
20090055390INFORMATION SORTING DEVICE AND INFORMATION RETRIEVAL DEVICE - An information retrieval device and the like are provided to quickly retrieve information desired by a user even when information is collected based on the user's taste or interest. Each of sort item generating units (02-26-2009
20090055391INFORMATION PROCESSING APPARATUS AND METHOD, PROGRAM, AND RECORDING MEDIUM - An information processing apparatus includes: an extracting means for extracting a feature volume from a predetermined content; and a computing means for computing an evaluation axis that classifies a first content and a second content by using a first feature volume extracted from the first content by the extracting means or a second feature volume extracted from the second content by the extracting means.02-26-2009
20090070325Identifying Information Related to a Particular Entity from Electronic Sources - Presented are systems, apparatuses, articles of manufacture, and methods for identifying information about a particular entity including receiving electronic documents selected based on one or more search terms from a plurality of terms related to the particular entity, determining one or more feature vectors for each received electronic document, where each feature vector is determined based on the associated electronic document, clustering the received electronic documents into a first set of clusters of documents based on the similarity among the determined feature vectors, and determining a rank for each cluster of documents in the first set of clusters of documents based on one or more ranking terms from the plurality of terms related to the particular entity, where the one or more ranking terms contain at least one term from the plurality of terms for the particular entity that is not in the one or more search terms.03-12-2009
20090070322BROWSING KNOWLEDGE ON THE BASIS OF SEMANTIC RELATIONS - Computer-readable media and computer systems for conducting semantic processes to facilitate navigation of search results that include sets of tuples representing facts associated with content of documents in response to queries for information. Content of documents is accessed and semantic structures are derived by distilling linguistic representations from the content. Groups of two or more related words, called tuples, are extracted from the documents or the semantic structures. Tuples can be stored at a tuple index. Representations of the relational tuples are displayed in addition to documents retrieved in response to a query.03-12-2009
20090070323INFERENCE OF QUERY RELATIONSHIPS - Various example embodiments are provided for inferring relationships between queries. In an example, queries are related based on the identification of common terms between the queries. Another example is to relate queries based on the identification that the queries are associated with a single search session.03-12-2009
20090070320Methods and Apparatus for Interactive Name Searching Techniques - Methods and apparatus include presenting an initial set of names to a user. The user selects a set of names from those presented. An Interactive Evolutionary Algorithm (IEA) extracts features of each selected name from a database of names and features to form a feature set. The IEA forms a set of match features that are chosen from the feature set according to a priority function and/or weighting of the features, either of which may vary in succeeding iterations. The IEA searches the database to obtain a candidate set of names, where each name has features matching the match features. One or more names is chosen from the candidate set and added into a presentation set of names. The IEA may repeat the formation of the match features, candidate set, and selection of one or more names from the candidate set until the new presentation set is complete.03-12-2009
20090070318Method and system for selecting personalized search engines for accessing information - A method and system for selecting personalized search engines for accessing information is provided. Each personalized search engine represents one or more base search engines. Characteristic information representing searching capabilities of each of the multiple personalized search engines is obtained. A personalized search engine is selected among the multiple personalized search engines for executing a query based on said characteristic information and the query.03-12-2009
20090070316WEB-BASED SUCCESSION PLANNING - A system and method are provided for providing a web-based succession planning process. Through a web-based interface, pre-defined talent criteria are utilized to define talent pools retrieved from a database. Each criterion can then in turn have a weight value and a threshold value assigned to it. A gap threshold percentage is assigned to the talent pool. Succession criteria are then assigned from a pre-defined succession criteria database list based upon characteristics of a group of employees being assessed. Candidates for the talent pool are then determined from the group of employees based upon the assessed talent criteria score relative to the gap threshold of the talent pool, and the succession criteria scores for each of the employees from an employee assessment database. The candidates for the talent pool can then be displayed in an HTML viewer based upon the assessment score and succession criteria scores.03-12-2009
20090070314Research System And Method With Record Builder - A system for providing relevant documents from a plurality of databases, including a search module for receiving at least one search expression, at least one managed database including a plurality of managed documents and a plurality of search records, each search record including at least one prior search expression associated with at least one of the plurality of managed documents, a plurality of unmanaged databases including a plurality of unmanaged documents, wherein the search module queries the managed database to determine at least one of the search records corresponding to the received search expression, and wherein the search module retrieves at least one of the managed documents associated with the determined search record. The search module may further query the plurality of unmanaged databases to determine one or more unmanaged documents corresponding to the at least one search expression and store the unmanaged document in the managed database.03-12-2009
20090070313ADAPTIVELY REORDERING JOINS DURING QUERY EXECUTION - A method is disclosed for executing a predetermined query plan, the method comprising: executing a portion of the query plan; providing a reordered query plan; comparing ranking metrics for the query plans; and executing the query plan having the lower ranking metric.03-12-2009
20090070311SYSTEM AND METHOD USING A DISCRIMINATIVE LEARNING APPROACH FOR QUESTION ANSWERING - Disclosed are systems, methods, and computer readable media for answers to natural language questions. The method embodiment comprises training a lexical association model between a question and a first set of one or more possible answers, training a semantic association model between a question and a second set of one or more possible answers, receiving a user question containing at least one query word, parsing the user question syntactically and semantically, formulating a query from the parsed user question containing at least one query word, expanding the query based on the lexical association model and the semantic association model, weighting the at least one query word according to its importance when answering the user question, and returning an answer based on the weighted at least one query word, the lexical association model, and the semantic association model. Other features include using question-answer pairs mined to train the models and returning a plurality of answers in an order based on the lexical association model and the semantic association model.03-12-2009
20090070312INTEGRATING EXTERNAL RELATED PHRASE INFORMATION INTO A PHRASE-BASED INDEXING INFORMATION RETRIEVAL SYSTEM - An information retrieval system uses phrases to index, retrieve, organize and describe documents, analyzing documents and storing the results of the analysis as phrase data. Phrases are identified that predict the presence of other phrases in documents. Documents are then indexed according to their included phrases. Related phrases and phrase extensions are also identified. Changes to existing phrase data about a document collection submitted by a user are captured and analyzed, and the existing phrase data is updated to reflect the additional knowledge gained through the analysis.03-12-2009
20090037412QUALITATIVE SEARCH ENGINE BASED ON FACTORS OF CONSUMER TRUST SPECIFICATION - A method of providing a search engine for use on global computer networks which identifies and merges categories of information that reflect, influence and imitate intelligent choice by concurrently searching one or more of eight factors of consumer trust: books, experts, news and articles, associations, celebrities and pro choice, awards, web information and blogs and people's choice. The results from the search of these selected consumer trust factors are then combined to generate a final report.02-05-2009
20090037410SYSTEM AND METHOD FOR PREDICTING CLICKTHROUGH RATES AND RELEVANCE - Systems and methods according to embodiments leverage click data to predict a relevance judgment for a given query-content item pair. An initial training phase utilizes a training set of query-content item pairs coupled with click data and relevance data (e.g., relevance judgments or labels) to train a model of the relationship between relevance and clicks. Accordingly, given an unlabeled query-content item pair as input to the model, a relevance judgment or label is provided. These relevance labels, in turn, may be used in conjunction with query-content item pairs with which they are associated to train a model to determine a content item relevance function. When a user provides a query to a given search engine, the search engine applies the content item relevance function to the query and content items in a responsive result set to provide a relevance ordered result set to the user.02-05-2009
20090037407SYSTEM AND METHOD FOR SORTING ATTACHMENTS IN AN INTEGRATED INFORMATION MANAGEMENT APPLICATION - A system and method to sort attachments in an integrated information management application. The system includes an email agent, an email repository, and an attachment engine. The email agent facilitates organization of email communications within the integrated information management application. The email repository is coupled to the email agent. The email repository stores a plurality of email files and a plurality of email attachments. The email attachments are associated with at least some of the email files. The attachment engine is coupled to the email agent. The attachment engine generates a list of the email attachments within the email repository for visual communication on a display device.02-05-2009
20090037400CONTENT MANAGEMENT SYSTEM THAT RENDERS A DOCUMENT TO A USER BASED ON A USAGE PROFILE THAT INDICATES PREVIOUS ACTIVITY IN ACCESSING THE DOCUMENT - A content management system (CMS) monitors a user's activity for a document, generates corresponding usage data for the user, and binds the usage data to corresponding sections of the document. A relevance policy may be defined for a user and/or for a user's role. The CMS may then render the document to the user based on the usage data and the relevance policy. The rendered document may include displayed sections, hidden sections, and accentuated sections. The result is a document rendered to a user in a way that hides sections that are not of interest, displays sections of interest, and accentuates sections of high interest, all based on usage data that indicates how the document was accessed in the past.02-05-2009
20090037402SYSTEM AND METHOD FOR PREDICTING CLICKTHROUGH RATES AND RELEVANCE - Systems and methods according to embodiments leverage click data to predict a relevance judgment for a given query-content item pair. An initial training phase utilizes a training set of query-content item pairs coupled with click data and relevance data (e.g., relevance judgments or labels) to train a model of the relationship between relevance and clicks. Accordingly, given an unlabeled query-content item pair as input to the model, a relevance judgment or label is provided. These relevance labels, in turn, may be used in conjunction with query-content item pairs with which they are associated to train a model to determine a content item relevance function. When a user provides a query to a given search engine, the search engine applies the content item relevance function to the query and content items in a responsive result set to provide a relevance ordered result set to the user.02-05-2009
20090037401Information Retrieval and Ranking - A learning method is used to generate ranking models. The learning method can create a ranking function that assigns scores to documents and then ranks the documents using the scores. In this learning method, a training set along with performance measures are used to generate weak rankers which are used in the ranking model. During information retrieval, for a given query, the system may return a ranked list of documents in descending order of the relevance scores.02-05-2009
20090299996RECOMMENDER SYSTEM WITH FAST MATRIX FACTORIZATION USING INFINITE DIMENSIONS - Systems and methods are disclosed for generating a recommendation by performing collaborative filtering using an infinite dimensional matrix factorization; generating one or more recommendations using the collaborative filtering; and displaying the recommendations to a user.12-03-2009
20090299994Automatic generation of embedded signatures for duplicate detection on a public network - In accordance with an aspect of the invention, a method and system are disclosed for constructing an embedded signature in order to facilitate post-facto detection of leakage of sensitive data. The leakage detection mechanism involves: 1) identifying at least one set of words in an electronic document containing sensitive data, the set of words having a low frequency of occurrence in a first collection of electronic documents; and, 2) transmitting a query to search a second collection of electronic documents for any electronic document that contains the set of words having a low frequency of occurrence. This leakage detection mechanism has at least the following advantages: a) it is tamper-resistant; b) it avoids the need to add a watermark to the sensitive data; c) it can be used to locate the sensitive data even if the leakage occurred before the embedded signature was ever identified; and, d) it can be used to detect an embedded signature regardless of whether the data is being presented statically or dynamically.12-03-2009
20090240693Service and Method for Providing a Single Point of Access for Multiple Providers' Video and Audio Content - A method that uses a computer network to provide a single point of access between a plurality of video & audio devices and a plurality of video & audio content provider systems. The present invention is a method for providing a searchable, aggregated directory view of available video & audio content from multiple content providers as well as additional services such as content renting. The present invention includes a method for identifying users as subscribers of content providers' memberships.09-24-2009
20090187552System and Methods for Generating Data Analysis Queries from Modeling Constructs - A method for automatically generating data analysis queries from at least one modeling construct includes selecting a preconfigured template identifying at least one metric or dimension; retrieving dashboard model data comprising the preconfigured template; filtering to the dashboard model data using at least one user-specific access control; and automatically generating a query for at least one database.07-23-2009
20090077071SYSTEM AND METHOD FOR RESPONDING TO A SEARCH REQUEST - Provided is a system and method for context based searching. A participating website may be provided with an interface which allows users to perform searches by indicating keywords to be searched. A search services provider performs a search based on the keywords as well as the context of the participating website. In a further development, the context of the participating website may be determined by deriving one or more context words from the text of the participating website. In a second aspect, a system and method are provided which allow a user to move results in a list of results provided in response to a search request. These actions of the user are recorded and saved. The rankings of subsequent searches are based on these recorded actions.03-19-2009
20090055381Domain Dictionary Creation - Methods, systems, and apparatus, including computer program products, to identify topic words in a document corpus that includes topic documents related to a topic are disclosed. A reference topic word divergence value based on the document corpus and the topic document corpus is determined. A candidate topic word divergence value for a candidate topic word is determined based on the document corpus and the topic document corpus. The candidate topic word is determined to be a topic word if the candidate topic word divergence value is greater than the reference topic word divergence value.02-26-2009
20090063465System and method for string processing and searching using a compressed permuterm index - An improved system and method for string processing and searching using a compressed permuterm index is provided. To build a compressed permuterm index for a string dictionary, an index builder constructs a unique string from a collection of strings of a dictionary sorted in lexicographic order and then builds a compressed permuterm index to support queries over the unique string. A dictionary query engine supports several types of wild-card queries over the string dictionary by performing a backward search modified with a CyclicLF operation over the compressed permuterm index. These queries may be used to implement other queries including a membership query, a prefix query, a suffix query, a prefix-suffix query, a query for an exact or substring match, a rank query, a select query and so forth. String processing and searching tasks may accurately be performed for sophisticated queries in optimal time and compressed space.03-05-2009
20090063459System and Method for Recommending Songs - A system and method, operable by a processor running on a computing device and stored on a tangible computer readable medium, the system and method creating continuous, fixed duration, fixed size, or other such playlists for use on an individual listener's portable music player, as a programming guide for an Internet radio station, or the like. Information can be drawn from a number of recommendation sources to help generate such playlists based on a dictionary of terms. Recommendation sources are sources available via the internet or other published information that identify the order in which songs are presented or played, and which may be aggregated and processed into song sequence data that allows the instant system and method to utilize the experience, effort and musical expertise of others to generate a continuous playlist. Exemplary recommendation sources include, but are not limited to, published Disc Jockey (“DJ”) playlists, radio (terrestrial, satellite or internet) station websites from which playlists can be extracted or derived, individual listener playlists, or the like.03-05-2009
20090083264REPORTING TO A WEBSITE OWNER ONE OR MORE APPEARANCES OF A SPECIFIED WORD IN ONE OR MORE PAGE-SPECIFIC OPEN-ENDED COMMENTS CONCERNING ONE OR MORE PARTICULAR WEB PAGES OF A WEBSITE - In one embodiment, a method for reporting to a website owner one or more appearances of a specified word in one or more page-specific open-ended comments concerning one or more particular web pages of a website includes receiving one or more page-specific open-ended comments concerning one or more particular web pages of a website from users that have accessed the particular web pages, identifying, in the one or more page-specific open-ended comments concerning the one or more particular web pages of the website, each appearance of the specified word, and generating a report reflecting the identified appearances of the specified word in the one or more page-specific open-ended comments concerning the one or more particular web pages of the website.03-26-2009
20090265343SYSTEMS AND METHODS FOR CREATIVE WORKS REGISTRATION AND OWNERSHIP DETERMINATIONS - Systems and methods for creative work registration are described. In one aspect, a processor may perform various steps. An upload of a creative work may be received at the processor. The processor then may create a digital fingerprint for the creative work. The processor then may associate the digital fingerprint with the creative work and associate ownership information with the digital fingerprint and the creative work. The associated ownership information, digital fingerprint, and the creative work may be stored in a database.10-22-2009
20100030772SYSTEM AND METHOD FOR CREATING AND USING PERSONALITY MODELS FOR USER INTERACTIONS IN A SOCIAL NETWORK - A social computing system and method includes a model creator configured to create an initial model of a user's musical preferences across a spectrum of attributes. The system and method further include a matching and comparison technique, which matches users based on the similarity of their respective musical preferences. The system and method further include an interactive display, allowing a user to view and interact with the nearby users who share the user's musical preferences.02-04-2010
20090313236SEARCHING, SORTING, AND DISPLAYING VIDEO CLIPS AND SOUND FILES BY RELEVANCE - A documents database has a plurality of documents, including but not limited to text files, video clips and sound files. Each document is associated with at least one category of a plurality of categories in a categories database, and each category has at least one keyword. A search request having at least one search term is received from a user, and a categories database is searched for categories having a keyword corresponding to the user search term to identify first level categories. The other keywords from the identified first level categories are retrieved and the documents database is searched for documents having a user search term or a retrieved keyword. The identified documents are then ranked and presented to the user. Other search expansion techniques, and display techniques, are also discussed.12-17-2009
20090313233INSPIRATION SUPPORT APPARATUS, INSPIRATION SUPPORT METHOD AND INSPIRATION SUPPORT PROGRAM - An inspiration support apparatus includes: a text database that stores a plurality of texts; a text mining section that analyzes the plurality of texts stored in the text database by text mining, and outputs a text that is a result of the mining; a keyword set database that stores conversion keywords; a keyword extraction section that extracts a keyword from the text that is the result of the mining by using the conversion keywords stored in the keyword set database; a keyword conversion section that converts, with respect to the text that is the result of the mining, the keyword extracted by the keyword extraction section in the text into one of the conversion keywords stored in the keyword set database; and a result output section that outputs the text converted by the keyword conversion section.12-17-2009
20090313242CONTENT ASSESING APPARATUS, CONTENT SEARCHING APPARATUS, CONTENT ASSESING METHOD, CONTENT SEARCHING METHOD, AND FIRST AND SECOND COMPUTER PROGRAMS - A content evaluation device includes: extraction means (12-17-2009
20090083251CONTENT QUALITY APPARATUS, SYSTEMS, AND METHODS - Embodiments herein receive a set of content quality threshold values, a search string, and a content data stream at a content quality metric (CQM) apparatus. Content segments associated with the content data stream are scored and/or graded according to a set of content relevance scales. The content data stream is then filtered to include only passing content segments and intermediate calculation values used to determine whether a content segment is passing. Other embodiments are described and claimed.03-26-2009
20090313247User Interface for Facts Query Engine with Snippets from Information Sources that Include Query Terms and Answer Terms - A method and a system for providing snippets of source documents of an answer to a fact query are disclosed. Snippets of source documents may be provided in response to a user request for the source documents from which the fact answer to a fact query was extracted. The snippets include the terms of the fact query and terms of the answer. The snippets may be displayed along with Uniform Resource Locators (URLs) of the source documents.12-17-2009
20090313244SYSTEM AND METHOD FOR DISPLAYING CONTEXT-RELATED SOCIAL CONTENT ON WEB PAGES - A method for displaying context-related social content on web pages may comprise a method wherein one or more computer processors cause performance of steps comprising matching at least one cue with content of at least one web page served by a web site a user has currently open in a web browser of the user, said cue containing social content or objects from other web sites than the web site serving the web page the user currently has open in the web browser.12-17-2009
20100023499SYSTEM AND METHOD FOR A CONTENT FINGERPRINT FILTER - A system and method for a content fingerprint filter. Various embodiments include receiving content and a preference from a user. The content is encoded without any available identifying information. A technical analysis of the encoded content is performed for one or more technical attributes. The available identifying information is paired with the one or more technical attributes to form a content fingerprint, where the content fingerprint identifies the content. The content fingerprint is combined with the preference to create a content fingerprint filter. The content fingerprint filter is used to filter pieces of available content, where each piece of available content has an associated content fingerprint. Other embodiments are described and claimed.01-28-2010
20090106224REAL-TIME QUERY SUGGESTION IN A TROUBLESHOOTING CONTEXT - A method for assisting a user to develop a query in a natural language includes receiving a user's query in a natural language and, while the user's query is being entered, presenting a subset of ranked query suggestions from a collection of ranked query suggestions to the user as candidates for user queries. The subset is based on that portion of the user's query already entered. The query suggestions in the subset of query suggestions are presented according to their respective rankings in the collection. Each of the query suggestions in the collection is formulated to retrieve at least one responsive instance in the knowledge base. The rankings of the query suggestions in the collection are based at least in part on stored logs of prior user sessions in which user queries were input to a search engine for retrieving responsive instances from the knowledge base.04-23-2009
20100042616SYSTEMS AND METHODS FOR SELECTING AND PRESENTING REPRESENTATIVE CONTENT OF A USER - Representative content of a user may be selected from the content submitted by the user and available on a website. The content may be rated and the selection of the representative content may be based upon the user ratings. The ratings may be submitted by other users and/or may be received or derived from other sources, such as an editorial board, website employees, hit-count, or the like. The representative content may include the highest-rated, lowest-rated, and/or average-rated content items submitted by the user. Indications of the representative content items may be displayed in connection with content submitted by the user. Displaying the indications of the representative content may motivate users to author and/or submit quality content to the website. The representative content displayed in connection with content submitted by the user may provide an easy-to-digest assessment of the user's corpus.02-18-2010
20100017387SYSTEM AND METHOD FOR PERFORMING ADVANCED SEARCH IN SERVICE REGISTRY SYSTEM - A system and associated method for searching a service registry system with a service name. The present invention receives a request to search a service description with the service name. If conventional search does not find a match for the service name in a registry, the present invention parses the service name and generates candidate service names for alternative searches from synonyms stored in a dictionary database. The registry is searched again with generated candidate service names and any service description found to be a match of any candidate service name is returned.01-21-2010
20090204597SYSTEM AND METHOD FOR PREFERRED SERVICES IN NOMADIC ENVIRONMENTS - A method of locating preferred services includes searching an augmented spatial index, which is based on a user's determined preferred services. Additionally, the method includes indicating a location of a currently-sought preferred service.08-13-2009
20090204598AD RETRIEVAL FOR USER SEARCH ON SOCIAL NETWORK SITES - In this invention, systems and methods for providing keywords for advertising are provided. After a user searches for another user in a social network, the webpage or blog of the queried user is retrieved, and keywords are extracted from this webpage. The keywords may be extracted from the user's profile on the social network (e.g., favorite sports, music artists, etc.), or keywords may be extracted from the text of the webpage (e.g., comments that comprise the blog entries). Once extracted, these keywords may then be used by an advertising system to provide targeted advertisements to the user.08-13-2009
20090210407METHOD AND SYSTEM FOR ADAPTIVE DISCOVERY OF CONTENT ON A NETWORK - A method is provided for identifying documents that include a searchable form relevant to a topic. A document is received. Whether the received document comprises a form is determined. A form includes a field presented to a user requesting information from the user. If the received document is determined to comprise a form, a determination is made concerning whether or not the form is a searchable form. A searchable form returns non-trivial information to a requester in response to a submission of the form. If the form is determined to be a searchable form, a determination is made concerning whether or not the form is relevant to an identified topic. If the form is determined to be relevant to the identified topic, the document is identified as a searchable form relevant to the identified topic.08-20-2009
20090210405Method, system, and apparatus for providing advice to users - In one embodiment of this invention, one can record and provide answers to users' problems. When a user selects a command (e.g., Show Tips) on the user interface (UI), the mouse pointer is changed, so when the user clicks on an aspect of the UI, a web window displays all of the problems that the users have entered on the particular clicked item by querying a forum knowledge base. One embodiment provides capability to expand the search, e.g., to anything the user clicked on recently or items which are related to the selected item.08-20-2009
20100036839APPARATUS, METHOD AND COMPUTER PROGRAM FOR CONTENT RECOMMENDATION AND RECORDING MEDIUM - A base/inversion component extractor calculates an occupancy rate of each component of a vector of user preference information obtained from information stored on a user preference database, and extracts a base component from the user preference information in accordance with the calculated occupancy rate of the base component. The base/inversion component extractor extracts a similar base component from item preference information obtained from information stored on an item metadatabase. A recommendation engine calculates a similarity between a base vector of the user preference information and a base vector of the item preference information, and identifies, as candidate items, a predetermined number of items in descending order of similarity. The recommendation engine further calculates a similarity between an inversion vector of the user preference information and an inversion vector of the item preference information, and identifies an item candidate having a low similarity.02-11-2010
20100036828CONTENT ANALYSIS SIMULATOR FOR IMPROVING SITE FINDABILITY IN INFORMATION RETRIEVAL SYSTEMS - A system and method including a simulator operating in conjunction with a search-engine, for improving document and site findability. Users input their content (pages or sites) and the simulator will analyze the site in terms of structure and content. It will then give the user a ranked list of suggestions about how the user might improve his/her site's findability. The user will then be able to apply some or all of these suggestions, or any other changes, by virtually modifying the site, and then immediately receive feedback both on how the pages look and a sense of the degree of findability improvement. The interactive process allows users to simulate modifications in their site structure and content in order to improve its findability. When the user completes the modifications and is satisfied with the new findability of his site, the user will be able then to replace his/her current site in the repository with the modified one.02-11-2010
20090049034ONTOLOGY SYSTEM PROVIDING ENHANCED SEARCH CAPABILITY - An ontology system providing enhanced search capability receives a search request specifying nodes and edges of interest and determines a set of matching ontologies stored in a knowledge store. The ontology system also generates a ranking for each of the matching ontologies based on the extent of matching. Data indicating the matching ontologies and corresponding rank is sent as a search result.02-19-2009
20100023502FEDERATED COMMUNITY SEARCH - The subject matter disclosed herein relates to web searching protocols.01-28-2010
20090327269PATTERN GENERATION - Generation of patterns used to facilitate search queries is provided herein. A pattern includes a sequence of token classes and new token classes. A sample query is parsed to identify tokens within the sample query that match a token associated with a referenced set of token classes. New token classes are generated for unidentified tokens within the sample query. A pattern is generated by substituting the identified tokens of the sample query with corresponding token classes and substituting the unidentified tokens of the sample query with corresponding new token classes.12-31-2009
20090327279APPARATUS AND METHOD FOR SUPPORTING DOCUMENT DATA SEARCH - In a search support server, a related word extraction unit generates frequency information and co-occurrence information of keywords, a graph generation unit generates coordinate information of a spring graph including the keywords as nodes, on the basis of the co-occurrence information, a cluster generation unit groups the nodes into clusters and thereby generates cluster definition information, and a display information generation unit generates display information of the spring graph. In addition, an operation determination unit determines which operation is performed on the spring graph. Then, when a level change is instructed, the display information generation unit generates display information of the spring graph after the level is changed. When a node change is instructed, a cluster re-generation unit changes the cluster definition information and the frequency information. When a search query generation is instructed, a search query generation unit generates a search query with a keyword of a selected cluster.12-31-2009
20090313239Adaptive Visual Similarity for Text-Based Image Search Results Re-ranking - Described is a technology in which images initially ranked by some relevance estimate (e.g., according to text-based similarities) are re-ranked according to visual similarity with a user-selected image. A user-selected image is received and classified into an intention class, such as a scenery class, portrait class, and so forth. The intention class is used to determine how visual features of other images compare with visual features of the user-selected image. For example, the comparing operation may use different feature weighting depending on which intention class was determined for the user-selected image. The other images are re-ranked based upon their computed similarity to the user-selected image, and returned as query results. Retuning of the feature weights using actual user-provided relevance feedback is also described.12-17-2009
20090313237GENERATING QUERY SUGGESTIONS FROM SEMANTIC RELATIONSHIPS IN CONTENT - A method for suggesting related queries to a user query using semantic relationships that are present in informational content stored in public domains. Semantic relationships between named entities are discovered and the named entities are extracted. The entities are indexed according to the relationships. When a user query is received that includes one of the entities, query suggestions are returned to the user based on indexed relationships corresponding to the entity named in the user query.12-17-2009
20090043760PROGRAM SEARCHING APPARATUS AND PROGRAM SEARCHING METHOD - There is provided a program searching apparatus, including: an extracting unit extracting words or phrases described in plural program information as keywords; an identifying unit identifying categories to which the keywords belong; a first calculating unit calculating a number of program information containing the keywords as first information; a second calculating unit calculating a number of keywords that belong to the categories as second information; a specifying unit specifying one program as a search query; a weight calculating unit calculating, for each of query keywords extracted from program information of the search query, a weight based on the first and second information; a similarity calculating unit calculating a similarity level to the search query with respect to a search target program according to the weight corresponding to a query keyword included in the program information of the search target program.02-12-2009
20090313232Methods and Apparatus to Calculate Audience Estimations - Methods and apparatus for calculating audience estimations are disclosed. An example method includes identifying a subset of stored viewership data and allocating an observation array having a first-dimension index, each element of the index associated with one time-period of at least one household datapoint in the subset of stored viewership data. Additionally, the example method includes transferring the identified subset to the observation array, building an extensible markup language (XML) file based on at least one detected characteristic in the observation array, and generating a graphical user interface (GUI) based on the XML file for use with at least one query selection associated with the at least one detected characteristic.12-17-2009
20090313246DOCUMENT IMPORTANCE CALCULATION APPARATUS AND METHOD - A computer readable storage medium stores a program that allows a computer to execute a process comprising: acquiring information related to N documents; determining elements of an N-th square matrix D based on the acquired information, in order that D, a positive real number e, and a column vector u having N elements satisfy e u=D u according to the Perron-Frobenius theorem, each of the elements of D being a positive real number; initializing a column vector v having N elements, each of the elements of v corresponding to each of the elements of u; calculating a column vector w=(D v)/|D v|; updating v in the memory to w; iterating the calculating and the updating, until the v satisfies a predetermined condition; and assigning each of elements of the v to the importance of the document.12-17-2009
20090313245Mixed Media Reality Brokerage Network With Layout-Independent Recognition - A Mixed Media Reality (MMR) system and associated techniques are disclosed. The MMR system provides mechanisms for forming a mixed media document that includes media of at least two types, such as printed paper as a first medium and a digital photograph, digital movie, digital audio file, or web link as a second medium. The present invention also includes a number of novel methods including: a method for layout independent MMR recognition, a strip fragment candidate generation process, and a page candidate accumulation process.12-17-2009
20090313243METHOD AND APPARATUS FOR PROCESSING SEMANTIC DATA RESOURCES - A semantic data resource of a domain is processed by calculating relevance scores for terms which occur in domain corpora and weighting the semantic data resource depending on the relevance scores calculated for these terms. The semantic data resource may include domain-specific terms and relations, such as a domain ontology, a domain terminology and a domain classification. The domain ontology may include a domain-specific-hierarchy of terms assigned to nodes which are connected by edges and may be encoded in a web ontology language. The relevance scores may be chi-square scores which are calculated depending on a frequency of a term in the domain corpora and an expected frequency of the term.12-17-2009
20090313241Seeding search engine crawlers using intercepted network traffic - A method includes monitoring data packets exchanged in a computer network over which documents having respective location identifiers are distributed, so as to detect a request to access a given document. A location identifier of the given document is extracted from the request. The location identifier is provided to a search engine that searches for data in a set of the documents, so as to cause the search engine to add the given document to the set.12-17-2009
20090313238SEARCH INDEX FORMAT OPTIMIZATIONS - A search index structure which extends a typical composite index by incorporating an index which is optimized for fast retrieval from storage and which eliminates data which is specific to phrase searching. Other data is represented in a manner which allows it to be calculated rather than stored. Associating variable length entries with logical categories allows their length to be inferred from the category rather than stored. Using delta values between document IDs rather than the ID itself generates a compact, dense symbol set which is efficiently compressed by Huffman encoding or a similar compression method. Using an upper threshold to remove large, and thus rare, delta values from the symbol set prior to encoding further improves the encoding performance.12-17-2009
20100005088Using An Encyclopedia To Build User Profiles - Described are various embodiments which enable organizations to track and use knowledge and expertise of their associated individuals. An organization can use exemplary embodiments to automatically summarize the expertise of each individual from documents available from internal or external web sites. For example, a web crawler crawls a computer network to identify documents that name an individual. Summaries of the documents are generated based on articles in an encyclopedia, and a profile is built of the individual using the summaries. These summaries are used for automatically searching and automatically discovering individuals having particular knowledge or expertise on certain topics and subjects.01-07-2010
20100036830CONTEXT BASED SEARCH ARRANGEMENT FOR MOBILE DEVICES - Embodiments are directed towards managing mobile searches by enabling a user to indicate a context of a search query to narrow a scope of the search. A user may fine tune a search by selecting from a plurality of pre-defined contexts for which to perform a search query. In one embodiment, the user may combine two or more pre-defined contexts to create more complex contexts for use in customized context search queries. The user also enters one or more search terms. A subset of databases is selected from a plurality of databases associated with different subject categories. The subset of databases is selected as predefined by an operator based on the user's context, and searched based on the user's entered search terms and selected context. Results are then aggregated and provided to the user. Results may be rank ordered based on the given user context or user's previous search behavior.02-11-2010
20100036838Search Engine - A search engine for retrieving documents from a database including a semantic document editor that allows a user to edit an existing document by creating searchable compound words that contain information contextually relevant to the contents of the document. The editor associates the created compound words with the document to produce an enhanced document having the compound words associated therewith. A database is provided for storing enhanced documents and a semantic query editor is provided that enables a searcher to address the database of enhanced documents with a query. The query editor receives the query and converts it into one or more compound search words that contain contextually relevant information. A search module is provided that receives the searchable compound words and locates the relevant enhanced documents that have compound words associated with the document matching the searchable compound words. An output module presents any located documents to the searcher.02-11-2010
20100036835Caching Query Results with Binary Decision Diagrams (BDDs) - Construct a plurality of first binary decision diagrams (BDDs), each representing a different one of a plurality of words. Construct a plurality of second BDDs, each representing a different one of a plurality of search queries, each of the search queries comprising one or more of the words. Construct a plurality of third BDDs, each representing a different one of a plurality of web pages. Construct a plurality of fourth BDDs, each representing a different one of a plurality of search results, each search result comprising one or more web pages. Construct a plurality of fifth BDDs each representing a different one of a plurality of search tuples, each of the search tuples comprising a different one of the search queries and a different one of the search results. Construct a sixth BDD representing the search queries and the search results.02-11-2010
20100036834LOCATION-BASED INFORMATION RETRIEVAL - A collection of data records may be augmented by, for each data record of a plurality of data records, parsing each data record to find an address, converting the address to a geographic location indicator, and associating the geographic location indicator with said data record. These data records may be searched by receiving an indication of a geographic area in order to obtain a set of records with geographic location indicators representing geographic locations within, or partly within, the geographic area.02-11-2010
20100036832SEARCHING BY OBJECT CATEGORY FOR ONLINE COLLABORATION PLATFORM - In an example embodiment, an online advertising management platform receives a login that identifies a user as a user allowed access to an account maintained by the platform. The platform displays a toolbar having a textbox that allows the user to search for data relating to all accounts to which the user has access. The platform displays a first page of initial search results after the user enters an initial search term in the textbox and launches a search. The first page includes a list of data objects relevant to the initial search term grouped by object category and a list box that allows the user to select an object category. The platform displays a second page after the user selects an object category from the list box. The second page includes an entry box that is related to the selected object category and that facilitates subsequent search.02-11-2010
20100036831GENERATING CONTINUOUS QUERY NOTIFICATIONS - Techniques are described to allow a query to be registered as a persistent stored entity within the database, and to generate notifications as and when the query result changes continuously as long as the query continues to be registered with the database. According to one aspect, for a table referenced in a query, a filter condition is generated based, at least in part, on a predicate of the query. Then, the database server determines whether the filter condition is satisfied by either a before image of a row, or an after image of the row, that was modified by a transaction. If the filter condition is satisfied by either the before image or the after image, then the query is added to a first set of queries whose result sets may have been affected by the transaction. From among the first set of queries, a second set of queries that have result sets that were actually affected by the transaction is determined. Notifications are then sent based on the second set of queries.02-11-2010
20100036829SEMANTIC SEARCH BY MEANS OF WORD SENSE DISAMBIGUATION USING A LEXICON - Techniques are disclosed for analyzing a “context window” of a search query to determine a semantic meaning of a search word and to filter search results based upon the semantic meaning. Generally, a lexicon may be used to store forms, meanings, and usages of words and phrases. When a user specifies a query, a semantic analyzer obtains all of the word senses for a search word. The semantic analyzer applies lexical analysis techniques to the search word and context window to obtain a total score for each word sense and selects the word sense with the highest total score. After query results such as documents containing the search words are obtained, the semantic analyzer applies lexical analysis techniques to filter the results so that only documents which use the search terms according to the selected word sense are returned.02-11-2010
20100036827INTERCONNECTED, UNIVERSAL SEARCH EXPERIENCE ACROSS MULTIPLE VERTICALS - One or more query terms that were submitted by a user in connection with a first vertical of a plurality of verticals and not in connection with any other vertical of the plurality of verticals are received. A first set of search results that are both (a) indexed in the first vertical and (b) relevant to the one or more query terms is determined. A second set of search results that are both (a) indexed in a second vertical, but not in the first vertical and (b) relevant to the one or more query terms is also determined. A search results page that contains both sets of search results, and that visually distinguishes the sets from each other, is generated. According to one aspect, the results from the second set are shown on the search results page in what appears to be a yellow sticky note.02-11-2010
20100036836Contextual Keyword-Based Access Control - Various implementations of contextual keyword-based access control are disclosed.02-11-2010
20100036840Presentation of Search Results - Objects contained within enormous geographically distributed virtual file servers spanning thousands (or even millions) of organizations are each assigned globally unique object identifiers, enabling the implementation of highly distributed indexing and retrieval operations. The file system API (application programming interface) is extended to provide a search capability. A search request targeting a specific domain creates a parallel namespace anchored in that domain's root directory. The parallel namespace, containing directories and links to all objects satisfying the search criteria, may be navigated using the standard file system API. Relevance scores, added as new members of the file attribute structure, enable the construction and presentation of views that convey where the centers of expertise associated with the search matter are located. In addition, a wide range of methods addressing the scalability issues associated with integrating retrieval operations into the fabric of geographically distributed virtual file servers are disclosed.02-11-2010
20100057724SERVER DEVICE FOR CREATING LIST OF GENERAL WORDS TO BE EXCLUDED FROM SEARCH RESULT - A server device of the present invention includes a control unit collecting texts stored in a storage unit in response to an instruction from the outside or when a predetermined time is reached, extracting words from the collected texts, determining, as a general word, a word which appears at a frequency higher than a first predefined value for a first predetermined period, and which appears at a frequency that varies within a second predefined value range for every second predetermined period that is shorter than the first predetermined period, and creating a general word list which enumerates the general words.03-04-2010
20100057721Information Providing Server, Information Providing Method, and Information Providing System - According to one embodiment, a record information storage module stores record information related to content recorded by an external apparatus in association with a keyword representing the content and user information. A search information storage module stores a search phrase used for searching in an external apparatus in association with user information. A search word handler extracts a predetermined number of words having high search frequency from search phrases stored in association with specific user information to generate a word list. A record information handler extracts keywords stored in association with the specific user information to generate a keyword list. A ranking processor generates ranking information indicating a word in the word list which matches a keyword in the keyword list. A communicator provides the ranking information to an external apparatus corresponding to the specific user information.03-04-2010
20100057729System, Method, and Computer Program Product for a Geometric Search of a Configurable Product Structure - A method for searching a bill of materials (BOM) in a data processing system, a data processing system configured to perform a corresponding method, and a computer program product encoded with instructions for performing the method. The method includes retrieving BOM data in a data processing system, and forming a wavefront queue of a plurality of proto lines corresponding to the BOM data. The method also includes determining the cumulative geometric bounds of multiple ones of the plurality of proto lines and performing a geometric bounds test on the cumulative geometric bounds of multiple ones of the plurality of proto lines. The method also includes producing a BOM line to each proto line that passes the geometric bounds test, and adding the produced BOM lines to a candidate results list stored in the data processing system.03-04-2010
20090216745Techniques to Consume Content and Metadata - Content and metadata associated with the content may be provided to a number of users. The content may be displayed on a display device while the metadata may be transmitted to a remote device corresponding to a receiving user. The user may further request desired information or metadata pertaining to the content and the requested information or metadata may be transmitted to the user's remote device. Different users may request different information on the same or different objects being displayed or presented on a display device. Each requesting user may receive requested information on the same or different objects via corresponding remote devices.08-27-2009
20090216762Just in Time Wiring Information System - A just in time wiring information system, which includes an aircraft wiring information system module, a technical reference module, an interactive computer aided cable repair system module, and an e-suite. The e-suite communicates with the aircraft wiring information system module, the technical reference module, and the interactive computer aided cable repair system module such that via the e-suite a user may obtain information from each of the modules.08-27-2009
20090216758METHOD AND APPARATUS FOR AN APPLICATION CRAWLER - A computer-implemented method is provided for searching for files on the Internet. In one embodiment, the method may provide an application crawler that assembles and dynamically instantiates all components of a web page. The instantiated web application may then be analyzed to locate desired components on the web page. This may involve finding and analyzing all clickable items in the application, driving the web application by injecting events, and extracting information from the application and writing it to a file or database.08-27-2009
20090216761Signature Based System and Methods for Generation of Personalized Multimedia Channels - A system for generating personalized channels of multimedia content. The system comprises an interface to one or more multimedia sources, wherein the multimedia sources provide multimedia content to the personalized channels of multimedia content; and a server for receiving multimedia content from the one or more multimedia sources through the interface and for serving selected multimedia content to users of the system over one or more of the personalized channels; wherein a user of the system receives personalized multimedia content gathered by the server into the one or more of the personalized channels responsive of preferences of the user as observed by the system for the user.08-27-2009
20090216760SEARCH ENGINE WITH WEBPAGE RATING FEEDBACK BASED INTERNET SEARCH OPERATION - The system and methods herein provide feedback of a web page quality/legitimacy factor, various user interaction parameters, a contact address correlation factor, and an explicit web page rating on the reverse path from the client to the servers for Internet search operations. This operation helps improve the quality of websites/web pages and enhances the efficiency of the Internet search operation. This reverse communication also allows for the automatic blockage of any illegitimate websites due to poor “contact address correlation factor” and poor “web page quality factor.” The rating of the websites is based on a computed number called “web page quality factor.” The “web page quality factor” is communicated in the reverse path of Internet search operation back to various whois servers, domain registrars, and web servers on the Internet to further improve quality. This facilitates the filtering of scammers, squatters, illegal/unwanted sites, etc., which have low “web page quality factor” rating resulting in high efficiency of search operations.08-27-2009
20090216759METHOD AND VECTOR ANALYSIS FOR A DOCUMENT - The invention provides a document representation method and a document analysis method including extraction of important sentences from a given document and/or determination of similarity between two documents.08-27-2009
20090216740Method for Indexing for Retrieving Documents Using Particles - An information retrieval system stores and retrieves documents using particles and a particle-based language model. A set of particles for a collection of documents in a particular language is constructed from training documents such that a perplexity of the particle-based language model is substantially lower than the perplexity of a word-based language model constructed from the same training documents. The documents can then be converted to document particle graphs from which particle-based keys are extracted to form an index to the documents. Users can then retrieve relevant documents using queries also in the form of particle graphs.08-27-2009
20090216757System and Method for Performing Frictionless Collaboration for Criteria Search - A criteria search performed by a search engine comprises associating each of a plurality of users with at least one knowledge domain, associating a user's query to at least one subject area in the at least one knowledge domain, and generating search results based on the user's query and the at least one subject area. A search engine comprises a plurality of search repositories configured to perform different types of searches and a search component. The search component receives a search query from a user, modifies the query based on one or more knowledge domains that the user is associated with, submits the modified search query to the plurality of search repositories, and receives search results from the plurality of search repositories.08-27-2009
20090216756RECORDING MEDIUM CARRYING DATA SEARCH PROGRAM, DATA SEARCH APPARATUS, AND DATA SEARCH METHOD - A data search apparatus searches for target data based on a keyword included in the target data, and classification information indicating classification of the target data.08-27-2009
20090216754System and method of distribution for geospatial data - In accordance with one or more embodiments, a method comprises receiving a query comprising geospatial attributes, displaying at least one image indicative of at least one geospatial data set matching the query, receiving a request for the at least one geospatial data set, and transmitting the at least one geospatial data set.08-27-2009
20090216753Electronic data retrieving apparatus - An electronic data retrieving apparatus is provided that increases the retrieval accuracy without deteriorating the retrieval efficiency by reflecting differences between the numbers of word appearances due to genres of electronic data in the setting of the retrieval words. The electronic data retrieving apparatus according to the present invention sets the retrieval words of the electronic data not only as a word appearing on a retrieval word setting table of the recorded electronic data for a predetermined number of times (e.g., three times) or more but also a word appearing on the retrieval word setting table and appearing on a retrieval word setting reference table for a predetermined number of times (e.g., three times) or more.08-27-2009
20090216751ATTRIBUTE EXTRACTION PROCESSING METHOD AND APPARATUS - A machine-executable attribute extraction method comprising: extracting, vis-à-vis a plurality of documents in the archival memory (that also stores registration dates and attributes of the documents) having registration dates falling within a desired time period, feature words for each attribute value of the corresponding attributes of the plurality of documents; registering, into the work memory, the desired time period, and the extracted feature words for each attribute value of the corresponding attributes of the plurality of documents; determining, amongst the extracted feature words in the work memory, first feature words for which the attribute has a first attribute value and second feature words for which the attribute has a second attribute value; calculating a similarity between the first feature words and the second feature words; judging whether the similarity satisfies a condition; and outputting the second attribute value when the similarity satisfies the condition.08-27-2009
20090216750ELECTRONIC PROFILE DEVELOPMENT, STORAGE, USE, AND SYSTEMS THEREFOR - Examples of the present invention include profiling systems that store, manage, and utilize profile information to take predictive or deterministic action. Embodiments of the invention allow the profiling system to be used as a trusted intermediary where the profile owning entity controls access to their profile information across their network of devices and services.08-27-2009
20090216748INTERNET DATA MINING METHOD AND SYSTEM - A method for automatically acquiring a set of data opens a searchable Internet database; initiates an automated timed search of each one of a plurality of records, each record in the plurality of records includes common criteria with the other records; retrieves information associated with the searched record; and provides the retrieved information in a desired format.08-27-2009
20090216746Method, System, and Apparatus for Aggregation System for Searchable Travel Data - The system may be configured to actively obtain or passively receive charter data objects. The data objects may include information like available charter equipment (e.g., aircraft), availability location and/or time information. The system may be configured to process a variety of object formats from a variety of data sources. The system extracts charter flight data indicators and corresponding flight characteristics to create and populate a charter flight data record, which serves as the foundation for charter data management system components. The system implements search functionality identifying charter flight data records with a best match or alternate suggested results. Charter flight records originate from multiple sources that are published/distributed in standardized or non-standardized ways. The system accepts and stores available flight requests from clients and automatically alerts users (i.e., indirectly through sales representatives or directly to the prospect or customer).08-27-2009
20090216744GRAPHICAL/RICH MEDIA ADS IN SEARCH RESULTS - A method and system for mixing rich media content with textual listing on a webpage includes receiving a plurality of advertisement parameters associated with an advertisement from an advertiser. The advertisement parameters define the advertisement and are used for booking the advertisement. Additional media content associated with the advertisement is obtained from the advertiser. The additional media content includes rich media content. A dynamic content window is defined for rendering the additional media content. A graphical icon is provided for the advertisement to indicate that additional media content is available for the advertisement. The graphical icon is activated through a control or is activated by default. The graphical icon is associated with the dynamic content to provide access to the additional media content on the webpage in response to detecting a user action at the graphical icon.08-27-2009
20090216743Systems, Methods and Computer Program Products for the Use of Annotations for Media Content to Enable the Selective Management and Playback of Media Content - The exemplary embodiments of the present invention provide a method for searching an annotation repository and visualizing the results of the search, wherein the annotation in the annotation repository is associated with a plurality of media content. The method includes retrieving the media contents used to generate the metadata terms satisfying a search criteria and generating a ranked list of search results. The method further includes visualizing the ranked list of media contents and displaying relevant annotation and corresponding metadata associations for the media contents to enable navigation of the media contents.08-27-2009
20090216742SYSTEMS, METHODS AND COMPUTER PROGRAM PRODUCTS FOR INDEXING, SEARCHING AND VISUALIZING MEDIA CONTENT - The exemplary embodiments of the present invention provide a method for searching a metadata repository and visualizing the results of the search, wherein the metadata in the metadata repository is associated with a plurality of media content, and wherein each media content includes at least one audio track. The method comprises retrieving the media contents used to generate the metadata terms satisfying a search criteria, and generating a ranked list of search results. The method further includes visualizing the ranked list of media contents, and displaying relevant metadata and corresponding associations for the media contents to enable navigation of the at least one audio track included in the media contents.08-27-2009
20090216741PRIORITIZING MEDIA ASSETS FOR PUBLICATION - Methods and apparatus are described by which media assets may be prioritized and published in accordance with current topics of interest derived from a dynamic data set representing the online activity of a relevant population of users.08-27-2009
20090216739BOOSTING EXTRACTION ACCURACY BY HANDLING TRAINING DATA BIAS - Methods and apparatus are described for use with information extraction techniques based on sequential models. Additional statistics are maintained during inference and employed to boost the accuracy of the extraction algorithm and mitigate the effects of training bias.08-27-2009
20090216738Systems and Methods of Identifying Chunks Within Inter-Related Documents - A computer receives a request to search one or more secondary documents. At least one of the secondary documents is associated with a primary document. The computer searches at least a subset of the secondary documents for documents that satisfy the search request and identifies at least one secondary document that satisfies the search request.08-27-2009
20090216737Systems and Methods of Refining a Search Query Based on User-Specified Search Keywords - After receiving a search keyword provided by a user, a computer selects an archetype for the search keyword. The computer identifies one or more search results in accordance with the archetype and returns at least one of the search results to the user. After selecting the archetype, the computer identifies at least one query operator for the selected archetype, constructs a search query using the query operator, and executes the search query against one or more data sources. Sometimes, the computer solicits user instructions with respect to the archetype and then generates feedback to the user instructions. This process may repeat multiple loops until the user submits a search query execution request, which suggests that the user is satisfied with the customized search query.08-27-2009
20090216736Systems and Methods of Displaying Document Chunks in Response to a Search Request - A computer displays a portion of a document to a user. Upon receiving a user-specified text string that includes multiple search keywords, the computer identifies a chunk within the document that satisfies the search keywords and displays the identified chunk to the user, wherein terms in the identified chunk that match the search keywords are either ordered differently from the search keywords in the user-specified text string or separated from one another by at least one term not matching any of the search keywords.08-27-2009
20090216735Systems and Methods of Identifying Chunks Within Multiple Documents - A computer identifies multiple resource identifiers, each resource identifier corresponding to a document at a respective data source. For at least one of the resource identifiers, the computer retrieves the corresponding document from the respective document source, identifies within the retrieved document a chunk that satisfies one or more user-specified search keywords, and displays the identified chunk and a link to the identified chunk within the document to the user.08-27-2009
20090216734SEARCH BASED ON DOCUMENT ASSOCIATIONS - A method and a processing device are provided. A group of documents may be selected from multiple documents of a search result. Associations among the selected group of documents may be determined and indicated. An indication of ones of the associations that are of interest and/or others of the associations that are of no interest may be received. A new search result may be presented, including one or more documents satisfying some or all of the associations of interest and none of the associations of no interest. In some embodiments, a document may be selected from a search result and characteristics of the document may be determined. A search result may be presented, which may include one or more documents having none or some of the characteristics of the selected document. A visual indication of a strength of an association of a document may be provided.08-27-2009
20090216733GEO-TRIP NOTES - A user may use a mobile device to request information related to a selected topic or a point of interest. A location of the mobile device may be determined in order to provide the user with informational content related to the selected topic or point of interest in close proximity to the user. The mobile device may receive and display the informational content as a set of search results. The user may select one or more of the search results in order to review the information content referenced by the selected one or more search results. A verification process or step may ensure that the selected information is relevant to the selected topic or determined location, and a link may be generated relating the topic, the selected search result(s), and the determined mobile device location. Moreover, a rating system may be used to provide an indication of the relevancy of one or more search results. Thereafter, additional users, or the same users, may be provided access to the link when located in close proximity to the determined location.08-27-2009
20100057726Collaborative Search - A collaborative search is disclosed. The collaborative search includes receiving a search request of a mobile terminal; searching based on the search request to obtain search results; obtaining a relationship list associated with an identifier of the mobile terminal; obtaining search history information associated with members in the relationship list based on the relationship list; and ranking the search results based on the search history information.03-04-2010
20100057727DETECTION OF RECURRING NON-OCCURRENCES OF EVENTS USING PATTERN MATCHING - Techniques for detecting recurring non-occurrences of an event. In one embodiment, techniques are provided for detecting the non-occurrence of an event within each of a series of time periods following the occurrence of another event. Language extensions are provided that enable queries to be formulated for detecting recurring non-occurrence of an event following occurrence of a triggering event.03-04-2010
20100057723PROVIDING ANSWER TO KEYWORD BASED QUERY FROM NATURAL OWNER OF INFORMATION - A type of search engine (referred to as the “Get Engine”) receives one or more keywords, semantically formulates a question being asked from the keywords, generates specifications for the query, and searches a website index to determine websites that are likely owners of the answer to the question based on the query specifications and website classifications. The Get Engine determines a website that is most likely the owner of the answer based on credibility, searches the pages of the website using the keywords and additional keywords related to the query, retrieves the answer from the pages of the website, and receives feedback used in part to determine the credibility of the website.03-04-2010
20100057725INFORMATION RETRIEVAL DEVICE, INFORMATION RETRIEVAL METHOD, AND PROGRAM - In an exemplary aspect, the present invention includes a control unit that when a keyword for search is entered, collects texts containing that keyword from texts stored in a storage unit, extracts a noun of collected first texts, determines a noun partially matching with the keyword as a first word, extracts a second text containing that first word among the first texts, extracts a word from the second text, the word being one of a noun, a verb, and an adjective, counts the number of times an extracted word is used, determines a word whose number of times of use is placed in predefined highest ranks as a second word, the second word being a related word to the first word, and outputs the first word and the second word.03-04-2010
20100057722Image processing apparatus, method, and computer program product - The target content selecting unit selects a first image from the content storage unit. The relevance calculating unit calculates relevance of second images to the first image by use of the metadata. The display content selecting unit identifies a second image selected before the first image, based on the history information, and selects the identified second image and any second images that satisfy a selection condition regarding the relevance. The output information generating unit generates output information that is used for displaying first selection information from which the first image can be selected and second selection information from which second images can be selected, on the display device. In this output information, the second selection information of second images having greater relevance is displayed closer to the first selection information.03-04-2010
20100057728ITERATIVE AND INTERACTIVE CONTEXT BASED SEARCHING - The present invention extends to methods, systems, and computer program products for iteratively and interactively searching for information. Embodiments of the invention can provide a user with relevant location-specific information in response to a query from the user. Provided information can also be relevant to a user's predicted future behavior. As context for a user is obtained and/or accumulated, such as, for example, through an interactive query dialogue, the probability of providing relevant information in response to a query from the user increases.03-04-2010
20100057730CONTACT INFORMATION QUERYING - System and method for querying of contact information are disclosed. An aspect of the invention includes a method for querying contact information. The method includes receiving a query language including relationship information of a plurality of contacts with unknown contact information. The method further includes acquiring a query request, wherein acquiring the query request includes parsing the query language according to a query language syntax. The method further includes querying contact information of the plurality of contacts with unknown contact information in at least one directory to obtain the contact information of the plurality of contacts with unknown contact information requested in the query request. The method further includes returning the contact information of the plurality of contacts with unknown contact information requested in the query request.03-04-2010
20100057713ENTITY-DRIVEN LOGIC FOR IMPROVED NAME-SEARCHING IN MIXED-ENTITY LISTS - According to one embodiment of the present invention, a method for name searching in mixed-entity lists is provided which comprises dividing a mixed list of entities into a plurality of entity-specific lists. A name to be searched is then categorized into a category and a specialized search logic is applied to the name to be searched. The specialized search logic is selected to be adapted to the category and uses one of the entity-specific lists that corresponds to the category of the name to be searched. A shared search logic may also be employed, which is used for all names to be searched.03-04-2010
20100057717System And Method For Generating A Search Ranking Score For A Web Page - A system and method for generating a search ranking score for a web page. The system comprises a training data processor effective to receive training data including at least a first page, a first label, a second page and a second label. The system further comprises a feature extraction processor connected to the training data processor, the feature extraction processor is effective to receive the first page, identify first features in the first page and calculate first values relating to the first features; the feature extraction processor is further effective to receive the second page and identify second features and calculate second values relating to the second features. A machine learning processor is connected to the feature extraction processor. The machine learning processor is effective to receive the first features, the first values, the first label, the second features, the second values, and the second label and generate a ranking function for a search engine based on the first features, the first values, the first label, the second features, the second values, and the second label. A receiving processor is connected to the machine learning processor. The receiving processor is effective to receive a web page. A ranking processor is connected to the receiving processor. The ranking processor is effective to apply the ranking function to the web page to generate a score.03-04-2010
20100057716System And Method For Providing A Topic-Directed Search - A system and method for providing a topic-directed search is provided, which advantageously harnesses user-provided topical indexes and an ability to characterize indexes according to how articles fall under their topical organizations. A corpus of articles and an index that includes topics from the articles is maintained. For each topic, a coarse-grained topic model is built, which includes the characteristic words included in the articles relating to the topic and scores assigned to the characteristic words. A search query is executed against the index. The topics that match the search terms are chosen by their scores. The topics that match the coarse-grained topic models and the articles corresponding to the search query are presented. In contrast to conventional search engines, search results are organized according to topic and search results can be offered across multiple indexes, where part of returned results are selected from most-relevant indexes with their most-relevant topics.03-04-2010
20100057715Prevention of a User Mimicking Another User in a Virtual World - Methods and apparatus associate a computed difference factor to avatars that are to interact with one another in a simulated environment. Applying a difference factor to the avatars enables identification of similar avatars in order to avoid mistaken identities among the avatars. The difference factor predicts probability that one avatar is mimicking another avatar. An attribute uniqueness algorithm may assign the difference factor based on name, appearance, and/or accessory similarity between two avatars. A user index may be used to store data describing attributes of each avatar for analysis using programs that are stored in memory and that execute the attribute uniqueness algorithm. Further, system validation of each avatar provides ability to protect and control likeness of the avatars in the virtual world.03-04-2010
20100057712INTEGRATED COMMUNITY-BASED, CONTRIBUTION POLLING ARRANGEMENT - Disclosed are apparatus and methods for facilitating online user polling over a computer network. In general, a particular web property is a web service or site that is related to a particular type of subject matter or service. A user of a particular web property can select a polling feature and initiate a query regarding a particular subject matter that is related to such particular web property. A set of online users-to-be-polled may then be determined so that these users-to-be-polled are likely to have knowledge (or have access to knowledge) regarding such subject matter about which a query has been initiated. At least some of the determined users-to-be-polled are not currently accessing the particular web property or subject matter, e.g., may be accessing a different web property. Since the users-to-be-polled can include both users who are and are not accessing the particular web property of the originating query, the determined users-to-be-polled can form a diverse set of online users who can provide helpful information regarding the particular query. This determined set of online users is then polled with the query initiated by the querying user, and the one or more answers provided by such polled users are then provided to the querying user.03-04-2010
20100057714SEARCH RESULTS RANKING METHOD AND SYSTEM - A ranking method and system. The method includes receiving by a computing system, from a user, a keyword associated with a search for information. The computing system generates a results list comprising links to files comprising data associated with the keyword. The computing system generates and displays a first ranked results list comprising the links in a first ranked order. The computing system receives from the first user, a selection for a first link of the links. The computing system determines that the first link comprises relevant information associated with the keyword. The computing system generates a second ranked results list. The second ranked results list comprises the links in a second ranked order differing from the first ranked order. The first link is listed as a first selection on the second ranked results list. The computing system stores the second ranked results list.03-04-2010
20100057711EXTRACTION OF CRITICAL INFORMATION FROM DATABASE - Some embodiments of extraction of critical information from a database in a networked system have been presented. In one embodiment, a subset of data from the database in the networked system is extracted. The subset of data is indexed to generate an index. Using the index, a preview of the subset of data may be provided to users in response to a user request without accessing the database.03-04-2010
20090030894Spoken Document Retrieval using Multiple Speech Transcription Indices - A method and system are provided for spoken document retrieval using multiple search transcription indices. The method includes receiving a query input formed of one or more query terms and determining a type of a query term, wherein a type includes a term in a speech recognition vocabulary or a term not in a speech recognition vocabulary. One or more indices of search transcriptions are selected for searching the query term based on the type of the query term. The one or more indices are generated using different speech transcription methods. The results for the query term are scored by the one or more indices and the results of the one or more indices for the query term are merged. The results of the one or more query terms are then merged to provide the results for the query.01-29-2009
20090024606Identifying and Linking Similar Passages in a Digital Text Corpus - A corpus contains digital text from multiple documents. A passage mining engine identifies similar passages in the documents and stores data describing the similarities. The passage mining engine groups similar passages into groups based on degree of similarity or other criteria. The passage mining engine ranks the similar passages found in the text corpus based on quality or other criteria. A user interface is presented that includes hypertext links associated with the similar passages that allow a user to navigate the documents.01-22-2009
20080235205Database Search Results User Interface - A system and method for retrieving and displaying search results by retrieving a user's search results from a database and providing an interface with which the user scrolls through the search results. The system and method approximate a rate at which the user scrolls through the search results based on at least one user action, and retrieves additional search results from the database based on the approximated rate. The system and method display the search results on a display device in predetermined patterns of screen positions in cooperation with the navigation/scroll control interface.09-25-2008
20090006371SYSTEM AND METHOD FOR RECOMMENDING INFORMATION RESOURCES TO USER BASED ON HISTORY OF USER'S ONLINE ACTIVITY - Blogs (and other information sources) are recommended to a user based on a history of the user's online activities. The system: (1) processes the user's web history, (2) identifies blog posts (and web pages) that link to pages read by the user, (3) generates multiple relevance scores for each identified post/page, and (4) produces multiple rankings of the corresponding source blogs (and web sites) by aggregating individual relevance scores (or combinations of relevance scores), according to users' preferences. The system allows the discovery of information sources that are likely to be interesting to the user and allows sources lost in the “long tail” to be seamlessly discovered.01-01-2009
20090006366AUTOMATIC SIGNIFICANCE TAGGING OF INCOMING COMMUNICATIONS - As incoming communications are received, a priority or significance level can be assigned to each communication. A communication determined to have a high priority can be presented to a user at substantially the same time as receiving the communication. A communication having a low priority can be placed in a low priority folder or flagged differently from a high priority communication (e.g., different color-coding). Behavior of a user as it relates to a received communication can be observed for learning purposes or to modify one or more classifications or priority levels.01-01-2009
20090006353Method and Apparatus for Selecting Items from a Number of Items - Techniques are presented for selecting one or more items from a collection of items. To select the one or more items, an interface is provided that is adapted to allow a user to define one or more weights. Each weight corresponds to one of a number of similarity criteria. Each item also corresponds to the number of similarity criteria. The one or more weights define a similarity function. The similarity function is applied to the one or more similarity criteria corresponding to the one or more weights and to each of the items in order to select one or more items from the collection of items. The interface can comprise movable markers corresponding to similarity criteria. Locations of the movable markers can be used to weight similarity criteria when creating the similarity function.01-01-2009
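The weighted selection described in entry 20090006353 can be sketched as follows. This is only an illustration under stated assumptions, not the method claimed in the application: the similarity function is assumed to be a weighted average, each item is assumed to carry a score in [0, 1] per criterion, and the names `similarity`, `select_items`, and the sample criteria are hypothetical.

```python
from typing import Dict, List


def similarity(weights: Dict[str, float], scores: Dict[str, float]) -> float:
    """Weighted average of per-criterion scores (an assumed functional form)."""
    total = sum(weights.values())
    if total == 0:
        return 0.0
    return sum(w * scores.get(name, 0.0) for name, w in weights.items()) / total


def select_items(items: Dict[str, Dict[str, float]],
                 weights: Dict[str, float],
                 k: int = 1) -> List[str]:
    """Return the k item names with the highest weighted similarity."""
    ranked = sorted(items, key=lambda name: similarity(weights, items[name]),
                    reverse=True)
    return ranked[:k]


# Hypothetical data: two items scored against two similarity criteria.
items = {
    "song_a": {"tempo": 0.9, "genre": 0.2},
    "song_b": {"tempo": 0.4, "genre": 0.8},
}

# Moving a "marker" in the interface would correspond to changing a weight.
print(select_items(items, {"tempo": 3.0, "genre": 1.0}))
print(select_items(items, {"tempo": 1.0, "genre": 3.0}))
```

Changing the weights changes which item is selected, which is the effect the movable markers in the abstract are meant to give the user.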
20100036841SEARCHING APPARATUS AND SEARCHING METHOD - When an album search is started, message “For Album ?” which prompts the user to select album search is displayed. When the user has selected the album search, message “By Title ?” which prompts the user to select album title name search is displayed. When the user has selected the title name search, message “Keyword IN” which prompts the user to input a key word is displayed. When the user has input key word “P” for the search, the HD recording and reproducing device 02-11-2010
20100005092SEARCH RESULT SUB-TOPIC IDENTIFICATION SYSTEM AND METHOD - A method and apparatus for sub-topic identification from a search result that matches a query, said method including the steps of receiving a search result, extracting snippets from said search result that contain said query, truncating snippets on an instance of a boundary token, identifying phrases within said snippets that include the query, comparing all said phrases to determine optimal phrases, and presenting said optimal phrases. The apparatus for sub-topic identification from a search result that matches a query may include a dedicated server or a proxy for processing the search and sub-topic query.01-07-2010
20100005090STATISTICAL MEASURE AND CALIBRATION OF SEARCH CRITERIA WHERE ONE OR BOTH OF THE SEARCH CRITERIA AND DATABASE IS INCOMPLETE - Disclosed is a system for, and method of, identifying an entity representation. In some embodiments, search criteria are used to identify an entity representation in a universal database, and this identification is then used to identify a corresponding entity representation in a foreign database. Certain embodiments provide assurance, with a known probability of error, that the entity representation identified in the universal database is correct.01-07-2010
20100005087Facilitating collaborative searching using semantic contexts associated with information - A method and system include sending a search context to a collaboration server for use in determining another user of the collaboration server that is associated with similar subject matter, the search context including at least one first context definition such that the first context definition has one or more words selected from a first plurality of information belonging to a first search information set of the user. The first plurality of information is related to each other and such relation is represented by the first context definition of the search context. The system and method also include receiving an identification of a matching search context associated with the another user such that the matching search context contains at least one second context definition considered to match the first context definition. The matching second context definition includes one or more words selected from a second plurality of information belonging to a second search information set associated with the another user.01-07-2010
20100005086RESOURCE LOCATOR SUGGESTIONS FROM INPUT CHARACTER SEQUENCE - Methods, systems, and apparatus, including computer program products, in which an input method editor receives Roman character inputs, identifies keywords for candidate sets of a non-Roman character, and identifies an associated resource location. Upon identifying an associated resource location, the resource location is associated with the candidate set of non-Roman characters.01-07-2010
20100005084METHOD AND SYSTEM FOR PREFETCHING INTERNET CONTENT FOR VIDEO RECORDERS - A method and system for providing information related to content accessed by a user of an electronic device is provided. An implementation involves determining content of interest to the user for access via an electronic device; obtaining metadata for said content; prefetching information related to said metadata; upon detecting availability of further metadata for said content, pre-fetching additional information related to said further metadata; and upon access to the content by the user via the electronic device, selectively providing the prefetched information to the user.01-07-2010
20090319514METHOD AND SYSTEM FOR ASSIGNING SCORES - A system for implementing a scoring method, wherein the system includes at least a data analyzer configured to: determine a plurality of scoring intervals dependent upon the data to be analyzed; assign an integer score and a decimal score within the scoring intervals to each data to be analyzed, the score dependent upon a frequency of appearance; search a database for pairings of (scored element, decimal score); and generate an alert if the pairing is found in the database.12-24-2009
20090319506SYSTEM AND METHOD FOR EFFICIENTLY FINDING EMAIL SIMILARITY IN AN EMAIL REPOSITORY - Systems and methods for efficiently identifying emails with content similarity are disclosed. In one embodiment, a method comprises grouping a first set of a plurality of email documents with only common-type subsets of character sequences in a first searchable group, and grouping a second set of the plurality of email documents with one or more uncommon-type subsets of character sequences in a second searchable group. The method further comprises selectively searching either only one of or both of the first and second searchable groups, and identifying selected one or more email documents of the plurality of email documents that may contain content that is similar to the particular email document based on the searching.12-24-2009
20090313240METHOD OF EDITING RECIPIENT HEADER FIELDS BASED ON EMAIL CONTENT - A method is provided for flagging email messages sent to a user that contain inquiries directed to the user. The method comprises defining a natural language model for a set of inquiring phrasal forms in a first data store; defining a list of terms used to identify a first user having an email address managed by a host system in a second data store; accessing the host system to retrieve an email message sent to the email address; parsing a textual content of a body of the email message to generate one or more natural language tokens each corresponding to a text string in the body; accessing the first data store to identify each of the one or more natural language tokens that matches with an inquiring phrasal form; accessing the second data store to determine if any of the text strings corresponding to the one or more natural language tokens that match with an inquiring phrasal form includes a term from the list of terms; and flagging the email message if any of the text strings in the message body corresponding to the one or more natural language tokens that match with an inquiring phrasal form includes a term from the list of terms.12-17-2009
20090313234CONTENT SEARCHING APPARATUS - A content searching apparatus facilitating a search of a content which a user desires even where relativity between a content and a keyword change includes: a content table storing unit (12-17-2009
20090287682Social based search engine, system and method - A social based search apparatus, system and method. The apparatus, system and method may include receiving, from a user, at least one search keyword, comparing the search keyword to a plurality of keywords having one or more experts associated therewith, and producing a first search result including at least one expert and information associated with the at least one expert, wherein the at least one expert and the information are at least substantially related to the at least one search keyword. The present invention may additionally include applying at least one filter to the first search result, wherein the at least one filter includes a broadening of the at least one search keyword.11-19-2009
20090287677STREAMING MEDIA INSTANT ANSWER ON INTERNET SEARCH RESULT PAGE - A method and medium are provided for presentation of media to a user. In one embodiment of the invention, a search query is received containing descriptors of one or more aspects of media. A search is then conducted for sources of media generated in real time that satisfy the search query. Regular, algorithmic search results may be generated as well. An interface for the presentation of the sources of media is then integrated into a search results web page. Upon selecting a source of media in the interface, the source is presented to the user. Sources of media generated in real time may then be presented to a user without the need of navigating away from the search results page. Other embodiments of the invention are directed to receiving and indexing sources of streaming media based on their respective content to aid in satisfying search queries for streaming media.11-19-2009
20090282025METHOD FOR GENERATING A REPRESENTATION OF IMAGE CONTENT USING IMAGE SEARCH AND RETRIEVAL CRITERIA - A method for generating representations of visual characteristics of images is presented. The method includes receiving search criteria. The criteria include images to be searched, query images and expected result sets, and a retrieval metric. The method identifies objects within each image and selectively generates a representation of visual characteristics of each image using descriptors from an inventory of descriptors in accordance with the retrieval metric. The method compares the representations of the query image to representations of the images to be searched and determines a search result. The search result is compared to the expected result. If the results do not match, the generating, comparing and determining steps are re-executed with reselected descriptors based on the search result and the retrieval metric. The re-execution continues in a trial-and-error approach until acceptable search results are achieved. When achieved, the method encodes the process for generating the representations.11-12-2009
20090240680TECHNIQUES TO PERFORM RELATIVE RANKING FOR SEARCH RESULTS - Techniques to perform relative ranking for search results are described. An apparatus may include an enhanced search component operative to receive a search query and provide ranked search results responsive to the search query. The enhanced search component may comprise a resource search module operative to search for resources using multiple search terms from the search query, and output a set of resources having some or all of the search terms. The enhanced search component may also comprise a proximity generation module communicatively coupled to the resource search module, the proximity generation module operative to receive the set of resources, retrieve search term position information for each resource, and generate a proximity feature value based on the search term position information. The enhanced search component may further comprise a resource ranking module communicatively coupled to the resource search module and the proximity generation module, the resource ranking module to receive the proximity feature values, and rank the resources based in part on the proximity feature values. Other embodiments are described and claimed.09-24-2009
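A proximity feature of the kind described in entry 20090240680 might be computed as in the sketch below. The abstract does not give a formula, so the one used here is an assumption: the number of query terms divided by the length of the shortest token window that contains every term. The function names and the sample position lists are hypothetical.

```python
from collections import Counter
from typing import Dict, List


def proximity_feature(positions: Dict[str, List[int]]) -> float:
    """Assumed feature: (#terms) / (shortest window covering all terms).

    `positions` maps each query term to its token offsets in one resource.
    Returns 0.0 when any term is absent from the resource.
    """
    if not positions or any(not plist for plist in positions.values()):
        return 0.0
    events = sorted((pos, term) for term, plist in positions.items()
                    for pos in plist)
    need = len(positions)
    counts: Counter = Counter()
    covered = 0
    best = None
    left = 0
    for pos, term in events:
        counts[term] += 1
        if counts[term] == 1:
            covered += 1
        # Shrink the window from the left while it still covers every term.
        while covered == need:
            span = pos - events[left][0] + 1
            if best is None or span < best:
                best = span
            left_term = events[left][1]
            counts[left_term] -= 1
            if counts[left_term] == 0:
                covered -= 1
            left += 1
    return need / best


def rank(resources: Dict[str, Dict[str, List[int]]]) -> List[str]:
    """Order resource ids by descending proximity feature value."""
    return sorted(resources, key=lambda r: proximity_feature(resources[r]),
                  reverse=True)


# Hypothetical term positions for the query "red car" in two resources.
resources = {
    "d1": {"red": [1, 20], "car": [3, 40]},
    "d2": {"red": [1], "car": [50]},
}
print(rank(resources))
```

In a full ranker this value would be only one input among several, in line with the abstract's wording that resources are ranked "based in part" on the proximity feature values.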
20090240692HIERARCHICAL TAGS WITH COMMUNITY-BASED RATINGS - A method for generating and maintaining hierarchical tags with community-based ratings is provided. Tags for media streams are organized into a hierarchical format. Users may select tags from the hierarchical tag database that describes a particular multimedia content. If the user is unable to locate a desired tag, the user may submit a new tag. Upon submission of the new tag, a librarian approves the tag before storing and placing the tag in the hierarchical tag database. Users are also able to rate the quality of the association between the tag and the multimedia content. If a tag is rated low, the tag may be removed from the hierarchical tag database. If the tag is rated highly, display of the tag in a list of tags becomes more prominent.09-24-2009
20090240691RECORDING MEDIUM RECORDING OBJECT CONTENTS SEARCH SUPPORT PROGRAM, OBJECT CONTENTS SEARCH SUPPORT METHOD, AND OBJECT CONTENTS SEARCH SUPPORT APPARATUS - An object contents search support apparatus supporting a user to search for desired object contents information, the object contents search support apparatus including an operating part, an information collecting part collecting composite operation information including all of contents information, an overall operation history database recording the collected composite operation information, a matching part matching historical records and extracting at least one item of contents information, a display part generating display information and displaying the generated display information, a feedback part accepting the operation input, holding the object contents information, comparing the composite operation information included in the operation input after displaying the generated object contents information, and generating effective contents information from a comparison result, and a verifying part accepting the effective contents information and the composite operation information, extracting effective operation information, and updating the certainty determination parameter.09-24-2009
20090240689System, method, and software for researching, analyzing, and comparing expert witnesses - The present inventors devised, among other things, system, methods, and interfaces for researching, evaluating, and comparing expert witnesses. One exemplary system includes interfaces that facilitate users entering queries regarding experts based on name or subject matter and filtering search results based on damage awards, case types, attorneys, clients and date range. The system also enables side-by-side comparisons of the cumulative litigation history for multiple experts, and provides an expert challenge report that indicates whether an expert has been challenged in past litigation, the result of any challenges, the presiding judges in any challenges, and the text of the challenged testimony.09-24-2009
20090240688Information processing apparatus, information processing method, and program therefor - An information processing apparatus displays image data and plays back music data such that background music (BGM) is applied to a collection of content such as photographic data, the BGM being reminiscent of the time when the photographic data was acquired. When a command to display a scrapbook or similar plurality of image data is issued, a search unit searches for music metadata that is related to the image metadata of the image data. A display controller then controls the display of the specified plurality of image data, while in parallel, a playback controller controls the playback of the music data corresponding to the music metadata found by the search unit.09-24-2009
20090240684Image Content Categorization Database - Disclosed herein are databases that contain image context categorizations, those categorizations identifying the context of an image based on a computed fingerprint. Also disclosed herein are applications of such a database, including a viewing application that blocks the rendering of images with undesirable content noted by a content categorization, and a scanning application that locates images in a corpus or repository of images having certain content noted by content categorization, such as unlawful images. Such blocking may be through an obfuscation technique, such as blurring, distorting an image, or may be through a replacement of image material. A categorization tool may include obfuscation with an aperture tool for clarifying portions of blocked images. Detailed information on various example embodiments of the inventions is provided in the Detailed Description below, and the inventions are defined by the appended claims.09-24-2009
20090240683Presenting query suggestions based upon content items - Systems and methods for determining query suggestions based upon content items are provided. Content items may include, without limitation, a search query result item, e.g., displayed on a search results web page, an advertisement, and a query-based query suggestion. Once determined, content-item-based query suggestions are presented to the user. If desired, such presentation may be dynamically exposed in response to a user action, for instance, in response to a user hovering over a portion of the associated content item for at least a predetermined period of time.09-24-2009
20090240682GRAPH SEARCH SYSTEM AND METHOD FOR QUERYING LOOSELY INTEGRATED DATA - A system, method and computer program product for executing a query on linked data sources. Embodiments of the invention generate an instance graph expressing relationships between objects in the linked data sources and receive a query including at least first and second search terms. The first search term is then executed on the instance graph and a summary graph is generated using the results of the executing step. A second search term is then executed on the summary graph.09-24-2009
20090240681MEDICAL RECORDS NETWORK - A medical records network is configured for communicating a plurality of electronic medical records over authenticated peer-to-peer connections among a plurality of client computer systems. The medical records network includes a first client computer system running a first agent application for generating an authentication request and a record request query to request access to one or more medical records stored on one or more other client computer systems. A proxy computer system receives and processes the authentication request and determines whether the first client computer system should be granted access to the medical records network. If the first client computer system is authenticated, the proxy computer system processes the record request query and forwards a proxy query to those client computer systems in a specific geographic region. The client computer systems receiving the record request query respond indicating whether they have access to the requested record(s). If so, the proxy computer system facilitates an encrypted peer-to-peer communication channel between the first client computer system and the client computer system(s) responding affirmatively in order to communicate the record(s) to the first client computer system.09-24-2009
20090240678PURPOSING PERSISTENT DATA THROUGH HARDWARE METADATA TAGGING - Storage devices can maintain metadata on a per-block basis, enabling the storage device, the file system, or other higher-level software to store and obtain information about individual blocks of data. A handshake between the storage device and a computing device can include an exchange of feature tables, whereby a commonly supported set of features and attributes can be selected and agreed upon. Such features and attributes can include access pattern specification in the per-block metadata, frequency of access or importance designations and specifications of the longevity of temporary data. The per-block metadata can either be provided by an application or the file system, or it can be generated by the storage device itself. Likewise, per-block metadata can be utilized by the storage device, either on its own or at the behest of an application or the file system, or it can be utilized directly by the application or file system.09-24-2009
20090240676Computer Method and Apparatus for Using Social Information to Guide Display of Search Results and Other Information - A computer implemented method and system presents search results or other data generated in response to a request by a user. The search results are formed of one or more items. The invention system associates each item with a respective person. A screen view is generated showing a hierarchy of people including the people corresponding to the items of the search results. Indicated in the screen view is the extent of connectedness between the user and the people corresponding to the items of the search results. The invention system displays indications of the items of the search results in the screen view in a manner illustrating the items in context of the shown hierarchy. This enables a user to (i) easily and readily assign respective confidence levels to items of the search results, and to (ii) determine relationships among people without explicitly requesting the information from others.09-24-2009
20080243829SPECTRAL CLUSTERING USING SEQUENTIAL SHRINKAGE OPTIMIZATION - A clustering system initially applies an eigenvalue decomposition solver for a number of iterations to a clustering objective function. The eigenvalue decomposition solver generates an eigenvector that is an initial approximation of a solution to the objective function. The clustering system fixes the eigenvector values for the identified objects. The clustering system then reformulates the objective function to focus on the objects whose clusters have not yet been determined. The clustering system then applies an eigenvalue decomposition solver for a number of iterations to the reformulated objective function to generate new values for the eigenvector for the objects whose clusters have not yet been determined. The clustering system then repeats the process of identifying objects, reformulating the objective function, and applying an eigenvalue decomposition solver for a number of iterations until a termination criterion is satisfied.10-02-2008
20080243839METHODS, SYSTEMS, AND COMPUTER PROGRAM PRODUCTS FOR DETECTING THE PRESENCE OF AN INSTALLATION ON A DATA PROCESSING SYSTEM BASED ON THE RELATIVE STORAGE LOCATIONS OF ONE OR MORE FILES - The presence of an installation on a data processing system may be detected by providing a signature that includes m files having paths associated therewith, respectively. A number n files on the data processing system are determined that match files in the signature and a files found ratio given by n/m is determined. A transformation is applied to the signature by replacing at least a portion of at least one of the paths with a new path. Then, a distance is determined between the n files on the data processing system and the m signature files. The distance corresponds to a sum of a number of path segments associated with the m signature files that cannot be matched to a corresponding path segment associated with files on the data processing system. The presence of the installation on the data processing system is determined based on the files found ratio and the distance.10-02-2008
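The files-found ratio n/m and the path-segment distance from entry 20080243839 can be illustrated with the sketch below. Several simplifications are assumptions rather than part of the abstract: files are matched by base name only, directory segments are compared from the end of each path, signature files with no match add nothing to the distance, the path-transformation step is left out, and the thresholds `min_ratio` and `max_distance` are hypothetical.

```python
from pathlib import PurePosixPath
from typing import Dict, List, Sequence, Tuple


def unmatched_segments(sig_path: str, sys_path: str) -> int:
    """Count signature directory segments with no matching trailing segment."""
    sig_dirs = PurePosixPath(sig_path).parent.parts
    sys_dirs = PurePosixPath(sys_path).parent.parts
    matched = 0
    for a, b in zip(reversed(sig_dirs), reversed(sys_dirs)):
        if a != b:
            break
        matched += 1
    return len(sig_dirs) - matched


def detect(signature: Sequence[str], system_files: Sequence[str],
           min_ratio: float = 0.5, max_distance: int = 2
           ) -> Tuple[float, int, bool]:
    """Return (files-found ratio n/m, distance, installation present?)."""
    by_name: Dict[str, List[str]] = {}
    for path in system_files:
        by_name.setdefault(PurePosixPath(path).name, []).append(path)

    n = 0
    distance = 0
    for sig in signature:
        candidates = by_name.get(PurePosixPath(sig).name, [])
        if not candidates:
            continue  # assumption: unmatched files only lower the ratio
        n += 1
        distance += min(unmatched_segments(sig, c) for c in candidates)

    ratio = n / len(signature) if signature else 0.0
    return ratio, distance, (ratio >= min_ratio and distance <= max_distance)


# Hypothetical signature (m = 3) and file listing of the inspected system.
signature = ["app/bin/run.sh", "app/lib/core.so", "app/etc/app.conf"]
system_files = ["/opt/app/bin/run.sh", "/opt/app/lib64/core.so",
                "/home/u/notes.txt"]
print(detect(signature, system_files))
```

Here two of the three signature files are found (ratio 2/3), and the `lib` versus `lib64` mismatch contributes two unmatched segments to the distance.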
20080243835PROGRAM, METHOD AND APPARATUS FOR WEB PAGE SEARCH - A web page searching method searches web pages publicized on a network by web servers. A computer performs the method by: 10-02-2008
20080243827Query generation using enviroment configuration - A query for a help system includes data about a user system and a task that the user is attempting. The query may be used by a search engine to generate relevant results to aid the user. The user system data may include configuration data about hardware and software. The task data may be derived from the current state of a device, or from operational history that may be developed from a single user or a group of users. The query may have a mechanism to weight various keywords or components of the query and a feedback system may adjust the weights for future queries.10-02-2008
20080243823System and method for automatically generating information within an eletronic document - A method for automatically generating target information within an electronic document including the steps of: retrieving term-based identifying information from the electronic document that specifies the target information to be generated; accessing rules associated with generation of the target information based on the retrieved term-based identifying information; analyzing the identifying information and the rules to identify a type of target information to be generated and a formula that uses underlying data to generate the target information; automatically generating data source instructions based on the type of target information to be generated and the formula; and automatically processing the data source instructions to generate the target information within the electronic document.10-02-2008
20080243811SYSTEM AND METHOD FOR RANKED KEYWORD SEARCH ON GRAPHS - Arrangements and methods for providing for the efficient implementation of ranked keyword searches on graph-structured data. Since it is difficult to directly build indexes for general schemaless graphs, conventional techniques rely heavily on graph traversal at run time. The previous lack of more knowledge about graphs also resulted in great difficulties in applying pruning techniques. To address these problems, there is introduced herein a new scoring function while the block is used as an intermediate access level; the result is an opportunity to create sophisticated indexes for keyword search. Also proposed herein is a cost-balanced expansion algorithm to conduct a backward search, which provides a good theoretical guarantee in terms of the search cost.10-02-2008
20080243818CONTENT-BASED ACCOUNTING METHOD IMPLEMENTED IN IMAGE REPRODUCTION DEVICES - A content-based accounting method is implemented in a management section for a copier, scanner, printer or multifunction device (referred to as MFP), or on a networked server accessible by the copier, scanner, printer or MFP. When copying, scanning or printing a document, the management section automatically extracts content information from the documents being copied, scanned or printed, groups the documents based on the content, and updates an accounting database. The accounting database contains user accounts that store usage information according to content groups. For copied and scanned documents, textual content is extracted from the document image using OCR techniques. For printed documents, textual information is extracted from the digital data used to print the document.10-02-2008
20080243807NOTIFICATION METHOD FOR A DYNAMIC DOCUMENT SYSTEM - A dynamic document template contains a set of queries. Each query may include a query scope. The query scope may refer to a content of a source document that is maintained in a document collection. A content rule is applied to monitor the template for a change. A notification event is triggered when a change to the document collection results in an invalid query scope in the template. A notification event may also be triggered when a change to the document collection results in valid query but the template needs to be refreshed. An additional notification event may be triggered if the template is modified so that the resulting template is either invalid or needs to be refreshed with different content from the collection.10-02-2008
20080243824System and method for associating a geographic location with an internet protocol address - Systems and methods for associating a geographic location with an IP address are disclosed. Generally, a plurality of localized search queries of search queries received at an Internet search engine are determined, where each of the plurality of localized search queries is associated with a location. A geo tag is associated with each of the plurality of localized search queries and a subset of the plurality of localized search queries that are associated with a first IP address is identified. The subset of the plurality of localized search queries is clustered into a spatial cluster including localized search queries associated with geo tags located within a defined distance of a geo tag associated with at least one other localized search query of the cluster. A geographic location associated with a geographic center of the cluster is then associated with the first IP address.10-02-2008
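The spatial clustering step in entry 20080243824 can be sketched as below. This is an illustration under assumptions, not the claimed method: geo tags are taken to be (latitude, longitude) pairs, distance is great-circle (haversine) distance, the largest cluster is the one used, and its center is approximated by the mean latitude and longitude. The function names and the 25 km threshold are hypothetical.

```python
import math
from typing import List, Optional, Tuple

Point = Tuple[float, float]  # (latitude, longitude) in degrees


def haversine_km(a: Point, b: Point) -> float:
    """Great-circle distance between two points, in kilometres."""
    lat1, lon1, lat2, lon2 = map(math.radians, (a[0], a[1], b[0], b[1]))
    h = (math.sin((lat2 - lat1) / 2) ** 2
         + math.cos(lat1) * math.cos(lat2) * math.sin((lon2 - lon1) / 2) ** 2)
    return 2 * 6371.0 * math.asin(math.sqrt(h))


def clusters(points: List[Point], max_km: float) -> List[List[Point]]:
    """Group points so each is within max_km of another point in its group."""
    unvisited = set(range(len(points)))
    result = []
    while unvisited:
        stack = [unvisited.pop()]
        members = []
        while stack:
            i = stack.pop()
            members.append(i)
            near = [j for j in unvisited
                    if haversine_km(points[i], points[j]) <= max_km]
            for j in near:
                unvisited.remove(j)
            stack.extend(near)
        result.append([points[i] for i in members])
    return result


def locate(points: List[Point], max_km: float = 25.0) -> Optional[Point]:
    """Approximate center of the largest cluster (mean of lat/lon)."""
    if not points:
        return None
    biggest = max(clusters(points, max_km), key=len)
    lat = sum(p[0] for p in biggest) / len(biggest)
    lon = sum(p[1] for p in biggest) / len(biggest)
    return (lat, lon)


# Hypothetical geo tags of localized queries seen from one IP address:
# three around San Francisco Bay and one outlier in New York.
tags = [(37.77, -122.42), (37.80, -122.27), (37.69, -122.47), (40.71, -74.01)]
print(locate(tags))
```

The outlier ends up in its own cluster and does not pull the estimated location away from the three nearby tags.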
20080243822System and method for associating a geographic location with an Internet protocol address - Systems and methods for associating a geographic location with an IP address are disclosed. Generally, an IP address associated with each of a plurality of browser cookies is determined, where each of the plurality of browser cookies indicate a geographic location such as a home address or business address of a user. A geo tag is associated with each of the plurality of browser cookies and a subset of the plurality of browser cookies including browser cookies associated with a first IP address is identified. The subset of the plurality of browser cookies is clustered into a spatial cluster including browser cookies associated with geo tags located within a defined distance of a geo tag of at least one other browser cookie of the cluster. A geographic location associated with a geographic center of the cluster is then associated with the first IP address.10-02-2008
20080243816PROCESSES FOR CALCULATING ITEM DISTANCES AND PERFORMING ITEM CLUSTERING - Computer-implemented processes are disclosed for clustering items and improving the utility of item recommendations. One process involves applying a clustering algorithm to a user's collection of items. Information about the resulting clusters is then used to select items to use as recommendation sources. Another process involves displaying the clusters of items to the user via a collection management interface that enables the user to attach cluster-level metadata, such as by rating or tagging entire clusters of items. The resulting metadata may be used to improve the recommendations generated by a recommendation engine. Another process involves forming clusters of items in which a user has indicated a lack of interest, and using these clusters to filter the output of a recommendation engine. Yet another process involves applying a clustering algorithm to the output of a recommendation engine to arrange the recommended items into cluster-based categories for presentation to the user.10-02-2008
20080243815CLUSTER-BASED ASSESSMENT OF USER INTERESTS - Computer-implemented processes are disclosed for clustering items and improving the utility of item recommendations. One process involves applying a clustering algorithm to a user's collection of items. Information about the resulting clusters is then used to select items to use as recommendation sources. Another process involves displaying the clusters of items to the user via a collection management interface that enables the user to attach cluster-level metadata, such as by rating or tagging entire clusters of items. The resulting metadata may be used to improve the recommendations generated by a recommendation engine. Another process involves forming clusters of items in which a user has indicated a lack of interest, and using these clusters to filter the output of a recommendation engine. Yet another process involves applying a clustering algorithm to the output of a recommendation engine to arrange the recommended items into cluster-based categories for presentation to the user.10-02-2008
20080243814Search Techniques for Page-Based Document Layouts - Systems, methods, and/or techniques (“tools”) for improved search techniques for page-based document layouts are described herein. The tools may analyze markup elements defined for pages within source documents, and may determine whether the markup elements for the page may include at least part of a search string.10-02-2008
20080243806Accessing information on portable cellular electronic devices - A method, performed by software executing on the processor of a portable cellular electronic device, which allows for the retrieval of personal, reference, and remote information with a minimum of operator interaction. A user interface is utilized to search and act on such information. Furthermore, additional features designed to assist the user of such devices are proposed.10-02-2008
20090228476SYSTEMS, METHODS, AND SOFTWARE FOR CREATING AND IMPLEMENTING AN INTELLECTUAL PROPERTY RELATIONSHIP WAREHOUSE AND MONITOR - An information retrieval system gathers intellectual property metadata based on a user query and a predetermined set of intellectual property databases, generating and rendering reports regarding intellectual property activities based on the user query.09-10-2009
20090216752SEARCH ENGINE, SEARCH SYSTEM, SEARCH METHOD, AND SEARCH PROGRAM PRODUCT - A search system can include a server, a token assignment unit for assigning types of tokens based on different kinds of character string analysis methods, an index generating unit for generating an index list that associates the tokens assigned with the token assignment unit, a type identification value for identifying a type of the character string analysis, and information, a search unit that receives a search word for referencing the information to combine types of search tokens generated from the search word to generate a single search command for parallel inquiry of the information to search for the information, and a search result generating unit for displaying information extracted in relation to the search word through parallel inquiry with the search unit and search tokens so as to identify the tokens.08-27-2009
20080294619SYSTEM AND METHOD FOR AUTOMATIC GENERATION OF SEARCH SUGGESTIONS BASED ON RECENT OPERATOR BEHAVIOR - A method, system and computer program product for enhancing the usability of web browsers by analyzing the recent behavior of an operator while executing a search pattern on a computer network. A search history and indexing datastore is defined and associated with the web document parser. The web document parser parses through each returned web page for significant terms that may be of later importance to the user. These terms are then forwarded to the datastore and indexed along with the search term to later provide a historical guide to identify the user's areas/topics of interest. When a search term is entered within the web browser, the search term is compared against the index of terms for similar terms. The similar terms found are ranked according to closeness to the entered search term, and the ranked terms are output to the user for possible selection in lieu of the search term.11-27-2008
20090187564Processor for Fast Phrase Searching - Phrases in a corpus of documents including stopwords are found using a data processor arranged to execute phrase queries. Memory stores an index structure which maps entries in the index structure to documents in the corpus. Entries in the index structure represent words and other entries represent stopwords found in the corpus coalesced with prefixes of respective adjacent words adjacent to the stopwords. The prefixes comprise one or more leading characters of the respective adjacent words. A query processor forms a modified query by substituting a stopword with a search token representing the stopword coalesced with a prefix of the next word in the query. The processor executes the modified query. Also, index structures including coalesced stopwords are created and maintained.07-23-2009
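One way to picture the stopword coalescing described in the entry above is the sketch below. The stopword list, the two-character prefix length, the underscore separator, and all function names are assumptions chosen for illustration; the abstract does not specify them.

```python
from collections import defaultdict

STOPWORDS = {"the", "of", "a", "to", "in"}  # assumed stopword list
PREFIX_LEN = 2                              # assumed prefix length

def coalesce(text):
    """Replace each stopword with a token made of the stopword plus a prefix
    of the following word; other words pass through unchanged."""
    words = text.lower().split()
    out = []
    for i, w in enumerate(words):
        if w in STOPWORDS and i + 1 < len(words):
            out.append(w + "_" + words[i + 1][:PREFIX_LEN])
        else:
            out.append(w)
    return out

def build_index(docs):
    """Positional index: token -> doc_id -> list of token positions."""
    index = defaultdict(lambda: defaultdict(list))
    for doc_id, text in docs.items():
        for pos, tok in enumerate(coalesce(text)):
            index[tok][doc_id].append(pos)
    return index

def phrase_search(index, phrase):
    """Rewrite the query with the same coalescing rule, then require the
    tokens to appear at consecutive positions in a document."""
    toks = coalesce(phrase)
    if not toks:
        return set()
    hits = set()
    for doc_id, starts in index.get(toks[0], {}).items():
        for s in starts:
            if all(s + k in index.get(t, {}).get(doc_id, ())
                   for k, t in enumerate(toks)):
                hits.add(doc_id)
                break
    return hits
```

Because the token for "the" now carries part of the next word, a query such as "the king" only touches the postings for `the_ki` rather than every occurrence of "the". A query ending in a stopword is not handled by this sketch.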
20090187562SEARCH METHOD - A search method for causing a computer to execute the search method of searching for and retrieving, when a search formula to document data having a hierarchy structure whose elements are delimited by an element identifier is obtained, data corresponding to the search formula from the document data, stores, when the search formula is obtained, the search formula to a memory device; determines, when the data corresponding to the search formula is searched for and retrieved from the document data, whether or not a hierarchy management is necessary to the search formula based on the search formula; and searches for and retrieves, when the hierarchy management is not necessary to the search formula, the document data corresponding to the search formula without executing the hierarchy management.07-23-2009
20090164459CONTIGUOUS LOCATION-BASED USER NETWORKS - A system and method are provided for creating location-based user networks. In general, a proximity group including a number of users is identified. Each user in the proximity group is within a proximate area of at least one other user in the proximity group and has an area of interest. The areas of interest of the users in the proximity group are aggregated to provide an aggregate area of interest for the proximity group. Other users within the aggregate area of interest are identified as neighbors of each of the users in the proximity group. Once the neighbors are identified, each of the users in the proximity group may use the neighbors as members, or potential members, for a user network.06-25-2009
20090144275SYSTEM AND METHOD FOR GENERAL SEARCH PARAMETERS HAVING QUANTIZED RELEVANCE VALUES THAT ARE ASSOCIATED WITH A USER - The system and method comprises enhancement of results for a search engine, wherein the results from the search engine are refined or reorganized, based upon information from an identified secondary source. The results obtained using a conventional search are compared against the identified secondary source, e.g. a ratings service, and are filtered and/or sorted appropriately. In some embodiments, identification of the secondary source, such as a ratings service comprising information which may supplement the subject of a search query, is based upon information entered by the user. In alternate embodiments, the secondary source is associated with a user, as part of general user-specified search parameters, wherein one or more parameters are consulted automatically for searches for appropriate subject matter.06-04-2009
20090144258SYSTEMS AND METHODS FOR QUERY PROCESSING - Embodiments relate to systems and methods for online query processing, in which a SQL or other query server can generate a record of results served to clients. The distribution record of results, referred to as a distribution map, can record the identity of properties of database entries or other content that has been distributed to individual clients. When a client transmits a query whose results include properties that have already been served to that client, the re-transmission of that information can be suppressed leading to improved communications efficiency. A notification function can be provided whereby all users automatically receive updates to the properties or content they have already received, when those data components have been updated in the underlying database. The delivered content can relate to personal contact lists, media play lists, or other information displayed in an application or Web service.06-04-2009
20090138467DATA REDUCTION FOR OPTIMIZING AND TESTING - A reasonably-sized testing database instance can be efficiently replicated and maintained for a very large production database while retaining the characteristics and cross-sectional data. The performance characteristics are maintained in order to provide for proper testing of the production database for various application programs. Statistics on the type of data distribution for the customer data are obtained, allowing for parameters to be determined which can be used to store data only near the endpoints of the distribution (and/or at other key locations). In this way, a substantial amount of data skew is retained in a much smaller instance of the production database, allowing for easier performance testing, upgrade testing, etc.05-28-2009
20080319991System for Searching Network Accessible Data Sets - A Sales-Chip Relevance Alert extends the capability of the regenerating search engine by allowing users to become informed about relevant information that has changed to a second query or search request that is run after a first query or search request. The user can narrowly and specifically describe with a high degree of precision what is relevant and meaningful to the user and how the user is notified. The Alert provides a timely notice to the user about the relevant information change through user-defined means and user defined messages.12-25-2008
20080319986PROCESS OF TIME-SPACE COLLABORATIVE FILTERING OF INFORMATION - The invention is a process for collaborative filtering of information called Time-Space Filtering (TSIF). The invention is used in the fields of information filtering and publishing and is particularly useful in the field of providing web-based information, e.g. electronic newspapers. TSIF is a process of filtering and ranking the relevance of an article's content to specific readers, taking into account the time dimension of information as well as the factors traditionally considered by content-based or collaborative filtering.12-25-2008
20080319988USE OF FIXED FIELD ARRAY FOR DOCUMENT RANK DATA - A computer based search server can comprise an archive including a fixed-width field array storing numeric rank data associated with documents. The search server can provide search results using the numeric rank data obtained from the fixed-width field array.12-25-2008
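A fixed-width rank array of the kind mentioned in the entry above might look like the following sketch. The 16-bit field width, the use of the document ID as the array index, and the `RankArray` class are assumptions for illustration only.

```python
from array import array

class RankArray:
    """Stores one fixed-width unsigned integer rank per document, indexed by doc ID."""

    def __init__(self, n_docs):
        # Type code "H" is an unsigned short (at least 16 bits), so every
        # document occupies the same number of bytes.
        self._ranks = array("H", [0]) * n_docs

    def set(self, doc_id, rank):
        self._ranks[doc_id] = rank

    def get(self, doc_id):
        return self._ranks[doc_id]

    def top(self, doc_ids, k):
        """Order candidate documents by stored rank, highest first."""
        return sorted(doc_ids, key=self.get, reverse=True)[:k]
```

With fixed-width fields, the rank of document `i` sits at a predictable offset, so lookup needs no per-document parsing.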
20080319989APPARATUS AND METHOD OF SEARCHING DOCUMENT DATA - An apparatus and method of searching an electronic document are disclosed. A document that is assumed to contain a search symbol set is searched. The search symbol set is a symbol set being extracted from a plurality of symbols representing a search request when the symbol set being extracted satisfies a predetermined condition.12-25-2008
20080319982Method and Apparatus for Manipulating Data Files - A method of encoding a data file stored in a storage unit, said method comprising the steps of:—extracting (12-25-2008
20080319987SYSTEM, METHOD AND PROGRAM FOR CREATING INDEX FOR DATABASE - An entire document set is decomposed into a sum of subsets each having no common part. Next, a set of keywords appearing in each of the subsets divided in the aforementioned manner is categorized into groups on the basis of a remainder resulting from dividing a hash value of each of the keywords by a certain fixed integer value. Thereby, index files for the respective groups are created. Among the index files prepared for the respective subsets of the document in the aforementioned manner, ones each having the same group number are merged. Thereby, integrated index files corresponding to the respective individual group numbers are created. Such index files, however, exist as many as the number of group numbers, and have not yet become an index corresponding to the entire document set. In this respect, the index files existing as many as the number of group numbers are next merged into one, and thereby, an index file corresponding to the entire document set is created.12-25-2008
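The partition-and-merge sequence in the entry above can be sketched as follows. This is an in-memory approximation under stated assumptions: CRC32 stands in for the unspecified hash function, dictionaries stand in for index files, and the function names and the default of four groups are invented for the example.

```python
import zlib
from collections import defaultdict

def group_of(keyword, n_groups):
    """Group number = hash of the keyword modulo a fixed integer."""
    return zlib.crc32(keyword.encode("utf-8")) % n_groups

def index_subset(subset, n_groups):
    """Build one small index per group for a single disjoint subset of documents.
    subset maps doc_id -> text; the result is a list of keyword -> set of doc_ids."""
    groups = [defaultdict(set) for _ in range(n_groups)]
    for doc_id, text in subset.items():
        for kw in set(text.lower().split()):
            groups[group_of(kw, n_groups)][kw].add(doc_id)
    return groups

def merge_by_group(per_subset, n_groups):
    """Merge the indexes that share a group number across all subsets."""
    merged = [defaultdict(set) for _ in range(n_groups)]
    for groups in per_subset:
        for g in range(n_groups):
            for kw, ids in groups[g].items():
                merged[g][kw] |= ids
    return merged

def merge_all(merged):
    """Combine the per-group indexes into one; a keyword maps to exactly one
    group, so the groups never collide on a key."""
    full = {}
    for group in merged:
        full.update(group)
    return full

def build_index(subsets, n_groups=4):
    per_subset = [index_subset(s, n_groups) for s in subsets]
    return merge_all(merge_by_group(per_subset, n_groups))
```

Each call to `index_subset` is independent of the others, and each group can be merged independently as well, which is presumably what makes the scheme attractive for large document sets.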
20080319983METHOD AND APPARATUS FOR IDENTIFYING AND RESOLVING CONFLICTING DATA RECORDS - A method and apparatus for identifying and resolving conflicting data records are disclosed. The individual data fields of a master record are compared with the corresponding data fields of each source record in a particular data set. For each, one of various matching algorithms is used to assign a field matching score indicating the extent to which the data in the two data fields matches. The particular algorithm used to determine the extent of a match and to assign the corresponding score is dependent on the type of the data field. Once all of the data fields for a particular source record have been analyzed, the sum of the field matching scores is tallied to determine an overall record matching score for that particular source record.12-25-2008
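The per-field scoring and summation described in the entry above could be sketched like this. The two field types, the choice of `difflib.SequenceMatcher` for text fields, and the sample field names are assumptions; the abstract only says that the algorithm depends on the type of the data field.

```python
from difflib import SequenceMatcher

def match_text(a, b):
    """Similarity ratio in [0, 1] for free-text fields such as names."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()

def match_exact(a, b):
    """All-or-nothing score for fields such as dates or identifiers."""
    return 1.0 if a == b else 0.0

# Field type -> matching algorithm (hypothetical mapping).
MATCHERS = {"text": match_text, "exact": match_exact}

def record_score(master, source, field_types):
    """Sum the field matching scores to get the overall record matching score."""
    return sum(MATCHERS[ftype](master[field], source[field])
               for field, ftype in field_types.items())

def rank_sources(master, sources, field_types):
    """Score every source record against the master, best match first."""
    scored = [(record_score(master, s, field_types), s) for s in sources]
    return sorted(scored, key=lambda pair: pair[0], reverse=True)
```

For example, with `field_types = {"name": "text", "dob": "exact"}`, a source record identical to the master scores 2.0, and any record that differs in either field scores less.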
20080319978HYBRID SYSTEM FOR NAMED ENTITY RESOLUTION - A method for named entity resolution includes parsing an input text string to identify a context in which an identified named entity of the input text string is used. The identified context is compared with at least one stored context in which the named entity in the stored context is associated with a class of named entity, the named entity class being selected from a plurality of classes, at least one of the plurality of classes corresponding to a metonymic use of a respective named entity. A named entity class is assigned to the identified named entity from the plurality of named entity classes, based on at least one of the identified context and the comparison.12-25-2008
20080319976IDENTIFICATION AND USE OF WEB SEARCHER EXPERTISE - A search expertise level system and method for determining a search expertise level of a search engine user and then using that information to improve the searcher's experience. The search expertise level system and method identifies the search expertise level of the searcher based on query behavior, post-query browsing behavior, and other behaviors of the searcher. One simple and important behavior that indicates a skilled searcher is the use of advanced query syntax and operators in the query. Once the search expertise level of a searcher is known, the search engine user interface can be modified and tailored to the needs of both skilled and novice searchers. The search expertise level also can be used to rank search results, such that search results for a novice searcher are ranked differently than those for a skilled searcher. The search expertise level also can be used in advertising and marketing.12-25-2008
20080319979INFORMATION PROCESSING APPARATUS AND COMPUTER-READABLE MEDIUM - A computer-readable medium stores a program causing a computer to execute information processing. The information processing includes: reading user information of a user who requests to provide first document information; generating second document information, based on (i) concealment region information associated with the first document information and (ii) the read user information relating to the user; and outputting the second document information. The concealment region information includes (i) information, for specifying a region that is to be concealed when the associated first document information is provided and (ii) concealment condition information used to determine as to whether or not the concealment region is concealed. In the second document information, a region in the first document information that is to be concealed from the user is concealed.12-25-2008
20080319977System for providing enhance search results on the internet - Apparatus and a method that control the number of sponsored links that are presented in search result webpages in response to a search string presented by a user, based on the past behavior of users in response to presentations of results for the same search string. When this past behavior indicates that users tend to select sponsored links, the number of sponsored links in the results sent to the user is increased, thereby providing a more responsive presentation. In addition to sending results with a controlled number of sponsored links, the apparatus and the method account for the response of the user so as to include the user's behavior in the information about past behavior of users.12-25-2008
20080319973RECOMMENDING CONTENT USING DISCRIMINATIVELY TRAINED DOCUMENT SIMILARITY - A generalized discriminative training framework for reconciling the training and evaluation objectives for document similarity is provided. Prior information about document relations and non-relations is used to discriminatively train an ensemble of document similarity classification models. The result is a model set that can be used to compute similarity between seen documents in the training sets and new documents. The measure of similarity forms the basis of recommending documents to a user as well as being able to obtain metadata information such as keywords and tags for new documents not having such information.12-25-2008
20080313167System And Method For Intelligently Indexing Internet Resources - The present invention is a system and method for building an intelligent index of Internet web pages. A populator retrieves a web page, divides words within the web page into categories, and determines a relevancy rating for the words in each category, the relevancy rating based on the number of appearances of the word in the corresponding category. The populator then weights each relevancy rating by a weighting factor corresponding to the category, and sums the weighted relevancy ratings to determine a web page relevancy rating for each unique word. The categories include a header, hidden words, non-sentences, repetitive words, non-nouns, and nouns. Each category is further subdivided into subcategories of commonly used words and uncommonly used words. A relevancy rating is determined for each subcategory.12-18-2008
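The weighted sum in the entry above can be illustrated with a short sketch. The category names follow the abstract, but the weighting factors, the use of raw appearance counts as the relevancy rating, and the function name are assumptions; the subdivision into common and uncommon words is omitted.

```python
from collections import Counter

# Hypothetical weighting factors per category.
WEIGHTS = {"header": 3.0, "noun": 1.0, "hidden": 0.1}

def page_relevancy(words_by_category, weights=WEIGHTS):
    """For each unique word, count its appearances in every category, weight
    each count by the category's factor, and sum across categories."""
    totals = Counter()
    for category, words in words_by_category.items():
        for word, count in Counter(words).items():
            totals[word] += weights[category] * count
    return dict(totals)
```

With these assumed weights, a word that appears once in the header and twice among the nouns gets 3.0 × 1 + 1.0 × 2 = 5.0, while a word that appears only in hidden text contributes very little.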
20080313169System and method for automated selection and distribution of media content - A media management system for and method of increasing value of media content are provided wherein content attributes associated with media content are stored, a target entry list is generated, and a resultant scenario calculated with an associated financial figure.12-18-2008
20080306935USING JOINT COMMUNICATION AND SEARCH DATA - Conventionally, there are communities of individuals who perform Internet searches and communities of individuals who utilize Internet communications. While there is commonly a large amount of overlap between the two communities, there is little interaction between the two communities. Internet searches can be used to recommend interesting people to a user. Furthermore, Internet communications can be used to recommend content that is likely to be of interest to the user. In addition, previously engaged communications or searches can be used to disambiguate terms in a subsequent search.12-11-2008
20080306936Method and apparatus for compiling user preferences for digital content streamed to a mobile handset - A method and apparatus for compiling user preferences and providing access to preferred digital content on a mobile handset is provided.12-11-2008
20090119280Hosted searching of private local area network information with support for add-on applications - Hosted searching of private LAN information is described. The apparatus includes a LAN crawler to automatically and repeatedly crawl a LAN having multiple devices, using a discovery module to discover the devices, a generic-probing module to attempt to collect the descriptive information according to a first set of probing requirements, and multiple specific-probing plug-ins each of which attempt to collect the descriptive information according to a second set of specific probing requirements. In another embodiment, the apparatus also includes a hosted on-demand search system including a centralized-search server to create and synchronize a private search database. The centralized-search server includes an application interface to receive a request to access the private search database from a third-party add-on application, to provide the accessed information to the third-party add-on application, and to receive from the third-party add-on application an application rendered component to be displayed on the user interface.05-07-2009
20090119277DIFFERENTIATION OF FIELD ATTRIBUTES AS VALUE CONSTRAINING VERSUS RECORD SET CONSTRAINING - Embodiments of the invention provide a method for creating value constraints and record constraints for entity based conditions, while (at least in some cases) reducing the amount of time and errors associated with manually composing query language statements (e.g., SQL). When composing an abstract query, a query interface may be provided for a user to input value constraints and record set constraints to create entity based conditions. The entity based conditions may specify a condition which is evaluated against all rows of data for an instance of a given model entity, and a record set constraint allows a user to specify a subset of records against which the entity based condition is applied.05-07-2009
20090119276Method and Internet-based Search Engine System for Storing, Sorting, and Displaying Search Results - A method and Internet-based search engine system for storing, sorting, and displaying search results. In a first step, a search is performed by keywords and returns results that are sorted by keyword score per result. In a second step, when a user clicks on a search result, the system displays the subsequent page. In a third and final step, the viewing of the subsequent page may add to the result's keyword score, resulting in an increase in that listing's score for that particular keyword, depending upon how long the user views it. The system also determines how long the user is viewing the subsequent page. The keyword score is determined by a result/listing's ability to engage the user who clicked on it. There are two types of sub-scores that determine the keyword score for the listing: the ‘7-score’ and the ‘15-score’.05-07-2009
20090119275METHOD OF MONITORING ELECTRONIC MEDIA - Consumer-generated media (CGM) and/or other media are monitored to allow an organization to become aware of, and respond to, issues that may affect how it is perceived by the public. An extract, transform, load (ETL) engine is used to process CGM and other media content, and an analytical engine utilizes a multi-step progressive filtering approach to identify those documents that are most relevant. The filtering approach includes executing broad queries to extract relevant content from different CGM and other sources, extracting text snippets from the relevant content and performing de-duplication, defining organizational identity (e.g., brand name, trade name, or company name) and hot-topic models using a rule-based and statistical-based approach, and using the models together in an orthogonal filtering approach to effectively generate alerts and reports. The methodology is found to be substantially more effective compared to a conventional keyword based approach.05-07-2009
20090119290ON-LINE E-MAIL SERVICE SYSTEM, AND SERVICE METHOD THEREOF - Embodiments of the present disclosure provide a method and system in which contextual information is obtained and inserted into the body of the e-mail. Users may authorize the access and analysis of the contents of their e-mails. Upon recognizing transmission of an email from one of the users, the content of the e-mail is analyzed to determine a topic or uncommon concept. The topic or uncommon concept of the email may be determined, for example, by analyzing the relative popularity or frequency of use of terms used in the e-mail. Information relevant to the e-mail's topic and content is then obtained by searching various sources, such as one or more databases, search servers, or the Internet. The relevant information may be inserted or otherwise included in the body of a reply e-mail or other related e-mail. The relevant information may be inserted in various forms, such as a hyperlink, or other formatted text. An e-mail may also be delivered to the sender to notify the sender of the information included in the reply e-mail. Furthermore, the e-mail and the relevant information may be translated into one or more languages depending on the preference of the sender or the receiver.05-07-2009
20090119287IMAGE PROCESSING APPARATUS, INFORMATION PROCESSING METHOD, AND COMPUTER-READABLE STORAGE MEDIUM - The object of the present invention is to provide a filtering function that is easy to use for filtering a document whose importance changes as time passes. To that end, the importance of each search condition and a valid period of the importance are set in association with each other. When searching for log data matching the set search condition, a score is calculated for the log data matching the search condition on the basis of the execution time of the search, the importance of the search condition, and the valid period of the importance. Log data whose calculated score exceeds a predetermined threshold is extracted.05-07-2009
20090119284METHOD AND SYSTEM FOR CLASSIFYING DISPLAY PAGES USING SUMMARIES - A method and system for classifying display pages based on automatically generated summaries of display pages. A web page classification system uses a web page summarization system to generate summaries of web pages. The summary of a web page may include the sentences of the web page that are most closely related to the primary topic of the web page. The summarization system may combine the benefits of multiple summarization techniques to identify the sentences of a web page that represent the primary topic of the web page. Once the summary is generated, the classification system may apply conventional classification techniques to the summary to classify the web page. The classification system may use conventional classification techniques such as a Naïve Bayesian classifier or a support vector machine to identify the classifications of a web page based on the summary generated by the summarization system.05-07-2009
20090119279Graph caching - In a method and apparatus for analyzing nodes of a Deterministic Finite Automata (DFA), an accessibility ranking, based on a DFA graph geometrical configuration, may be determined in order to determine cacheable portions of the DFA graph in order to reduce the number of external memory accesses. A walker process may be configured to walk the graph in a graph cache as well as main memory. The graph may be generated in a manner allowing each arc to include information if the node it is pointing to is stored in the graph cache or in main memory. The walker may use this information to determine whether or not to access the next arc in the graph cache or in main memory.05-07-2009
20090119278Continual Reorganization of Ordered Search Results Based on Current User Interaction - Responsive to each user interaction with search results to network locations returned by a search engine, a search result reorganizer predicts user interest in the search results from each dynamic user interaction. Responsive to each prediction of user interest while a user interacts with any of the search results, the search result reorganizer reorders the search results to reflect the user interest.05-07-2009
20090112839Media Enhancement Mechanism - A method to provide additional media objects for data objects containing one or more existing media objects is described. The existing media object is analyzed to determine additional related media available on the network, and the data object description is augmented with metadata to identify the additional media in an enhanced data object description. When the enhanced data object is rendered, the metadata facilitates incorporation of additional media objects in the displayed page.04-30-2009
20090100050CLIENT DEVICE FOR INTERACTING WITH A MIXED MEDIA REALITY RECOGNITION SYSTEM - The mobile device includes a client that has a number of modules, and the MMR Gateway and MMR matching unit are implemented as a server that has a number of modules. The implementation of the MMR system as a client and a server is advantageous because the modules may be distributed among the client and the server in a variety of configurations. The present invention includes a capture module, a preprocessing module, a feature extraction module, a retrieval module, a send message module, an action module, a prediction module, a feedback module, a sending module, an MMR database, a streaming module, an e-mail module, a voice recognition system and an audio database. These modules and systems are operational upon the client or the server. In one embodiment, the client includes only the capture module with the remaining modules operational on the server. In a second embodiment, the server includes the action module with the remaining modules operational on the client.04-16-2009
20090083263PARALLEL PROCESSING COMPUTER SYSTEMS WITH REDUCED POWER CONSUMPTION AND METHODS FOR PROVIDING THE SAME - This invention provides a computer system architecture and method for providing the same which can include a web page search node including a web page collection. The system and method can also include a web server configured to receive, from a given user via a web browser, a search query including keywords. The node is caused to search pages in its own collection that best match the search query. A search page returner may be provided which is configured to return, to the user, high ranked pages. The node may include a power-efficiency-enhanced processing subsystem, which includes M processors. The M processors are configured to emulate N virtual processors, and they are configured to limit a virtual processor memory access rate at which each of the N virtual processors accesses memory. The memory accessed by each of the N virtual processors may be RAM. In select embodiments, the memory accessed by each of the N virtual processors includes DRAM having a high capacity yet lower power consumption than SRAM.03-26-2009
20090083256Method and subsystem for searching media content within a content-search-service system - Various embodiments of the present invention include concept-service components of content-search-service systems which employ ontologies and vocabularies prepared for particular categories of content at particular times in order to score transcripts prepared from content items to enable a search-service component of a content-search-service system to assign estimates of the relatedness of portions of a content item to search criteria in order to render search results to clients of the content-search-service system. The concept-service component processes a search request to generate lists of related terms, and then employs the lists of related terms to process transcripts in order to score transcripts based on information contained in the ontologies.03-26-2009
20090063457AUGMENTING URL QUERIES - Computer-readable media, systems, and methods for augmenting URL queries are described. In embodiments, a URL query is received from a user and it is determined whether the URL query is a simple URL query. Further, if the URL query is a simple URL query, an augmented query is created by word-breaking at least a portion of the URL query and the augmented query is associated with one or more ranking preferences. In various other embodiments, a URL query is received from a user and it is determined whether the URL query is a complex URL query. Further, if the URL query is a complex URL query, an augmented query is created that is identical to the URL query and the augmented query is associated with one or more ranking preferences.03-05-2009
20080228764HYPERCUBE TOPOLOGY BASED ADVANCED SEARCH ALGORITHM - The present invention is a system and method of conducting an adaptive search from a plurality of data sources utilizing a hypercube topology. The system includes a search engine which utilizes a hypercube architecture having a plurality of hypercubes. Each hypercube indexes several data sources in a manner such that similar data sources are located in proximity with other similar data sources. In addition, the search engine utilizes a plurality of message passing ants providing a signal of a path taken for other message passing ants to follow.09-18-2008
20090006377SYSTEM, METHOD AND COMPUTER EXECUTABLE PROGRAM FOR INFORMATION TRACKING FROM HETEROGENEOUS SOURCES - A system for information clustering comprising a data accumulation part for accumulating documents in a document repository, the documents having loosely related attributes, and defining a cluster between the documents being time sliced so as to define chunks of the documents; a vector space generation part for generating document-keyword vectors, the document-keyword vectors consisting of sparse numeral values depending on presence of key words; a dimension reduction part for reducing dimensions of the keywords to create a dimension reduction matrix of the document-keyword matrix; a centroid vector determination part for generating a centroid vector of the cluster, the centroid vectors being defined from keywords and weight of documents within the cluster; and an item repository for storing the centroid vectors together with the keywords and the weights of the centroid vector.01-01-2009
20090006356CHANGING RANKING ALGORITHMS BASED ON CUSTOMER SETTINGS - Search term ranking algorithms can be generated and updated based on customer settings, such as where a ranking algorithm is modeled as a combination function of different ranking factors. An end user of a search system provides personalized preferences for weighted attributes, generally or for a single instance of the query. The user also can indicate the relative importance of one or more ranking factors by specifying different weights to the factors. Ranking factors can specify document attributes, such as document title, document body, document page rank, etc. Based on the attribute weights and the received user query, a ranking algorithm function will produce the relevant value for each document corresponding to the user preferences and personalization configurations.01-01-2009
20080313168RANKING DOCUMENTS BASED ON A SERIES OF DOCUMENT GRAPHS - Ranking documents based on a series of web graphs collected over time is provided. A ranking system provides multiple transition probability distributions representing different snapshots or times. Each transition probability distribution represents a probability of transitioning from one document to another document within a collection of documents using a link of the document. The ranking system determines a stationary probability distribution for each snapshot based on the transition probability distributions for that snapshot and the stationary probability distribution of the previous snapshot. The stationary probability distributions represent a ranking of the documents over time.12-18-2008
20080281807SEARCH ENGINE - A search engine comprising search indices for entities, wherein a tag reputation of a tag which classifies an entity is updated by said search engine depending on a rating input by said user and depending on a user reputation of said user.11-13-2008
20080270392Optimizing Execution of Database Queries Containing User-Defined Functions - A query engine (or optimizer) which supports database queries having user-defined functions maintains historical execution data with respect to each of multiple user-defined functions. The historical execution data is dynamically updated based on query execution performance. When executing a query having user-defined functions, the query engine uses the historical execution data to predict an optimal evaluation ordering for the query conditions and, preferably, to dynamically adjust the evaluation order when appropriate. Preferably, the historical execution data includes historical execution time of the user-defined function and proportion of evaluated records which satisfied the query parameters.10-30-2008
20080270393TECHNIQUES FOR PERSONALIZED AND ADAPTIVE SEARCH SERVICES - Techniques are presented for automatically selecting information sources that are most relevant to user queries. Results of searches returned by information sources for queries are analyzed and the information sources are ranked based on this analysis. The information sources that have high rankings for a query are subsequently used to search for relevant results. This process can be adaptive, as the returned results of old queries can be analyzed at a later date to update the ranking of the information sources, automatic searches can be performed to update the ranking of the information sources, new queries can be used for analysis and stored, new information sources added, and old information sources deleted. A linguistic library is used to store personal categories for one or more users and general categories. Each category is associated with keywords and ranked lists of information sources. The library also contains general categories, taxonomies, and dictionaries.10-30-2008
20080270394GENERATING DESCRIPTIONS OF MATCHING RESOURCES BASED ON THE KIND, QUALITY, AND RELEVANCE OF AVAILABLE SOURCES OF INFORMATION ABOUT THE MATCHING RESOURCES - Techniques are provided for generating descriptions of matching resources in a manner that takes into account the kind, quality, and relevance of the available sources of information about the matching resources. For example, after the search engine identifies matching resources based on the query terms, the search engine determines the kinds of available sources of information about each matching resource. For each matching resource, based on the kinds of available sources of information about the matching resource, one of a plurality of processes is selected to generate a description for the matching resource. Using the content-sensitive description generation techniques described herein, a single result set may include abstracts that were generated using several different processes, where the difference in process corresponds to a difference in the kind, quality, and relevance of the available sources of information about each matching resource.10-30-2008
20090083255Query spelling correction - A technology for query spelling correction is disclosed. In one method approach, web search results generated based on a query term are received. The web search results are used as a part of determining a correction candidate for the query term if the query term is incorrectly spelt.03-26-2009
20090248679INFORMATION DEVICE AND INFORMATION PRESENTATION METHOD - An information device and an information presentation method are disclosed. The neighboring environment in which a device is located or the behavior of the device user is recognized based on sensor data thereby to determine the situation data. The plural object information are acquired in accordance with the situation data. Plural related words related to the plural object information are retrieved from a database and thereby developed. Plural related words are displayed to permit any one of them to be selected. Once one of the related words is selected, the object information is reduced to only the one for the related word, and any one of the object information is displayed in a selectable manner.10-01-2009
20080256062METHOD AND SYSTEM FOR PROPAGATING ANNOTATIONS USING PATTERN MATCHING - Methods, systems, and articles of manufacture for propagating annotations created for data objects appearing in a variety of different application types are provided. Some embodiments present users collaborating on a project with an indication of data objects in a current document that have been annotated, or that related data objects have been annotated, in other documents. Users may then review the annotations and selectively associate the annotations with the related data object in the current document, thereby spreading the tacit knowledge reflected in the annotation about related data objects across many documents in an enterprise network. Further, an annotation management system may maintain a thesaurus of related terms and corresponding annotation points to find annotations for data objects that exist in other documents without having to inspect the data object(s) associated with each existing annotation.10-16-2008
20080256067File Search Engine and Computerized Method of Tagging Files with Vectors - The main purpose of the software, system and method of this invention is to help produce better searches for people utilizing their context as represented by a vector. The system allows for files (including websites) to be tagged with a vector. If a provider wants a searcher to find that provider's files, the file must be tagged with the vector that is sufficiently close to the searcher's corresponding vectors. A search user inputs not only a text search but also the vectors that have been created to show the context and preferences of that search user.10-16-2008
20080256057Optimizing a query using fuzzy matching - A system is disclosed for optimizing a user query. User queries often include issue terms, such as misspelled or mistyped terms. The disclosed system employs a fuzzy network to match an issue term with a valid term. The system optimizes the user query with the valid term. Thereafter, query results based on the optimized user query may be provided to the user.10-16-2008
20080256069Complete Context(tm) Query System10-16-2008
20080256060SYSTEM FOR DETERMINING THE QUALITY OF QUERY SUGGESTIONS USING A NETWORK OF USERS AND ADVERTISERS - A system is described for determining the quality of query suggestions using a network of users and advertisers. The system may include a memory and a processor. The memory may store a historical dataset, a residual value, a query-advertisement link value, a query suggestion value, and a data representing a network. The network may comprise a plurality of query items linked to a plurality of advertisement items via a plurality of query-advertisement link items. The processor may generate data representing the network and may identify a query-advertisement link item in the network. The processor may calculate the residual value of the query suggestion system represented by the match type of the query-advertisement link item. The processor may calculate the query-advertisement link value. The processor may add the residual value to the query-advertisement link value to determine a query suggestion value and may store the query suggestion value in the memory.10-16-2008
20080256068METHOD AND SYSTEM FOR CALCULATING IMPORTANCE OF A BLOCK WITHIN A DISPLAY PAGE - A method and system for identifying the importance of information areas of a display page. An importance system identifies information areas or blocks of a web page. A block of a web page represents an area of the web page that appears to relate to a similar topic. The importance system provides the characteristics or features of a block to an importance function that generates an indication of the importance of that block to its web page. The importance system “learns” the importance function by generating a model based on the features of blocks and the user-specified importance of those blocks. To learn the importance function, the importance system asks users to provide an indication of the importance of blocks of web pages in a collection of web pages.10-16-2008
20080256063TECHNIQUE FOR SEARCHING FOR KEYWORDS DETERMINING EVENT OCCURRENCE - A keyword search system including a text input unit for inputting subtexts obtained by dividing each text into parts, while associating the subtexts with an event through a process recorded in the text; a prediction device adjuster for adjusting a corresponding event prediction device to maximize the percentage of text in which the inputted event is identical to a prediction result in a first text group selected from the subtexts; a prediction processor for generating a prediction result for each section, by inputting each text in a second text group selected from the corresponding subtexts in the adjusted event prediction device; and a search unit for calculating the prediction precision for the second text group of the event prediction device using a comparison between the inputted event and the prediction result for each subtext, and searching for keywords in sections with a certain degree of prediction precision.10-16-2008
20080256061SYSTEM FOR GENERATING QUERY SUGGESTIONS BY INTEGRATING VALUABLE QUERY SUGGESTIONS WITH EXPERIMENTAL QUERY SUGGESTIONS USING A NETWORK OF USERS AND ADVERTISERS - A system is described for generating query suggestions by integrating valuable query suggestions with experimental query suggestions using a network of users and advertisers. The system may include a memory, an interface, and a processor. The memory may store a historical dataset, a plurality of query suggestions, a plurality of query suggestion values, a query exploit set, a query explore set, and a data describing a network. The processor may identify the plurality of query suggestions in the historical dataset and generate data describing the network based on the historical dataset. The processor may calculate the query suggestion value for each query suggestion and may rank the query suggestions based on the query suggestion values. The processor may generate an exploit set comprising the top ranked query suggestions and an explore set comprising the remainder. The processor may suggest the query suggestions in the exploit set and the explore set.10-16-2008
20080256059SYSTEM FOR GENERATING QUERY SUGGESTIONS USING A NETWORK OF USERS AND ADVERTISERS - A system is described for generating query suggestions using a network of users and advertisers. The system may include a memory, an interface, and a processor. The memory may store a data representing a network comprising query items linked to advertisement items via link items, wherein each link item comprises a weight representing the strength of the relationship between each query item and advertisement item, a search query item, and a relevance value for each query item. The processor may be operatively connected to the memory and the interface and may identify the data representing the network and receive a search query item. The processor may calculate a relevance value for each additional query item in the network based on its relationship to the received search query item. The processor may then suggest the query items with the highest relevance values to the user via the interface.10-16-2008
20080256054Computer-implemented method and system for targeting contents according to user preferences - A method, system, and device targets contents according to the preferences of a particular user. A content is associated with one or more content category alternatives, and is different from other contents. Pairwise comparisons for a particular user for a set of content category alternatives are input into a computer, wherein a pairwise comparison includes a judgment between preferences as a relative importance between two content category alternatives. A weighted prioritization of the content category alternatives of the pairwise comparisons for the particular user is prepared in the computer, according to an analytic hierarchy process. The weighted prioritization of the content category alternatives for the particular user is applied, in the computer, to the contents. A weight is associated with the content according to the weighted prioritizations of the content category alternative corresponding to the content categorization of the contents. The contents are provided according to the weight.10-16-2008
20080256053EXECUTION OF DATABASE QUERIES INCLUDING FILTERING - A query processing system has a query processor and a data manager. The query processor calls the data manager to carry out data access for a query including a filtering operation. The data manager accesses the data in a set of data and before returning the data, initiates a callback to the query processor to determine if the located data meets the filtering criteria. Where the data does not satisfy the filtering criteria, the data manager seeks additional data in the set of data, without having to return the first located data to the query processor.10-16-2008
20080256051CALCULATING IMPORTANCE OF DOCUMENTS FACTORING HISTORICAL IMPORTANCE - A method and system for determining temporal importance of documents having links between documents based on a temporal analysis of the links is provided. A temporal ranking system collects link information or snapshots indicating the links between documents at various snapshot times. The temporal ranking system calculates a current temporal importance of a document by factoring in the current importance of the document derived from the current snapshot (i.e., with the latest snapshot time) and the historical importance of the document derived from the past snapshots. To calculate the current temporal importance of a web page, the temporal ranking system aggregates the importance of the web page for each snapshot.10-16-2008
20080256050SYSTEM AND METHOD FOR MODELING USER SELECTION FEEDBACK IN A SEARCH RESULT PAGE - The present invention provides for improving the search relevance of a search results page by including a perceived relevance factor. The system, device and method monitors user selection of elements in the search results page, where these selections indicate relevance of the element compared with the original search request. A perceived relevance factor for the element is then determined based on probabilistic-based computations accounting for the element, which may include a description, a thumbnail and/or meta data, and the position of the element on the search results page. Thereby, for future searches and search results page generation, relevance factors may be calculated based on various factors, including the element attribute based relevant scores and the perceived relevance factor.10-16-2008
20080256052METHODS FOR DETERMINING HISTORICAL EFFICACY OF A DOCUMENT IN SATISFYING A USER'S SEARCH NEEDS - Documents returned by a search engine may be good keyword matches to the search query terms, but may not historically have been very effective in addressing user needs. Documents which have historically been effective in addressing user needs are said to have high efficacy. Disclosed are methods that try to assess the beginning and ending of user search sessions, assume that documents that are the last document looked at are those with the highest efficacy, and incorporate this notion of efficacy in returning search results.10-16-2008
20080243830User suggested ordering to influence search result ranking - A method, apparatus, and system of user suggested ordering to influence search result ranking are disclosed. In one embodiment, a method includes generating a search result having a set of links each associated with a content data relevant to a search query, ranking individual ones of the set of links based on an algorithm in the search result, applying a weighting factor to certain ones of the set of links based on a user suggested ordering of the ranking of the individual ones of the set of links in relation to each other, and ordering the search result based on an application of the weighting factor on the search result.10-02-2008
20080208836Regression framework for learning ranking functions using relative preferences - A method and apparatus for determining a ranking function by regression using relative preference data. A number of iterations are performed in which the following is performed. The current ranking function is used to compare pairs of elements. The comparisons are checked against actual preference data to determine for which pairs the ranking function mis-predicted (contradicting pairs). A regression function is fitted to a set of training data that is based on contradicting pairs and a target value for each element. The target value for each element may be based on the value that the ranking function predicted for the other element in the pair. The ranking function for the next iteration is determined based, at least in part, on the regression function. The final ranking function is established based on the regression functions. For example, the final ranking function may be based on a linear combination of regression functions.08-28-2008
20090125503WEB PAGE CATEGORIZATION USING GRAPH-BASED TERM SELECTION - This disclosure describes systems and methods for categorizing web pages. Web pages and terms selected from those web pages are organized in a matrix. The number of terms in the matrix are filtered using a Laplacian score algorithm. A linear regression algorithm or some other algorithm may use the filtered set of terms to fit the web pages into pre-defined categories.05-14-2009
20090204601SOCIAL NETWORK SEARCH - A device, system and method to enable communications over a network wherein a user may conduct a search directed to target contacts within a social network. A knowledge base of prior social search responses may be searched for responses from the target contacts with the results being presented to the user. The results of the search can be sorted along with responses received from the target contacts. The selection of target contacts and presentation of results can be based on various attributes of target contacts or ranking of the prior search responses. The search responses received by the user along with attributes and rankings may be stored in the knowledge base for future use. The target contacts and search may be taken from contacts or the knowledge base of the contacts with greater than one degree of separation from the user.08-13-2009
20100030775System And Method For Storing And Retrieving Non-Text-Based Information - A method for non-text-based identification of a selected item of stored music. The first broad portion of the method focuses on building a music identification database. That process requires capturing a tag of the selected musical item, and processing the tag to develop reference key to the same. Then the tag is stored, together with the reference key and an association to the stored music. The database is built by collecting a multiplicity of tags. The second broad portion of the method is retrieving a desired item of stored music from the database. That process calls for capturing a query tag from a user, and processing the query tag to develop a query key to the same. The query tag is compared to reference keys stored in the database to identify the desired item of stored music.02-04-2010
20090300011CONTENTS RETRIEVAL DEVICE - The contents retrieval device (12-03-2009
20090300010SYSTEM, APPARATUS AND METHOD FOR GENERATING AND RANKING CONTACT INFORMATION AND RELATED ADVERTISEMENTS IN RESPONSE TO QUERY ON COMMUNICATION DEVICE - The present invention relates to a method, system, and apparatus to download contact information of one or more entities in one or more geographic areas from remote server into the contact list of a communication device. Communication network between remote server and communication device; and contact information databases having identical data fields is provided in remote server and communication device. According to another aspect, communication device application having means to determine communication device location; and having means to retrieve contact information from communication device contact list in response to user query; and sort retrieved contact information in order of their proximity to communication device location is provided. According to another aspect of the invention; apparatus, method, and system for advertising on communication devices in conjunction with contact information of entities is provided. According to yet another aspect of the invention; means to determine popularity and ranking of contact information of entities is provided.12-03-2009
20090300009Behavioral Targeting For Tracking, Aggregating, And Predicting Online Behavior - A pre-computed concept map represents concepts, concept metadata, and relationships between the plurality of concepts. Online user behavior may be predicted by correlating one or more online events of a user with one or more features of the concept map, aggregating a concept map history of the user to obtain online behavior over time, aggregating online behavior of the user and one or more other users to obtain aggregated online user behavior, and predicting future online behavior of the user based at least in part on the online behavior of the user and the aggregated online user behavior. The predicted behavior may be used to target ads that the user is likely to find relevant.12-03-2009
20090300001SERVER APPARATUS, CATALOG PROCESSING METHOD, AND COMPUTER-READABLE STORAGE MEDIUM - Some embodiments of the present invention provide that a web application server reads catalog information, and selects grouping data. Then, the web application server sets web-application-server grouping. When an instruction on execution of grouping is issued from a client PC, the web application server registers catalog data items for individual groups. When association data is selected by the client PC, the web application server registers association information to a database.12-03-2009
20090299998Keyword discovery tools for populating a private keyword database - Methods and systems disclosed herein relate to keyword discovery tools for populating a private keyword database. Keyword discovery relates to continuously and automatically incrementing a working keyword data set for new periods of time based on retrieval of at least one of new traffic-generating keywords and new suggested keywords. Related user interfaces, applications, and computer program products are disclosed.12-03-2009
20090300008ADAPTIVE RECOMMENDER TECHNOLOGY - A computer implemented method for incorporating media item data for use in a media item recommender system comprising: accessing a first database comprising a plurality of media item identifiers and associated metadata corresponding to each of a plurality of media items identified by the media item identifiers; generating first correlation data based on a comparison of the metadata corresponding to pairs of the media item identifiers to detect similarities between the media items identified; accessing a second database comprising a plurality of media item identifier sets for identifying sets of media items; generating second correlation data based on an analysis of the media item identifier sets to determine incidence of selected subsets of media item identifiers occurring together in a same media item identifier set; accessing a third database comprising a plurality of consumed media item identifier sets, wherein the consumed media item identifier sets associate one or more media item identifiers in a particular set based on media item consumption data; generating third correlation data based on an analysis of the consumed media item identifier sets to determine incidence of selected subsets of the consumed media item identifiers occurring together in a same consumed media item identifier set; and merging the first, second, and third correlation data to generate media item recommender data.12-03-2009
20090300005SEARCH APPARATUS AND METHOD FOR CONTROLLING SEARCH APPARATUS - A method for controlling a search apparatus that searches a plurality of data each having an attribute value for each attribute item according to a search condition defined by the attribute value, the method includes detecting a change of the attribute value of one or more data of the plurality of data, changing the search condition including the changed attribute value according to the detected change of the attribute value, and performing a search according to the changed search condition.12-03-2009
20090300004CONTENTS DISPLAY DEVICE AND CONTENTS DISPLAY METHOD - Based on a content attribute serving as a coordinate axis of which the setting input is performed from an operation input unit, and the content identifier of a content of interest, a metadata storage unit is searched to select one or multiple other contents relating to the content of interest. The strength of relationship between each of the selected other contents and the content of interest is calculated based on the content attribute set as a coordinate axis, and information indicating correlation. The layout relations of other contents with the content of interest as the origin are calculated based on the content attribute serving as a coordinate axis, and the calculated strength of relationship. The display image of each of the other contents is disposed in accordance with the calculated layout relations.12-03-2009
20090300003APPARATUS AND METHOD FOR SUPPORTING KEYWORD INPUT - A keyword input supporting apparatus includes a document acquisition unit that acquires a document having a plurality of components containing text data, a main component selection unit that selects a component having many characters in the text data as a main component, a part-of-speech analysis unit that analyzes the part-of-speech of the text data contained in the main component, and adds a semantic attribute to each of words of the text data, a specific name extraction unit that extracts as a specific name a word, having a predetermined semantic attribute or part of speech, from the words, a specific name storage that stores the specific name together with the corresponding semantic attribute, a keyword candidate classification unit that performs classification of the specific name from the storage as a keyword candidate based on the semantic attribute, and a keyword candidate presentation unit that presents the keyword candidate to a user.12-03-2009
20090299997GROUPING WORK SUPPORT PROCESSING METHOD AND APPARATUS - This method includes: extracting plural feature expressions from plural documents, and categorizing the extracted feature expressions into plural sets; presenting a user with one of the plural sets in a manner that the feature expressions included in the set can be recognized; accepting, from the user, a grouping instruction including designation of the feature expression to be unified among the feature expressions included in a specific set, and counting, as a first value, the number of documents including the feature expression to be unified, which is included in the grouping instruction; counting, as a second value, the number of documents including the feature expression included in a set that is other than the specific set and identified by a grouping mode and/or state; judging based on the first and second values whether a predetermined condition is satisfied; upon detecting that the predetermined condition is satisfied, notifying the user of the completion of designation of the feature expression to be unified.12-03-2009
20090299995METHOD FOR OUTPUTTING DATA RECORDS, AND DEVICE THEREFOR - A method and a device are provided for outputting data records on the basis of input data records entered by a user, a set of data records present in a database being structured via a tree structure, and search criteria and filter information items being assigned to nodes in the tree structure which are not terminal nodes.12-03-2009
20090299992METHODS AND SYSTEMS FOR IDENTIFYING DESIRED INFORMATION - A method of identifying desired objects of information determines whether an existing rule is appropriate to identify a new desired object of information, defines a new rule to include at least one search query string when one of the existing rules is not appropriate to identify the new desired object of information, and defines an initial new search query string to identify the new desired object of information, wherein the initial search query string has a search query string input value. Furthermore, the method includes identifying objects having an object value equal to the search query string input value, and identifying the objects as the results of the processing operation and as having an equivalence relationship with the initial search query string. When the results do not satisfy the new rule, subsequent search query strings are defined to form a search query string chain.12-03-2009
20090299991RECOMMENDING QUERIES WHEN SEARCHING AGAINST KEYWORDS - A query including one or more current search terms is received from a user and executed against a target database. When the query yields a number of results less than a defined search threshold (a.k.a. an “unsuccessful” search), the current search terms are compared with an associations database. The associations database includes associations between search terms in previously-executed queries that yielded less than a threshold number of results and replacement search terms that were substituted to generate a successful query yielding at least the threshold number of results. Upon finding a match between one or more of the search terms and the current search terms, the associations between the search terms and the replacement search terms are used to identify suggested replacement search terms and present them to the user.12-03-2009
20090299990METHOD, APPARATUS AND COMPUTER PROGRAM PRODUCT FOR PROVIDING CORRELATIONS BETWEEN INFORMATION FROM HETEROGENOUS SOURCES - An apparatus for providing correlations between information from heterogeneous sources may include a processor. The processor may be configured to analyze at least two different datasets in which each dataset includes entities with respective attributes corresponding to each of the entities, determine a set of correlations between entities in which at least one correlation determined includes a correlation between entities in different datasets, and filter the set of correlations to produce a focused set of correlations based on selecting correlations among the set of correlations that correspond to selection criteria for inclusion in the focused set of correlations. The set of correlations may be determined based at least in part on the respective attributes corresponding to each of the entities.12-03-2009
20090300007INFORMATION PROCESSING APPARATUS, FULL TEXT RETRIEVAL METHOD, AND COMPUTER-READABLE ENCODING MEDIUM RECORDED WITH A COMPUTER PROGRAM THEREOF - An information processing apparatus for creating a retrieval result displaying a list of retrieval documents is disclosed. Retrieval documents corresponding to a retrieval condition are classified into groups based on scores indicating degrees of relevance to the retrieval condition. For each of the groups to which the retrieval documents belong, a clustering process is conducted on the retrieval documents in that group.12-03-2009
20090292698METHOD FOR EXTRACTING A COMPACT REPRESENTATION OF THE TOPICAL CONTENT OF AN ELECTRONIC TEXT - An electronic document is parsed to remove irrelevant text and to identify the significant elements of the retained text. The elements are assigned scores representing their significance to the topical content of the document. A matrix of element-pairs is constructed such that the matrix nodes represent the result of one or more functions of the scores and other attributes of the paired elements. The resulting matrix is a compact representation of topical content that affords great precision in information retrieval applications that depend on measurements of the relatedness of topical content.11-26-2009