Zhiwei Li, Beijing CN

Patent application number	Description	Published
20080215561	SCORING RELEVANCE OF A DOCUMENT BASED ON IMAGE TEXT - A method and system for determining relevance of a document having text and images to a text string is provided. A scoring system identifies image text associated with an image of the document. The scoring system calculates an image score indicating relevance of the image text to the text string. The image score may be used in many applications, such as searching, summary generation, and document classification, image search, and image classification.	09-04-2008
20080263042	OBJECT SIMILARITY SEARCH IN HIGH-DIMENSIONAL VECTOR SPACES - An object search system generates a hierarchical clustering of objects of a collection based on similarity of the objects. The object search system generates a separate hierarchical clustering of objects for multiple features of the objects. To identify objects similar to a target object, the object search system first generates a feature vector for the target object. For each feature of the feature vector, the object search system uses the hierarchical clustering of objects to identify the cluster of objects that is most “feature similar” to that feature of the target object. The object search system indicates the similarity of each candidate object based on the features for which the candidate object is similar.	10-23-2008
20080313031	CLASSIFICATION OF IMAGES AS ADVERTISEMENT IMAGES OR NON-ADVERTISEMENT IMAGES - An advertisement image classification system trains a binary classifier to classify images as advertisement images or non-advertisement images and then uses the binary classifier to classify images of web pages as advertisement images or non-advertisement images. During a training phase, the classification system generates training data of feature vectors representing the images and labels indicating whether an image is an advertisement image or a non-advertisement image. The classification system trains a binary classifier to classify images using training data. During a classification phase, the classification system inputs a web page with an image and generates a feature vector for the image. The classification system then applies the trained binary classifier to the feature vector to generate a score indicating whether the image is an advertisement image or a non-advertisement image.	12-18-2008
20080313177	ADDING DOMINANT MEDIA ELEMENTS TO SEARCH RESULTS - A method and system for determining dominance of the media elements of display pages is provided. The dominance system provides a scoring mechanism for scoring the dominance of media elements of display pages based on features of each media element of the display page. To generate the scores for the media elements of the display page, the dominance system first identifies the media elements and then identifies the features of the media elements. The dominance system then scores the identified media elements using the provided scoring mechanism and the identified features.	12-18-2008
20090006189	DISPLAYING OF ADVERTISEMENT-INFUSED THUMBNAILS OF IMAGES - An image advertisement system of a computing device displays as part of a display page an advertisement-infused thumbnail of an image prior to displaying the image. The image advertisement system initially receives a display page with an indication of an image to be displayed as part of the display page. The image advertisement system generates an advertisement-infused thumbnail of the image by combining advertisement content with a thumbnail of the image. The image advertisement system then displays the display page with the advertisement-infused thumbnail of the image in place of the image. The image advertisement system then replaces the displayed advertisement-infused thumbnail with the image.	01-01-2009
20090041366	GENERATING SEARCH REQUESTS FROM MULTIMODAL QUERIES - A method and system for generating a search request from a multimodal query that includes a query image and query text is provided. The multimodal query system identifies images of a collection that are textually related to the query image based on similarity between words associated with each image and the query text. The multimodal query system then selects those images of the identified images that are visually related to the query image. The multimodal query system may formulate a search request based on keywords of web pages that contain the selected images and submit that search request to a search engine service.	02-12-2009
20090060351	Visual Language Modeling for Image Classification - Systems and methods for visual language modeling for image classification are described. In one aspect the systems and methods model training images corresponding to multiple image categories as matrices of visual words. Visual language models are generated from the matrices. In view of a given image, for example, provided by a user or from the Web, the systems and methods determine an image category corresponding to the given image. This image categorization is accomplished by maximizing the posterior probability of visual words associated with the given image over the visual language models. The image category, or a result corresponding to the image category, is presented to the user.	03-05-2009
20090063455	Bipartite Graph Reinforcement Modeling to Annotate Web Images - Systems and methods for bipartite graph reinforcement modeling to annotate web images are described. In one aspect the systems and methods implement bipartite graph reinforcement modeling operations to identify a set of annotations that are relevant to a Web image. The systems and methods annotate the Web image with the identified annotations. The systems and methods then index the annotated Web image. Responsive to receiving an image search query from a user, wherein the image search query comprises information relevant to at least a subset of the identified annotations, the image search engine service presents the annotated Web image to the user.	03-05-2009
20090074306	Estimating Word Correlations from Images - Word correlations are estimated using a content-based method, which uses visual features of image representations of the words. The image representations of the subject words may be generated by retrieving images from data sources (such as the Internet) using image search with the subject words as query words. One aspect of the techniques is based on calculating the visual distance or visual similarity between the sets of retrieved images corresponding to each query word. The other is based on calculating the visual consistence among the set of the retrieved images corresponding to a conjunctive query word. The combination of the content-based method and a text-based method may produce even better result.	03-19-2009
20090076800	Dual Cross-Media Relevance Model for Image Annotation - A dual cross-media relevance model (DCMRM) is used for automatic image annotation. In contrast to the traditional relevance models which calculate the joint probability of words and images over a training image database, the DCMRM model estimates the joint probability by calculating the expectation over words in a predefined lexicon. The DCMRM model may be advantageous because a predefined lexicon potentially has better behavior than a training image database. The DCMRM model also takes advantage of content-based techniques and image search techniques to define the word-to-image and word-to-word relations involved in image annotation. Both relations can be estimated by using image search techniques on the web data as well as available training data.	03-19-2009
20110058734	CLASSIFICATION OF IMAGES AS ADVERTISEMENT IMAGES OR NON-ADVERTISEMENT IMAGES - An advertisement image classification system trains a binary classifier to classify images as advertisement images or non-advertisement images and then uses the binary classifier to classify images of web pages as advertisement images or non-advertisement images. During a training phase, the classification system generates training data of feature vectors representing the images and labels indicating whether an image is an advertisement image or a non-advertisement Image. The classification system trains a binary classifier to classify Images using training data. During a classification phase, the classification system inputs a web page with an image and generates a feature vector for the image. The classification system then applies the trained binary classifier to the feature vector to generate a score indicating whether the image is an advertisement image or a non-advertisement image.	03-10-2011
20110078159	Long-Query Retrieval - Described herein is a technology that facilitates efficient large-scale similarity-based retrieval. In several embodiments documents, images, and/or other multimedia files are compactly represented and efficiently indexed to enable robust search using a long-query in a large-scale corpus. As described herein, these techniques include performing decomposition of a file, e.g., a document or document-like representation. The techniques use dimension reduction to obtain three parts, topic-related words (major semantics), document specific words (minor semantics), and background words, representing the major semantics in a feature vector and the minor semantics as keywords. Using the techniques described, file vectors are matched in a topic model and the results ranked based on the keywords.	03-31-2011
20110087660	SCORING RELEVANCE OF A DOCUMENT BASED ON IMAGE TEXT - A method and system for determining relevance of a document having text and images to a text string is provided. A scoring system identifies image text associated with an image of the document. The scoring system calculates an image score indicating relevance of the image text to the text string. The image score may be used in many applications, such as searching, summary generation, and document classification, image search, and image classification.	04-14-2011
20110194780	OBJECT SIMILARITY SEARCH IN HIGH-DIMENSIONAL VECTOR SPACES - An object search system generates a hierarchical clustering of objects of a collection based on similarity of the objects. The object search system generates a separate hierarchical clustering of objects for multiple features of the objects. To identify objects similar to a target object, the object search system first generates a feature vector for the target object. For each feature of the feature vector, the object search system uses the hierarchical clustering of objects to identify the cluster of objects that is most “feature similar” to that feature of the target object. The object search system indicates the similarity of each candidate object based on the features for which the candidate object is similar.	08-11-2011
20110264659	TRAINING A RANKING FUNCTION USING PROPAGATED DOCUMENT RELEVANCE - A method and system for propagating the relevance of labeled documents to a query to unlabeled documents is provided. The propagation system provides training data that includes queries, documents labeled with their relevance to the queries, and unlabeled documents. The propagation system then calculates the similarity between pairs of documents in the training data. The propagation system then propagates the relevance of the labeled documents to similar, but unlabeled, documents. The propagation system may iteratively propagate labels of the documents until the labels converge on a solution. The training data with the propagated relevances can then be used to train a ranking function.	10-27-2011
20110270821	ADDING DOMINANT MEDIA ELEMENTS TO SEARCH RESULTS - A method and system for determining dominance of the media elements of display pages is provided. The dominance system provides a scoring mechanism for scoring the dominance of media elements of display pages based on features of each media element of the display page. To generate the scores for the media elements of the display page, the dominance system first identifies the media elements and then identifies the features of the media elements. The dominance system then scores the identified media elements using the provided scoring mechanism and the identified features.	11-03-2011
20120072410	Image Search by Interactive Sketching and Tagging - Sketch and tagging based image search may include receiving a sketch query input and identifying an object in a desired image. The object or objects of the sketch query may be tagged with a text, and searching performed based on the objects. Certain implementations include indexing patches of the images, where the patches represent the objects. Relevant images can be returned based on the index of the patches.	03-22-2012
20120093371	GENERATING SEARCH REQUESTS FROM MULTIMODAL QUERIES - A method and system for generating a search request from a multimodal query that includes a query image and query text is provided. The multimodal query system identifies images of a collection that are textually related to the query image based on similarity between words associated with each image and the query text. The multimodal query system then selects those images of the identified images that are visually related to the query image. The multimodal query system may formulate a search request based on keywords of web pages that contain the selected images and submit that search request to a search engine service.	04-19-2012
20120269432	IMAGE RETRIEVAL USING SPATIAL BAG-OF-FEATURES - Local features of retrieved images are identified and for each image, an ordered bag-of-features is created that describes the features of the image. Spatial relationships between features of an image are captured in histogram descriptors created from linear or circular projections. Using the histogram descriptors, ordered bags-of-features describing the features of the images are developed. Calibrating the ordered bags-of-features to account for spatial variance leads to spatial bags-of-features.	10-25-2012
20120290577	IDENTIFYING VISUAL CONTEXTUAL SYNONYMS - Tools and techniques for identifying visual contextual synonyms are described herein. The described operations use visual words having similar contextual distributions as contextual synonyms to identify and describe visual objects that share semantic meaning. The contextual distribution of a visual word is described using the statistics of co-occurrence and spatial information averaged over image patches that share the visual word. In various implementations, the techniques are employed to construct a visual contextual synonym dictionary for a large visual vocabulary. In various implementations, the visual contextual synonym dictionary narrows the semantic gap for large-scale visual search.	11-15-2012
20120301014	LEARNING TO RANK LOCAL INTEREST POINTS - Tools and techniques for learning to rank local interest points from images using a data-driven scale-invariant feature transform (SIFT) approach termed “Rank-SIFT” are described herein. Rank-SIFT provides a flexible framework to select stable local interest points using supervised learning. A Rank-SIFT application detects interest points, learns differential features, and implements ranking model training in the Gaussian scale space (GSS). In various implementations a stability score is calculated for ranking the local interest points by extracting features from the GSS and characterizing the local interest points based on the features being extracted from the GSS across images containing the same visual objects.	11-29-2012
20130346416	Long-Query Retrieval - Described herein is a technology that facilitates efficient large-scale similarity-based retrieval. In several embodiments documents, images, and/or other multimedia files are compactly represented and efficiently indexed to enable robust search using a long-query in a large-scale corpus. As described herein, these techniques include performing decomposition of a file, e.g., an image, a document containing an image, or a document-like representation of an image. The techniques use dimension reduction to obtain three parts, low-dimensional representations (major semantics), file specific terms (minor semantics), and background words, representing the major semantics in a feature vector and the minor semantics as keywords. Using the techniques described, file vectors are matched in a topic model and the results ranked based on the keywords.	12-26-2013
20140029856	THREE-DIMENSIONAL VISUAL PHRASES FOR OBJECT RECOGNITION - The techniques discussed herein discover three-dimensional (3-D) visual phrases for an object based on a 3-D model of the object. The techniques then describe the 3-D visual phrases. Once described, the techniques use the 3-D visual phrases to detect the object in an image (e.g., object recognition).	01-30-2014
20140368620	USER INTERFACE FOR THREE-DIMENSIONAL MODELING - A method of acquiring a set of images useable to 3D model a physical object includes imaging the physical object with a camera, and displaying with the camera a current view of the physical object as imaged by the camera from a current perspective. The method further includes displaying with the camera a visual cue overlaying the current view and indicating perspectives from which the physical object is to be imaged to acquire the set of images.	12-18-2014

Patent applications by Zhiwei Li, Beijing CN

Inventors list

Assignees list

Classification tree browser

Top 100 Inventors

Top 100 Assignees

Zhiwei Li, Beijing CN

Zhiwei Li, Beijing CN