Inventors list

Assignees list

Classification tree browser

Top 100 Inventors

Top 100 Assignees


Anirban Dasgupta

Anirban Dasgupta, Albany, CA US

Patent application numberDescriptionPublished
20100228804CONSTRUCTING IMAGE CAPTCHAS UTILIZING PRIVATE INFORMATION OF THE IMAGES - An image CAPTCHA having one or more images, a challenge, and a correct answer to the challenge is constructed by selecting the one or more images from a plurality of candidate images based at least in part on each image's public information and private information. The private information of each of the images is accessible only to an entity responsible for constructing the CAPTCHA. Optionally, the one or more images are selected further based on the specific type of the CAPTCHA to be constructed.09-09-2010

Anirban Dasgupta, Berkeley, CA US

Patent application numberDescriptionPublished
20090164502SYSTEMS AND METHODS OF UNIVERSAL RESOURCE LOCATOR NORMALIZATION - Disclosed herein are method, systems and architectures for normalizing identifiers corresponding to resources using normalization rules that can be generalized for use with different resources. By way of a non-limiting example, an identifier can be a uniform resource locator (URL), and a normalization rule can be used to normalize URLs that correspond to different resources, e.g., content. A normalization rule can be generated by generalizing two or more normalization rules corresponding to different resources, such that a content determinative component is generalized. A normalization rule can be defined to include a context portion used to determine the rule's applicability to an identifier, and a transformation portion that identifies the transformations to be applied to an applicable identifier to yield a normalized form of the URL. A generalization of two or more normalization rules can include a normalization of one or both of the context and transformation portions.06-25-2009
20090171870System and method of feature selection for text classification using subspace sampling - An improved system and method is provided for feature selection for text classification using subspace sampling. A text classifier generator may be provided for selecting a small set of features using subspace sampling from the corpus of training data to train a text classifier for using the small set of features for classification of texts. To select the small set of features, a subspace of features from the corpus of training data may be randomly sampled according to a probability distribution over the set of features where a probability may be assigned to each of the features that is proportional to the square of the Euclidean norms of the rows of left singular vectors of a matrix of the features representing the corpus of training texts. The small set of features may classify texts using only the relevant features among a very large number of training features.07-02-2009
20090327168PLAYFUL INCENTIVE FOR LABELING CONTENT - Embodiments are directed towards employing a playful incentive to encourage users to provide feedback that is useable to train a classifier. The classifier being associated with any of a variety of different settings, including but not limited to classifying: messages as ham/spam, images, advertising, bookmarking, music, videos, photographs, shopping, or the like. An animated image, such as a pet, provides an interface to the classifier that encourages and responds to user feedback. Users may share their classifiers or aspects thereof with other users to enable a community of knowledge to be applied to a classification task, while preserving privacy of the user feedback. One form of sharing may be within the context of a competitive game. Various evaluations may be performed on a classifier to indicate user feedback consistency, or quality. Classifiers may also be used to provide users with advertisements, products, or services based on the user's feedback.12-31-2009
20100063881ALGORITHM FOR STORYBOARDING IN DISPLAY ADVERTISING - Methods and system for optimally allocating ad space to advertisers on a webpage viewed by a user in a single browsing session includes identifying a plurality of advertisement stories that match the content of the webpage. An advertisement pool is generated using the identified ad stories. Each ad story in the advertisement pool includes one or more advertisement pages and is associated with corresponding ad value. An ad story from the pool of ad stories is chosen by dynamically evaluating ad value associated with each ad story in the pool based on continued surfing by the user such that the identified ad story provides the maximum ad value when rendered on the webpage. The identified ad story is scheduled for rendering on the webpage while providing relevant ad content at the webpage.03-11-2010

Anirban Dasgupta, Santa Clara, CA US

Patent application numberDescriptionPublished
20090037447Mail Compression Scheme with Individual Message Decompressability - Embodiments of the present inversion relate to a two-pass compression scheme that achieves compression performance on par with existing methods while admitting individual message decompression. These methods provide both storage savings and lower end-user latency. They preserve the advantages of standard text compression in exploiting short-range similarities in data, while introducing a second step to take advantage of long-range similarities often present in certain types of structured data, e.g. email archival files.02-05-2009
20090112865HIERARCHICAL STRUCTURE ENTROPY MEASUREMENT METHODS AND SYSTEMS - Methods and apparatuses are provided for accessing taxonomic data associated with an item as classified into a taxonomy having a hierarchical structure, establishing dependency data associated with a distribution represented in the taxonomic data, and determining entropic data for the item based, at least in part, on the distribution and established dependency.04-30-2009
20090164582JURY SYSTEM FOR USE IN ONLINE ANSWERS ENVIRONMENT - A system for resolving disputes in an online answers community is disclosed. The system improves the system's ability to resolve reports of abuse.06-25-2009