Inventors list

Assignees list

Classification tree browser

Top 100 Inventors

Top 100 Assignees

Anirban Dasgupta

Anirban Dasgupta, Albany, CA US

Patent application number	Description	Published
20100228804	CONSTRUCTING IMAGE CAPTCHAS UTILIZING PRIVATE INFORMATION OF THE IMAGES - An image CAPTCHA having one or more images, a challenge, and a correct answer to the challenge is constructed by selecting the one or more images from a plurality of candidate images based at least in part on each image's public information and private information. The private information of each of the images is accessible only to an entity responsible for constructing the CAPTCHA. Optionally, the one or more images are selected further based on the specific type of the CAPTCHA to be constructed.	09-09-2010
20110173142	APPARATUS AND METHODS FOR CLASSIFYING SENDERS OF UNSOLICITED BULK EMAILS - Disclosed are methods and apparatus for facilitating the filtering of unsolicited bulk electronic mail (email) sent from spammers. A plurality of recipient patterns for a plurality of emails from known spammers is logged. A plurality of recipient patterns for a plurality of emails from known non-spammers is also logged. A probabilistic model for predicting whether an unknown sender identity is a spammer is generated or modified based on the logged recipient patterns for the emails from known spammers and known non-spammers.	07-14-2011
20120144016	System and Method for Counting Network Users - Embodiments presented herein provide methods, systems and computer program products for determining a count of network users. One method identifies one or more login access requests, from one or more server logs. Each of the one or more login access requests comprises a login cookie, and a user identifier. The method then forms one or more connected components based on the one or more login access requests. Each connected component comprises one or more user identifiers, wherein each of the one or more user identifiers is associated with one or more login cookies. The method finally determines a count of login users based, at least in part, on the user identifiers in the one or more connected components.	06-07-2012

Patent applications by Anirban Dasgupta, Albany, CA US

Anirban Dasgupta, Berkeley, CA US

Patent application number	Description	Published
20090164502	SYSTEMS AND METHODS OF UNIVERSAL RESOURCE LOCATOR NORMALIZATION - Disclosed herein are method, systems and architectures for normalizing identifiers corresponding to resources using normalization rules that can be generalized for use with different resources. By way of a non-limiting example, an identifier can be a uniform resource locator (URL), and a normalization rule can be used to normalize URLs that correspond to different resources, e.g., content. A normalization rule can be generated by generalizing two or more normalization rules corresponding to different resources, such that a content determinative component is generalized. A normalization rule can be defined to include a context portion used to determine the rule's applicability to an identifier, and a transformation portion that identifies the transformations to be applied to an applicable identifier to yield a normalized form of the URL. A generalization of two or more normalization rules can include a normalization of one or both of the context and transformation portions.	06-25-2009
20090171870	System and method of feature selection for text classification using subspace sampling - An improved system and method is provided for feature selection for text classification using subspace sampling. A text classifier generator may be provided for selecting a small set of features using subspace sampling from the corpus of training data to train a text classifier for using the small set of features for classification of texts. To select the small set of features, a subspace of features from the corpus of training data may be randomly sampled according to a probability distribution over the set of features where a probability may be assigned to each of the features that is proportional to the square of the Euclidean norms of the rows of left singular vectors of a matrix of the features representing the corpus of training texts. The small set of features may classify texts using only the relevant features among a very large number of training features.	07-02-2009
20090327168	PLAYFUL INCENTIVE FOR LABELING CONTENT - Embodiments are directed towards employing a playful incentive to encourage users to provide feedback that is useable to train a classifier. The classifier being associated with any of a variety of different settings, including but not limited to classifying: messages as ham/spam, images, advertising, bookmarking, music, videos, photographs, shopping, or the like. An animated image, such as a pet, provides an interface to the classifier that encourages and responds to user feedback. Users may share their classifiers or aspects thereof with other users to enable a community of knowledge to be applied to a classification task, while preserving privacy of the user feedback. One form of sharing may be within the context of a competitive game. Various evaluations may be performed on a classifier to indicate user feedback consistency, or quality. Classifiers may also be used to provide users with advertisements, products, or services based on the user's feedback.	12-31-2009
20100063881	ALGORITHM FOR STORYBOARDING IN DISPLAY ADVERTISING - Methods and system for optimally allocating ad space to advertisers on a webpage viewed by a user in a single browsing session includes identifying a plurality of advertisement stories that match the content of the webpage. An advertisement pool is generated using the identified ad stories. Each ad story in the advertisement pool includes one or more advertisement pages and is associated with corresponding ad value. An ad story from the pool of ad stories is chosen by dynamically evaluating ad value associated with each ad story in the pool based on continued surfing by the user such that the identified ad story provides the maximum ad value when rendered on the webpage. The identified ad story is scheduled for rendering on the webpage while providing relevant ad content at the webpage.	03-11-2010
20110196722	MARKETPLACE FOR CAPTCHA DEVELOPERS - Techniques are described herein for providing a marketplace for CAPTCHA developers. A CAPTCHA is a type of challenge-response test that a content provider may present to users for authorizing the users to access content that is hosted by the content provider. CAPTCHA developers may propose CAPTCHAs for testing in the marketplace. A server is configured to provide instances of proposed CAPTCHAs to users along with instances of reference CAPTCHAs that have known difficulties to determine the relative difficulties of the proposed CAPTCHAs for the users. The server may determine a reward to be provided to a developer of a proposed CAPTCHA based on a difficulty that the predetermined automated systems encounter when they attempt to solve the proposed CAPTCHA and an ease with which the users solve the proposed CAPTCHA. The server may determine rewards to be provided to developers of predetermined automated systems that solve the proposed CAPTCHAs.	08-11-2011
20110197268	CAPTCHAS THAT INCLUDE OVERLAPPED CHARACTERS, PROJECTIONS ON VIRTUAL 3D SURFACES, AND/OR VIRTUAL 3D OBJECTS - Techniques are described herein for generating CAPTCHAs that include overlapped characters, projections on virtual three-dimensional (3D) surfaces, and/or virtual 3D objects. A CAPTCHA is a type of challenge-response test that a content provider may present to users for authorizing the users to access content that the content provider hosts. For example, when a user attempts to access content, a CAPTCHA may be generated in accordance with one or more of the techniques described herein and provided to the user. The user may be asked to identify characters that overlap in the CAPTCHA, characters that are projected on a virtual 3D surface, and/or a designated virtual 3D object, so that the user may be authorized to access the content. The user may enter the characters and/or select the designated virtual 3D object that is identified in the CAPTCHA using an input device, such as a keyboard, touch screen, pointing device, etc.	08-11-2011
20120084832	Time Managed Challenge-Response Test - A method of generating a time managed challenge-response test is presented. The method identifies a geometric shape having a volume and generates an entry object of the time managed challenge-response test. The entry object is overlaid onto the geometric shape, such that the entry object is distributed over a surface of the geometric shape, and a portion of the entry object is hidden at any point in time. The geometric shape is rotated, which reveals the portion of the entry object that is hidden. A display region on a display is identified for rendering the geometric shape and the geometric shape is presented in the display region of the display.	04-05-2012
20120166379	CLUSTERING COOKIES FOR IDENTIFYING UNIQUE MOBILE DEVICES - Embodiments are directed towards clustering cookies for identifying unique mobile devices for associating activities over a network with a given mobile device. The cookies are clustered based on a Bayes Factor similarity model that is trained from cookie features of known mobile devices. The clusters may be used to determine the number of unique mobile devices that access a website. The clusters may also be used to provide targeted content to each unique mobile device.	06-28-2012
20130031059	METHOD AND SYSTEM FOR FAST SIMILARITY COMPUTATION IN HIGH DIMENSIONAL SPACE - Method, system, and programs for computing similarity. Input data is first received from one or more data sources and then analyzed to obtain an input feature vector that characterizes the input data. An index is then generated based on the input feature vector and is used to archive the input data, where the value of the index is computed based on an improved Johnson-Lindenstrass transformation (FJLT) process. With the improved FJLT process, first, the sign of each feature in the input feature vector is randomly flipped to obtain a flipped vector. A Hadamard transformation is then applied to the flipped vector to obtain a transformed vector. An inner product between the transformed vector and a sparse vector is then computed to obtain a base vector, based on which the value of the index is determined.	01-31-2013
20130159227	CLUSTERING COOKIES FOR IDENTIFYING UNIQUE MOBILE DEVICES - Embodiments are directed towards clustering cookies for identifying unique mobile devices for associating activities over a network with a given mobile device. The cookies are clustered based on a Bayes Factor similarity model that is trained from cookie features of known mobile devices. The clusters may be used to determine the number of unique mobile devices that access a website. The clusters may also be used to provide targeted content to each unique mobile device.	06-20-2013

Patent applications by Anirban Dasgupta, Berkeley, CA US

Anirban Dasgupta, Santa Clara, CA US

Patent application number	Description	Published
20090037447	Mail Compression Scheme with Individual Message Decompressability - Embodiments of the present inversion relate to a two-pass compression scheme that achieves compression performance on par with existing methods while admitting individual message decompression. These methods provide both storage savings and lower end-user latency. They preserve the advantages of standard text compression in exploiting short-range similarities in data, while introducing a second step to take advantage of long-range similarities often present in certain types of structured data, e.g. email archival files.	02-05-2009
20090112865	HIERARCHICAL STRUCTURE ENTROPY MEASUREMENT METHODS AND SYSTEMS - Methods and apparatuses are provided for accessing taxonomic data associated with an item as classified into a taxonomy having a hierarchical structure, establishing dependency data associated with a distribution represented in the taxonomic data, and determining entropic data for the item based, at least in part, on the distribution and established dependency.	04-30-2009
20090164582	JURY SYSTEM FOR USE IN ONLINE ANSWERS ENVIRONMENT - A system for resolving disputes in an online answers community is disclosed. The system improves the system's ability to resolve reports of abuse.	06-25-2009
20130346322	Jury system for use in online answers environment - A system for resolving disputes in an online answers community is disclosed. The system improves the system's ability to resolve reports of abuse.	12-26-2013