Patent application number | Description | Published |
20080319978 | HYBRID SYSTEM FOR NAMED ENTITY RESOLUTION - A method for named entity resolution includes parsing an input text string to identify a context in which an identified named entity of the input text string is used. The identified context is compared with at least one stored context in which the named entity in the stored context is associated with a class of named entity, the named entity class being selected from a plurality of classes, at least one of the plurality of classes corresponding to a metonymic use of a respective named entity. A named entity class is assigned to the identified named entity from the plurality of named entity classes, based on at least one of the identified context and the comparison. | 12-25-2008 |
20090204596 | SEMANTIC COMPATIBILITY CHECKING FOR AUTOMATIC CORRECTION AND DISCOVERY OF NAMED ENTITIES - A computer implemented system and method for processing text are disclosed. Partially processed text, in which named entities have been extracted by a standard named entity system, is processed to identify attributive relations between a named entity or proper noun and a corresponding attribute. A concept for the attribute is identified and, in the case of a named entity, compared with the named entity's context, enabling a confirmation or conflict between the two to be determined. In the case of a proper name, the attribute's context can be associated with the proper name, allowing the proper name to be recognized as a new named entity. | 08-13-2009 |
20100082331 | SEMANTICALLY-DRIVEN EXTRACTION OF RELATIONS BETWEEN NAMED ENTITIES - A system and method of developing rules for text processing enable retrieval of instances of named entities in a predetermined semantic relation (such as the DATE and PLACE of an EVENT) by extracting patterns from text strings in which attested examples of named entities satisfying the semantic relation occur. The patterns are generalized to form rules which can be added to the existing rules of a syntactic parser and subsequently applied to text to find candidate instances of other named entities in the predetermined semantic relation. | 04-01-2010 |
20100318398 | NATURAL LANGUAGE INTERFACE FOR COLLABORATIVE EVENT SCHEDULING - A collaborative event scheduling method and system are provided which allow participants and an event initiator to interact with a scheduler in a natural language form. Participants provide a respective availability announcement, which is processed to generate a representation of the user's availability within a time window specified for the event by the initiator. This includes extracting a temporal expression from the availability announcement, normalizing the temporal expression if it is determined to be referential, and identifying an availability modality for each extracted temporal expression from a set of availability modalities. The generated representation is output for establishing a suitable time for the event within the time window based on the availability announcements of the participants. | 12-16-2010 |
20110099052 | AUTOMATIC CHECKING OF EXPECTATION-FULFILLMENT SCHEMES - A system, apparatus, method, and computer program product encoding the method are provided for expectation fulfillment evaluation. The system includes a natural language processing component that extracts sets of normalized tasks from an input expectation document and an input fulfillment document. A task list comparison component compares the two sets of tasks and identifies each match between a normalized task in the first set and a normalized task in the second set, each normalized task in the first set which has no matching task in the second set, and each normalized task in the second set which has no matching task in the first set. A report generator outputs a report based on the comparison. The report may further include one or more of statistics generated from the comparison, information on an opinion generated by opinion mining a third document, and a list of the normalized tasks and an indication of whether the tasks were fulfilled, derived from analysis of temporal expressions in the two documents. The system may be implemented as software stored in memory and executed by an associated computer processor. | 04-28-2011 |
20110123967 | DIALOG SYSTEM FOR COMPREHENSION EVALUATION - An automated system, apparatus and method for evaluation of comprehension are disclosed. The method includes receiving an input text and natural language processing the text to identify dependencies between text elements in the input text. Grammar rules are applied to generate questions and associated answers from the processed text, at least some of the questions being based on the identified dependencies. A set of the generated questions is posed to a reader of the input text and the comprehension of the reader evaluated, based on the reader's responses to the questions posed. | 05-26-2011 |
20110225155 | SYSTEM AND METHOD FOR GUIDING ENTITY-BASED SEARCHING - A system and method are provided for refining a user's query. An entity index, generated from a corpus of text documents, is provided. The entity index includes a set of entity structures, each including a plurality of terms. Each of the terms of an entity structure is a feature of the same entity. Entity structures can be retrieved from the entity index which match at least a portion of the user's query. Clusters of the retrieved entity structures are identified which have at least one of their terms in common. A cluster hierarchy is generated from the identified clusters in which nodes of the hierarchy are defined by one or more of the terms of the retrieved entity structures. At least a portion of the cluster hierarchy is presented to the user for facilitating refinement of the user's query through user selection of a node which, when formulated as a search, retrieves one or more responsive documents from the corpus of documents. | 09-15-2011 |
20120035905 | SYSTEM AND METHOD FOR HANDLING MULTIPLE LANGUAGES IN TEXT - A system and method for processing text are disclosed. The method includes receiving text to be processed. A main language of the text is identified. At least one unknown sequence in the text is identified, each unknown sequence comprising at least one word that is unknown in the main language. For a secondary language, for each of the at least one unknown sequence, the method includes determining whether the unknown sequence includes a first word recognized in the secondary language and, if so, identifying a sequence of words in the secondary language which includes at least the first word. The identifying of the sequence of words in the secondary language includes applying an algorithm for determining whether the sequence of words in the secondary language is expandable beyond the first word to include adjacent words. The text is labeled based on the identified sequences of words in the secondary language. | 02-09-2012 |
20120035914 | SYSTEM AND METHOD FOR HANDLING MULTIPLE LANGUAGES IN TEXT - A system and method for processing text are disclosed. The method includes receiving text to be processed. A main language of the text is identified. At least one unknown sequence in the text is identified, each unknown sequence comprising at least one word that is unknown in the main language. For a secondary language, for each of the at least one unknown sequence, the method includes determining whether the unknown sequence includes a first word recognized in the secondary language and, if so, identifying a sequence of words in the secondary language which includes at least the first word. The identifying of the sequence of words in the secondary language includes applying an algorithm for determining whether the sequence of words in the secondary language is expandable beyond the first word to include adjacent words. The text is labeled based on the identified sequences of words in the secondary language. | 02-09-2012 |
20120226707 | LINGUISTICALLY ENHANCED EMAIL DETECTOR - A computer-implemented system and method are provided for warning a user of a missing attachment to an email. The method may include automatically recognizing a natural language of text of an email and selecting a keyword list from a plurality of keyword lists, based on the recognized natural language. Each keyword list is associated with a respective natural language and includes at least one keyword. At least one of the keyword lists includes a multi-sense keyword having a plurality of senses. A first of the plurality of senses is recognized as referring to an attachment and a second of the plurality of senses is recognized as not referring to an attachment. The text of the email is processed to identify an instance, where present, of a keyword that is in the selected keyword list and, for a keyword which is a multi-sense keyword, at least one sense-related rule is applied to a portion of the text which includes the instance of the multi-sense keyword. Based on the application of the at least one sense-related rule, where the email lacks an attachment, a notification is provided to the user. | 09-06-2012 |
20120245923 | CORPUS-BASED SYSTEM AND METHOD FOR ACQUIRING POLAR ADJECTIVES - A system, method, and computer program product for generating a polar vocabulary are provided. The method includes extracting textual content from each review in a corpus of reviews. Each of the reviews includes an author's rating, e.g., of a specific product or service to which the textual content relates. A set of frequent nouns is identified from the textual content of the reviews. Adjectival terms are extracted from the textual content of the reviews. Each adjectival term is associated in the textual content with one of the frequent nouns. A polar vocabulary including at least some of the extracted adjectival terms is generated. A polarity measure is associated with each adjectival term in the vocabulary which is based on the ratings of those reviews from which the adjectival term was extracted. | 09-27-2012 |
20120245924 | CUSTOMER REVIEW AUTHORING ASSISTANT - An authoring assistant includes a parser which automatically identifies opinion expressions in input text. The text may include an author's review of an item, such as a product or service. A computer-implemented opinion review component generates an analysis of the text, which is based on the identified opinion expressions. The opinion review component computes an effective opinion of the text as a function of a measure of polarity associated with the identified opinion expressions. A representation generator generates a representation of the analysis for display on an associated user interface. The representation of the analysis includes a representation of the effective opinion. In the case of a review, the authoring assistant may allow the author to modify the review to reduce incoherence with a rating of the item. | 09-27-2012 |
20130080152 | LINGUISTICALLY-ADAPTED STRUCTURAL QUERY ANNOTATION - A system and method for natural language processing of queries are provided. A lexicon includes text elements that are recognized as being a proper noun when capitalized. A natural language query includes a sequence of text elements including words. The query is processed. The processing includes a preprocessing step, in which part of speech features are assigned to the text elements in the query. This includes identifying, from a lexicon, a text element in the query which starts with a lowercase letter and assigning recapitalization information to the text element in the query, based on the lexicon. This information includes a part of speech feature of the capitalized form of the text element. Then parts of speech for the text elements in the query are disambiguated, which includes applying rules for recapitalizing text elements based on the recapitalization information. | 03-28-2013 |
20130096909 | SYSTEM AND METHOD FOR SUGGESTION MINING - A system and method for extraction of suggestions for improvement from a corpus of documents, such as customer reviews, are disclosed. A structured terminology provided for a topic includes a set of semantic classes, each including a set of terms. A thesaurus of terms relating to suggestions of improvement is provided. Text elements of text strings in the documents which are instances of terms in the structured terminology are labeled with the corresponding semantic class and text elements which are instances of terms in the thesaurus are also labeled. A set of patterns is applied to the labeled text strings to identify suggestions of improvement expressions. The patterns define syntactic relations between text elements, some of which are required to be instances of one of the terms in a particular semantic class or thesaurus. A set of suggestions for improvements is output based on the identified suggestions of improvement expressions. | 04-18-2013 |
20130218914 | SYSTEM AND METHOD FOR PROVIDING RECOMMENDATIONS BASED ON INFORMATION EXTRACTED FROM REVIEWERS' COMMENTS - A recommendation method includes receiving a user's review of an item that includes a textual comment. Deficient features of the reviewed item are identified from the text by applying a set of extraction patterns. Each pattern is satisfied when a term in the text, which is associated in a structured terminology with one of a predefined set of features, is in a syntactic relation with another term in the text, such as a polar adjective or expression of a wish or a lack. When such a pattern is satisfied, the corresponding feature is considered a deficient feature. Feature attributes of the reviewed item are compared with corresponding feature attributes of a set of items to identify any improved items whose attribute for the deficient feature is better than that for the reviewed item. The improved item or items can be recommended to the user or to others reading the review. | 08-22-2013 |
20140067369 | METHODS AND SYSTEMS FOR ACQUIRING USER RELATED INFORMATION USING NATURAL LANGUAGE PROCESSING TECHNIQUES - Systems and methods for acquiring information associated with a user by using NLP techniques are disclosed. One or more phrases are classified in one or more categories at least partly on the basis of a period for which a product has been used by the user, the user's experience with the product, preferences of the user, or needs of the user by applying one or more natural language processing (NLP) techniques. The one or more phrases are extractable from an electronic publication at least partly on the basis of a predefined set of verbs, a predefined set of domain-specific terms, and terms indicative of temporal information. One or more terms from the classified phrases are extracted, in which the one or more terms are indicative of the information about the user. | 03-06-2014 |
20140067370 | LEARNING OPINION-RELATED PATTERNS FOR CONTEXTUAL AND DOMAIN-DEPENDENT OPINION DETECTION - A method for extracting opinion-related patterns includes receiving a corpus of reviews, the reviews each including an explicit rating of a topic. The reviews are partitioned among a predefined plurality of classes, based on the rating. Syntactic relations are identified in each review. The syntactic relations may each include an adjective and a noun. A set of patterns is generated, each of the patterns having at least one of the identified syntactic relations as an instance, and the patterns are clustered into a set of clusters based on a set of features. At least one of the features is based on occurrences, in the predefined classes, of the instances of the patterns. A polarity is assigned to ones of the clusters and propagated to patterns in the respective clusters. The polarity-labeled patterns can each be instantiated as a contextual rule for opinion mining. | 03-06-2014 |
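
Several of the abstracts above, notably 20120245923 (polar adjectives), describe deriving a polarity measure for an adjectival term from the ratings of the reviews in which it occurs. The following is a minimal sketch of that idea only, not the method claimed in any of the listed applications. It assumes the adjectives have already been extracted and paired with the rating of their source review, that ratings lie on a 1 to 5 scale, and that the mean rating is a reasonable polarity measure. The function name `polarity_scores` and the parameters `min_rating`, `max_rating`, and `min_count` are hypothetical.

```python
from collections import defaultdict


def polarity_scores(observations, min_rating=1, max_rating=5, min_count=2):
    """Map each adjective to a polarity in [-1, 1] from review ratings.

    observations: iterable of (adjective, rating) pairs, one pair per
    occurrence of an adjective in a rated review (assumed input format).
    """
    totals = defaultdict(float)
    counts = defaultdict(int)
    for adjective, rating in observations:
        totals[adjective] += rating
        counts[adjective] += 1

    # Rescale the mean rating so the scale midpoint maps to 0,
    # the lowest rating to -1 and the highest rating to +1.
    midpoint = (min_rating + max_rating) / 2
    half_range = (max_rating - min_rating) / 2
    return {
        adjective: (totals[adjective] / counts[adjective] - midpoint) / half_range
        for adjective in totals
        if counts[adjective] >= min_count  # skip adjectives seen too rarely
    }


# Illustrative data only, not taken from any real corpus.
observations = [
    ("sturdy", 5), ("sturdy", 4),
    ("flimsy", 1), ("flimsy", 2),
    ("blue", 3),
]
scores = polarity_scores(observations)
print(scores)
```

In this sketch, terms seen fewer than `min_count` times are left out of the vocabulary, so "blue" receives no score. A fuller treatment would also restrict the adjectives to those attached to frequent nouns, as the abstract describes.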