Patent application number | Description | Published |
20080221874 | Method and Apparatus for Fast Semi-Automatic Semantic Annotation - A method, apparatus and computer instructions is provided for fast semi-automatic semantic annotation. Given a limited annotated corpus, the present invention assigns a tag and a label to each word of the next limited annotated corpus using a parser engine, a similarity engine, and a SVM engine. A rover then combines the parse trees from the three engines and annotates the next chunk of limited annotated corpus with confidence, such that the efforts required for human annotation is reduced. | 09-11-2008 |
20080270115 | SYSTEM AND METHOD FOR DIACRITIZATION OF TEXT - A system and method for restoration of diacritics includes making classification decisions regarding an utterance in accordance with an aggregate of a plurality of information sources in a diacritization model for diacritic restoration. A best diacritic representation is determined for graphemes in the utterance based upon a best match with the diacritization model. A diacritically restored representation of the utterance is output. | 10-30-2008 |
20090018833 | MODEL WEIGHTING, SELECTION AND HYPOTHESES COMBINATION FOR AUTOMATIC SPEECH RECOGNITION AND MACHINE TRANSLATION - A translation method and system include a recognition engine having a plurality of models each being employed to decode a same utterance to provide an output. A model combiner is configured to assign probabilities to each model output and configured to assign weights to the outputs of the plurality of models based on the probabilities to provide a best performing model for the context of the utterance. | 01-15-2009 |
20090248394 | MACHINE TRANSLATION IN CONTINUOUS SPACE - A system and method for training a statistical machine translation model and decoding or translating using the same is disclosed. A source word versus target word co-occurrence matrix is created to define word pairs. Dimensionality of the matrix may be reduced. Word pairs are mapped as vectors into continuous space where the word pairs are vectors of continuous real numbers and not discrete entities in the continuous space. A machine translation parametric model is trained using an acoustic model training method based on word pair vectors in the continuous space. | 10-01-2009 |
20110191096 | GAME BASED METHOD FOR TRANSLATION DATA ACQUISITION AND EVALUATION - A method of generating a statistical machine translation database through a game in which a monolingual structure is provided to a plurality of players. A first translation attempt is received from each of the plurality of players. The first translation attempt from each of the plurality of players is compared. Feedback is provided to each of the plurality of players and the attempts are received and compared to provide feedback to iteratively converge subsequent translations from each of the plurality of players into a final translated structure. | 08-04-2011 |
20110282648 | Machine Translation with Side Information - A method of identifying and using side information available to statistical machine translation systems within an enterprise setting, the method including extracting user-specific interaction and non-interaction-based information from at least one corresponding database within the enterprise for each of a plurality of users, aggregating the user-specific interaction and non-interaction based information from a plurality of users, by using a processor on a computer, to tune and adapt background translation and language models, and updating all relevant models within the enterprise after user activity based on the tuned and adapted translation and language models. | 11-17-2011 |
20120245897 | Virtualized Abstraction with Built-in Data Alignment and Simultaneous Event Monitoring in Performance Counter Based Application Characterization and Tuning - Techniques for monitoring a set of one or more event counters of application execution are provided. The techniques include constructing a virtual performance monitoring counter (VPMC) layer as a unified abstraction of a physical performance monitoring counter (PMC) architecture, and incorporating one or more programming interfaces (PIs) in connection with the virtual performance monitoring counter, wherein the one or more programming interfaces facilitate simultaneous access and data monitoring across a set of one or more event counters. | 09-27-2012 |
20130024403 | AUTOMATICALLY INDUCED CLASS BASED SHRINKAGE FEATURES FOR TEXT CLASSIFICATION - A method and apparatus are provided for automatically inducing class based shrinkage features. The method includes clustering each word in a set of word groupings of a given type into a respective one of a plurality of classes. The method further includes selecting and extracting a set of class-based shrinkage features from the set of word groupings based on the plurality of classes. The set of class-based shrinkage features is specifically selected for an intended classification application. | 01-24-2013 |
20130073276 | MT Based Spoken Dialog Systems Customer/Machine Dialog - Operation of an automated dialog system is described using a source language to conduct a real time human machine dialog process with a human user using a target language. A user query in the target language is received and automatically machine translated into the source language. An automated reply of the dialog process is then delivered to the user in the target language. If the dialog process reaches an initial assistance state, a first human agent using the source language is provided to interact in real time with the user in the target language by machine translation to continue the dialog process. Then if the dialog process reaches a further assistance state, a second human agent using the target language is provided to interact in real time with the user in the target language to continue the dialog process. | 03-21-2013 |