Patent application number | Description | Published |
20090030686 | Method and system for computing or determining confidence scores for parse trees at all levels - In a confidence computing method and system, a processor may interpret speech signals as a text string or directly receive a text string as input, generate a syntactical parse tree representing the interpreted string and including a plurality of sub-trees which each represents a corresponding section of the interpreted text string, determine for each sub-tree whether the sub-tree is accurate, obtain replacement speech signals for each sub-tree determined to be inaccurate, and provide output based on corresponding text string sections of at least one sub-tree determined to be accurate. | 01-29-2009 |
20090055164 | Method and System of Optimal Selection Strategy for Statistical Classifications in Dialog Systems - An optimal selection or decision strategy is described through an example that includes use in dialog systems. The selection strategy or method includes receiving multiple predictions and multiple probabilities. The received predictions predict the content of a received input and each of the probabilities corresponds to one of the predictions. In an example dialog system, the received input includes an utterance. The selection method includes dynamically selecting a set of predictions from the received predictions by generating ranked predictions. The ranked predictions are generated by ordering the plurality of predictions according to descending probability. | 02-26-2009 |
20090055176 | Method and System of Optimal Selection Strategy for Statistical Classifications - An optimal selection or decision strategy is described through an example that includes use in dialog systems. The selection strategy or method includes receiving multiple predictions and multiple probabilities. The received predictions predict the content of a received input and each of the probabilities corresponds to one of the predictions. In an example dialog system, the received input includes an utterance. The selection method includes dynamically selecting a set of predictions from the received predictions by generating ranked predictions. The ranked predictions are generated by ordering the plurality of predictions according to descending probability. | 02-26-2009 |
20090119104 | Switching Functionality To Control Real-Time Switching Of Modules Of A Dialog System - Systems and methods are described that automatically control modules of dialog systems. The systems and methods include a dialog module that receives and processes utterances from a speaker and outputs data used to generate synthetic speech outputs as responses to the utterances. A controller is coupled to the dialog module, and the controller detects an abnormal output of the dialog module when the dialog module is processing in an automatic mode. The controller comprises a mode control for an agent to control the dialog module by correcting the abnormal output and transferring a corrected output to a downstream dialog module that follows, in a processing path, the dialog module. The corrected output is used in further processing the utterances. | 05-07-2009 |
20090119586 | Automatic Generation of Interactive Systems From a Formalized Description Language - Systems and methods are described that automatically generate interactive systems configured for collecting dialog data of human-machine interactions in dialog systems. The systems and methods comprise receiving a task flow that describes operations of a dialog system. A formal description of the task flow is generated, and an interactive system comprising a graphical user interface (GUI) is automatically generated from the formal description. The GUI consists of templates for control of the dialog system and real-time collection and annotating of dialog data during a live dialog between only the dialog system and callers to the dialog system. The dialog data consists of data of the live dialog. | 05-07-2009 |
20090125813 | METHOD AND SYSTEM FOR PROCESSING MULTIPLE DIALOG SESSIONS IN PARALLEL - A dialog system and method may generate and maintain in parallel multiple dialog sessions, determine to which dialog session a user speech input applies, selectively provide control to one of the dialog sessions, at any one time, to output data to the user, synchronize multiple dialog sessions, and support user interruptions at any time during the dialog sessions. | 05-14-2009 |
20090216711 | Method and system for implementing multiple web services for a service query - In a system and method for fulfilling a service query for a user, a processor may parse the query into a set of operations, identify a set of service providers that each provides functionality for performing at least one respective operation of the set of operations, and, for each of the set of operations, select a respective one of the set of service providers to perform the operation, and interface with the service provider selected for the operation to cause the service provider to perform the operation. | 08-27-2009 |
20090306995 | System and Method for Automated Testing of Complicated Dialog Systems - Embodiments of an automated dialog system testing method and component are described. This automated testing method and system supplements real human-based testing with simulated user input and incorporates a set of evaluation measures that focus on three basic aspects of task-oriented dialog systems, namely, understanding ability, efficiency, and the appropriateness of system actions. These measures are first applied to a corpus generated between a dialog system and a group of human users to demonstrate the validity of these measures with the human users' satisfaction levels. Results generally show that these measures are significantly correlated with these satisfaction levels. A regression model is then built to predict the user satisfaction scores using these evaluation measures. The regression model is applied to a simulated dialog corpus trained from the above real user corpus, and shows that the user satisfaction scores estimated from the simulated dialogs do not significantly differ from the real users' satisfaction scores. These evaluation measures can then be used to assess the system performance based on the estimated user satisfaction. | 12-10-2009 |
20090313016 | System and Method for Detecting Repeated Patterns in Dialog Systems - Embodiments of a method and system for detecting repeated patterns in dialog systems are described. The system includes a dynamic time warping (DTW) based pattern comparison algorithm that is used to find the best matching parts between a correction utterance and an original utterance. Reference patterns are generated from the correction utterance by an unsupervised segmentation scheme. No significant information about the position of the repeated parts in the correction utterance is assumed, as each reference pattern is compared with the original utterance from the beginning of the utterance to the end. A pattern comparison process with DTW is executed without knowledge of fixed end-points. A recursive DTW computation is executed to find the best matching parts that are considered as the repeated parts as well as the end-points of the utterance. | 12-17-2009 |
20100057463 | System and Method for Generating Natural Language Phrases From User Utterances in Dialog Systems - Embodiments of a dialog system that employs a corpus-based approach to generate responses based on a given number of semantic constraint-value pairs are described. The system makes full use of the data from the user input to produce dialog system responses in combination with a template generator. The system primarily utilizes constraint values in order to realize efficiencies based on the more frequent tasks performed in real dialog systems although rhetorical or discourse aspects of the dialog could also be included in a similar way, that is, labeling the data with such information and performing a training process. The benefits of this system include higher quality user-aligned responses, broader coverage, faster response time, and shorter development cycles. | 03-04-2010 |
20100124325 | System and Method for Interacting with Live Agents in an Automated Call Center - Embodiments of an interface system that enables a call center agent to access and intervene in an interaction between an automated call center system and a caller whenever necessary for complex application tasks are described. The system includes a user interface that presents the agent with one or more categories of information, including the conversation flow, obtained semantic information, the recognized utterances, and access to the utterance waveforms. This information is cross-linked and attached with a confidence level for better access and navigation within the dialog system for the generation of appropriate responses to the caller. | 05-20-2010 |
20100125456 | System and Method for Recognizing Proper Names in Dialog Systems - Embodiments of a dialog system that utilizes contextual information to perform recognition of proper names are described. Unlike present name recognition methods on large name lists that generally focus strictly on the static aspect of the names, embodiments of the present system take into account the temporal, recency, and context effects when names are used, and formulate new questions to further constrain the search space or grammar for recognition of the past and current utterances. | 05-20-2010 |
20100204982 | System and Method for Generating Data for Complex Statistical Modeling for use in Dialog Systems - Embodiments of a dialog system that utilizes a grammar-based labeling scheme to generate labeled sentences for use in training statistical models are described. During the process of training data development, a grammar is constructed manually based on the application domain or adapted from a general grammar rule. An annotation schema is created accordingly based on the application requirements, such as syntactic and semantic information. Such information is then included in the grammar specification. After the labeled grammar is constructed, a generation algorithm is then used to generate sentences for training various statistical models. | 08-12-2010 |
20100241431 | System and Method for Multi-Modal Input Synchronization and Disambiguation - Embodiments of a dialog system that utilizes a multi-modal input interface for recognizing user input in human-machine interaction (HMI) systems are described. Embodiments include a component that receives user input from a plurality of different user input mechanisms (multi-modal input) and performs certain synchronization and disambiguation processes. The multi-modal input component synchronizes and integrates the information obtained from different modalities, disambiguates the input, and recovers from any errors that might be produced with respect to any of the user inputs. Such a system effectively addresses any ambiguity associated with the user input and corrects for errors in the human-machine interaction. | 09-23-2010 |
20100250237 | INTERACTIVE MANUAL, SYSTEM AND METHOD FOR VEHICLES AND OTHER COMPLEX EQUIPMENT - A method and system of providing an interactive manual, including a speech engine to receive and process speech from a user, convert the speech into a word sequence, and identify meaning structures from the word sequence, a structured manual including information related to an operation of a device, a visual model to relate visual representation of the information, a dialog management arrangement to interpret the meaning structures in a context and to extract pertinent information and the visual representation from the structured manual and the visual model, and an output arrangement to output the information and visual representation. | 09-30-2010 |
20110022292 | METHOD AND SYSTEM FOR IMPROVING SPEECH RECOGNITION ACCURACY BY USE OF GEOGRAPHIC INFORMATION - A method for speech recognition includes providing a source of geographical information within a vehicle. The geographical information pertains to a current location of the vehicle, a planned travel route of the vehicle, a map displayed within the vehicle, and/or a gesture marked by a user on a map. Words spoken within the vehicle are recognized by use of a speech recognition module. The recognizing is dependent upon the geographical information. | 01-27-2011 |
20110050589 | GESTURE-BASED INFORMATION AND COMMAND ENTRY FOR MOTOR VEHICLE - A method of receiving input from a user includes providing a surface within reach of a hand of the user. A plurality of locations on the surface that are touched by the user are sensed. An alphanumeric character having a shape most similar to the plurality of touched locations on the surface is determined. The user is audibly or visually informed of the alphanumeric character and/or a word in which the alphanumeric character is included. Feedback is received from the user regarding whether the alphanumeric character and/or word is an alphanumeric character and/or word that the user intended to be determined in the determining step. | 03-03-2011 |
20110144909 | PERSONALIZED AND CONTEXT-SENSITIVE 3D ROUTE GUIDANCE - A method of displaying a navigation map includes determining a route of a vehicle. Three-dimensional data is received. The data is associated with buildings that are disposed within a geographic area. The geographic area includes the route of the vehicle. At least one of the buildings in the data that is adjacent to at least one street of the route is rendered in a first opaque visual style. All of the buildings in the data that are nonadjacent to at least one street of the route are rendered in at least one second visual style different from the first opaque visual style. | 06-16-2011 |
20120101823 | SYSTEM AND METHOD FOR RECOGNIZING PROPER NAMES IN DIALOG SYSTEMS - Embodiments of a dialog system that utilizes contextual information to perform recognition of proper names are described. Unlike present name recognition methods on large name lists that generally focus strictly on the static aspect of the names, embodiments of the present system take into account the temporal, recency, and context effects when names are used, and formulate new questions to further constrain the search space or grammar for recognition of the past and current utterances. | 04-26-2012 |
20120105613 | ROBUST VIDEO-BASED HANDWRITING AND GESTURE RECOGNITION FOR IN-CAR APPLICATIONS - A method of receiving input from a user includes sensing a first trajectory of a center of mass of a hand of the user during a gesture made by the hand. A second trajectory of a finger tip of the hand of the user during the gesture made by the hand is also sensed. An alphanumeric character represented by the gesture made by the hand is determined dependent upon both the first trajectory and the second trajectory. | 05-03-2012 |
20120271631 | SPEECH RECOGNITION USING MULTIPLE LANGUAGE MODELS - In accordance with one embodiment, a method of generating language models for speech recognition includes identifying a plurality of utterances in training data corresponding to speech, generating a frequency count of each utterance in the plurality of utterances, generating a high-frequency plurality of utterances from the plurality of utterances having a frequency that exceeds a predetermined frequency threshold, generating a low-frequency plurality of utterances from the plurality of utterances having a frequency that is below the predetermined frequency threshold, generating a grammar-based language model using the high-frequency plurality of utterances as training data, and generating a statistical language model using the low-frequency plurality of utterances as training data. | 10-25-2012 |
20120303267 | METHOD AND SYSTEM FOR IMPROVING SPEECH RECOGNITION ACCURACY BY USE OF GEOGRAPHIC INFORMATION - A method for speech recognition includes providing a source of geographical information within a vehicle. The geographical information pertains to a current location of the vehicle, a planned travel route of the vehicle, a map displayed within the vehicle, and/or a gesture marked by a user on a map. Words spoken within the vehicle are recognized by use of a speech recognition module. The recognizing is dependent upon the geographical information. | 11-29-2012 |
20120303355 | Method and System for Text Message Normalization Based on Character Transformation and Web Data - A method for generating non-standard tokens that correspond to standard tokens used in speech synthesis systems has been developed. The method includes selecting a standard token from a plurality of standard tokens stored in memory, using a random field model to select a predetermined operation to perform on each character in the selected token, performing the selected operation on each character to generate an output token, and storing the output token in the memory in association with the selected token. The output token is different from each token in the plurality of standard tokens. | 11-29-2012 |
20130024113 | Selecting and Controlling the Density of Objects Rendered in Two-Dimensional and Three-Dimensional Navigation Maps - A method of displaying an electronic map includes receiving map data associated with a plurality of objects that are disposed within a geographic area. The map data is analyzed to thereby determine a state or value of a metric associated with one of the objects. The associated object is rendered in a low density or high density within the map depending upon the state or value of the metric. | 01-24-2013 |
20130132086 | METHODS AND SYSTEMS FOR ADAPTING GRAMMARS IN HYBRID SPEECH RECOGNITION ENGINES FOR ENHANCING LOCAL SR PERFORMANCE - A speech recognition method includes providing a processor communicatively coupled to each of a local speech recognition engine and a server-based speech recognition engine. A first speech input is inputted into the server-based speech recognition engine. A first recognition result from the server-based speech recognition engine is received at the processor. The first recognition result is based on the first speech input. The first recognition result is stored in a memory device in association with the first speech input. A second speech input is inputted into the local speech recognition engine. The first recognition result is retrieved from the memory device. A second recognition result is produced by the local speech recognition engine. The second recognition result is based on the second speech input and is dependent upon the retrieved first recognition result. | 05-23-2013 |
20130173258 | Broad-Coverage Normalization System For Social Media Language - A method for identification of a standard text token in a dictionary that corresponds to a non-standard token identified in text includes identification of a first standard token that is associated with the non-standard token using a predetermined conditional random field (CRF) model and identification of a second standard token that is associated with the non-standard token using a spell checker. The method further includes identification of noisy channel scores using data from the CRF model and the spell checker for the first standard token and the second standard token, respectively. The method further includes presentation of one of the first and second standard tokens having the greatest identified noisy channel score to a user with a user interface device. | 07-04-2013 |
20130173268 | SPEAKER VERIFICATION IN A HEALTH MONITORING SYSTEM - A method for verifying that a person is registered to use a telemedical device includes identifying an unprompted trigger phrase in words spoken by a person and received by the telemedical device. The telemedical device prompts the person to state a name of a registered user and optionally prompts the person to state health tips for the person. The telemedical device verifies that the person is the registered user using utterance data generated from the unprompted trigger phrase, name of the registered user, and health tips. | 07-04-2013 |
20130311916 | System and Method for Autocompletion and Alignment of User Gestures - In accordance with one embodiment, a navigation system includes a display device configured to detect user gestures, a data storage device including program instructions stored therein, a map stored within the data storage device, and a controller operatively coupled to the data storage device and the display device and configured to execute the program instructions to receive first user gesture data associated with a first portion of a first user gesture from the display device, generate a first predicted user gesture data based upon the stored map and the first user gesture data, and control the display device to render the first predicted user gesture data as the user is continuing the first user gesture. | 11-21-2013 |
20140019522 | System And Method Of Conversational Assistance For Automated Tasks With Integrated Intelligence - A method of providing information assistance services includes generating a plurality of service requests for a plurality of request elements that are generated from a single client request received by a processor. The service requests are sent to both software application service providers that are executed by the processor and remote service providers that are connected to the local processor through a data network. The processor receives a plurality of service responses from the service providers, generates at least one output message element corresponding to the service responses, and sends the output message data to at least one output device that is operatively connected to the processor to produce a response to the client request. | 01-16-2014 |
20140058584 | System And Method For Multimodal Interaction With Reduced Distraction In Operating Vehicles - A method of interaction with an in-vehicle information system includes receiving first and second inputs from an operator with first and second input devices, respectively. The method further includes identifying a service request corresponding to the first input, and a parameter of the service request with a value that is included in the second input with a controller in the in-vehicle information system. The controller executes stored program instructions to perform the identified service request with reference to the identified parameter. | 02-27-2014 |
20140121883 | System And Method For Using Gestures In Autonomous Parking - A method of providing parking assistance in a vehicle includes identifying with a controller in a vehicle a plurality of available parking spaces for the vehicle, generating with a video output device operatively connected to the controller an interface with a graphical depiction of the vehicle and the plurality of available parking spaces, receiving a first input gesture with a gesture input device to select one parking space from the plurality of available parking spaces, and operating the vehicle to park the vehicle in the one parking space using the controller configured with a parking assistance service in the vehicle. | 05-01-2014 |
20140172427 | System And Method For Event Summarization Using Observer Social Media Messages - A method for processing messages pertaining to an event includes receiving a plurality of messages pertaining to the event from electronic communication devices associated with a plurality of observers of the event, generating a first message stream that includes only a portion of the plurality of messages corresponding to a first participant in the event, identifying a first sub-event in the first message stream with reference to a time distribution of messages and content distribution of messages in the first message stream, generating a sub-event summary with reference to a portion of the plurality of messages in the first message stream that are associated with the first sub-event, and transmitting the sub-event summary to a plurality of electronic communication devices associated with a plurality of users who are not observers of the event. | 06-19-2014 |
20140270382 | System and Method for Identifying Handwriting Gestures In An In-Vehicle Information System - An in-vehicle information system includes a camera and a controller that accept gesture input. The controller receives frames of video data and generates trajectory data for a movement of a hand in the video data. The controller uses a first hidden Markov model (HMM) to decode a sequence of strokes from the trajectory data, removes a starting and ending stroke to form an edited stroke sequence, and re-normalizes the strokes in the edited stroke sequence. The controller uses a second HMM corresponding to a predetermined set of characters to identify a character corresponding to the re-normalized edited stroke sequence. | 09-18-2014 |
20140281957 | System and Method for Transitioning Between Operational Modes of an In-Vehicle Device Using Gestures - A method of receiving user input includes operating a software program in a first operating mode and modifying a graphical interface of the software program in response to receiving a gesture. The method also includes receiving a haptic input and operating the software program in a second operating mode in response to the haptic input. In the second operating mode, the gesture produces a different modification to the graphical interface than in the first operating mode. | 09-18-2014 |
20140365219 | Speaker Verification in a Health Monitoring System - A method for verifying that a person is registered to use a telemedical device includes identifying an unprompted trigger phrase in words spoken by a person and received by the telemedical device. The telemedical device prompts the person to state a name of a registered user and optionally prompts the person to state health tips for the person. The telemedical device verifies that the person is the registered user using utterance data generated from the unprompted trigger phrase, name of the registered user, and health tips. | 12-11-2014 |
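
Illustrative note on application 20120271631 (Speech Recognition Using Multiple Language Models): the abstract describes counting how often each utterance occurs in the training data and then splitting the utterances around a predetermined frequency threshold, with the high-frequency set feeding a grammar-based language model and the low-frequency set feeding a statistical language model. The Python sketch below shows only that counting and splitting step. It is not taken from the application: the function name `split_by_frequency`, the sample utterances, and the threshold value are hypothetical, and the handling of utterances whose count equals the threshold exactly (left in neither set, following a literal reading of "exceeds" and "is below") is an assumption. Training the two language models is out of scope here.

```python
from collections import Counter


def split_by_frequency(utterances, threshold):
    """Count each utterance and split the distinct utterances around a threshold.

    Hypothetical helper for illustration only. Returns (counts, high, low):
      counts: Counter mapping each utterance to its frequency
      high:   sorted utterances whose count exceeds the threshold
      low:    sorted utterances whose count is below the threshold
    Utterances whose count equals the threshold land in neither list
    (an assumption; the abstract does not say how ties are handled).
    """
    counts = Counter(utterances)
    high = sorted(u for u, c in counts.items() if c > threshold)
    low = sorted(u for u, c in counts.items() if c < threshold)
    return counts, high, low


if __name__ == "__main__":
    # Made-up training utterances, not data from the application.
    training = (
        ["call home"] * 3
        + ["play music"] * 2
        + ["navigate to the nearest pharmacy"]
    )
    counts, high, low = split_by_frequency(training, threshold=2)
    print("grammar-based model candidates:", high)
    print("statistical model candidates:", low)
```

With the sample data, "call home" (three occurrences) would go to the grammar-based model's training set and the single long navigation request to the statistical model's training set, while "play music" sits exactly on the threshold and is left unassigned under the stated assumption.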