Patent application number | Description | Published |
20100049516 | METHOD OF USING MICROPHONE CHARACTERISTICS TO OPTIMIZE SPEECH RECOGNITION PERFORMANCE - A system and method for tuning a speech recognition engine to an individual microphone using a database containing acoustical models for a plurality of microphones. Microphone performance characteristics are obtained from a microphone at a speech recognition engine, the database is searched for an acoustical model that matches the characteristics, and the speech recognition engine is then modified based on the matching acoustical model. | 02-25-2010 |
20110010171 | Singular Value Decomposition for Improved Voice Recognition in Presence of Multi-Talker Background Noise - A system and method for providing speech recognition functionality offers improved accuracy and robustness in noisy environments having multiple speakers. The described technique includes receiving speech energy and converting the received speech energy to a digitized form. The digitized speech energy is decomposed into features that are then projected into a feature space having multiple speaker subspaces. The projected features fall either into one of the multiple speaker subspaces or outside of all speaker subspaces. A speech recognition operation is performed on a selected one of the multiple speaker subspaces to resolve the utterance to a command or data. | 01-13-2011 |
20110046953 | METHOD OF RECOGNIZING SPEECH - A method for recognizing speech involves reciting, into a speech recognition system, an utterance including a numeric sequence that contains a digit string including a plurality of tokens and detecting a co-articulation problem related to at least two potentially co-articulated tokens in the digit string. The numeric sequence may be identified using i) a dynamically generated possible numeric sequence that potentially corresponds with the numeric sequence, and/or ii) at least one supplemental acoustic model. Also disclosed herein is a system for accomplishing the same. | 02-24-2011 |
20110125500 | AUTOMATED DISTORTION CLASSIFICATION - A method of and system for automated distortion classification. The method includes steps of (a) receiving audio including a user speech signal and at least some distortion associated with the signal; (b) pre-processing the received audio to generate acoustic feature vectors; (c) decoding the generated acoustic feature vectors to produce a plurality of hypotheses for the distortion; and (d) post-processing the plurality of hypotheses to identify at least one distortion hypothesis of the plurality of hypotheses as the received distortion. The system can include one or more distortion models including distortion-related acoustic features representative of various types of distortion and used by a decoder to compare the acoustic feature vectors with the distortion-related acoustic features to produce the plurality of hypotheses for the distortion. | 05-26-2011 |
20110250933 | METHOD OF CONTROLLING DIALING MODES IN A VEHICLE - A dialing mode of a telematics unit in a vehicle is controlled by monitoring for dialing digits from a vehicle occupant, determining whether the type of dialing digits are continuous dialing digits or discrete dialing digits, establishing a continuous mode for receiving continuous dialing digits or a discrete mode for receiving discrete dialing digits based on the determination, and if the type of dialing digits changes, switching the established mode. | 10-13-2011 |
20110282663 | TRANSIENT NOISE REJECTION FOR SPEECH RECOGNITION - A method of and system for transient noise rejection for improved speech recognition. The method comprises the steps of (a) receiving audio including user speech and at least some transient noise associated with the speech, (b) converting the received audio into digital data, (c) segmenting the digital data into acoustic frames, and (d) extracting acoustic feature vectors from the acoustic frames. The method also comprises the steps of (e) evaluating the acoustic frames for transient noise on a frame-by-frame basis, (f) rejecting those acoustic frames having transient noise, (g) accepting as speech frames those acoustic frames having no transient noise and, thereafter, (h) recognizing the user speech using the speech frames. | 11-17-2011 |
20110282668 | SPEECH ADAPTATION IN SPEECH SYNTHESIS - A method of and system for speech synthesis. First and second text inputs are received in a text-to-speech system, and processed into respective first and second speech outputs corresponding to stored speech respectively from first and second speakers using a processor of the system. The second speech output of the second speaker is adapted to sound like the first speech output of the first speaker. | 11-17-2011 |
20120109649 | SPEECH DIALECT CLASSIFICATION FOR AUTOMATIC SPEECH RECOGNITION - Automatic speech recognition including receiving speech via a microphone, pre-processing the received speech to generate acoustic feature vectors, classifying dialect of the received speech, selecting at least one of an acoustic model or a lexicon specific to the classified dialect, decoding the acoustic feature vectors using a processor and at least one of the selected dialect-specific acoustic model or selected lexicon to produce a plurality of hypotheses for the received speech, and post-processing the plurality of hypotheses to identify one of the plurality of hypotheses as the received speech. | 05-03-2012 |
20120149356 | METHOD OF INTELLIGENT VEHICLE DIALING - A method of operating a vehicle telematics unit includes determining the location of a vehicle equipped with a vehicle telematics unit; determining if telematics dialing software operated by the vehicle telematics unit includes a verbal dialing protocol used at the determined vehicle location; if not, identifying one or more verbal dialing protocols used at the determined location of the vehicle; requesting telematics dialing software that includes the one or more identified verbal dialing protocols; receiving the requested telematics dialing software from a central facility; and storing the received telematics dialing software at the vehicle. | 06-14-2012 |
20120150541 | MALE ACOUSTIC MODEL ADAPTATION BASED ON LANGUAGE-INDEPENDENT FEMALE SPEECH DATA - A method of generating proxy acoustic models for use in automatic speech recognition includes training acoustic models from speech received via microphone from male speakers of a first language, and adapting the acoustic models in response to language-independent speech data from female speakers of a second language, to generate proxy acoustic models for use during runtime of speech recognition of an utterance from a female speaker of the first language. | 06-14-2012 |
20120209609 | USER-SPECIFIC CONFIDENCE THRESHOLDS FOR SPEECH RECOGNITION - A method of automatic speech recognition includes receiving an utterance from a user via a microphone that converts the utterance into a speech signal, pre-processing the speech signal using a processor to extract acoustic data from the received speech signal, and identifying at least one user-specific characteristic in response to the extracted acoustic data. The method also includes determining a user-specific confidence threshold responsive to the at least one user-specific characteristic, and using the user-specific confidence threshold to recognize the utterance received from the user and/or to assess confusability of the utterance with stored vocabulary. | 08-16-2012 |
20120245934 | SPEECH RECOGNITION DEPENDENT ON TEXT MESSAGE CONTENT - A method of automatic speech recognition. An utterance is received from a user in reply to a text message, via a microphone that converts the reply utterance into a speech signal. The speech signal is processed using at least one processor to extract acoustic data from the speech signal. An acoustic model for decoding the acoustic data is identified from a plurality of acoustic models using a conversational context associated with the text message. The acoustic data is decoded using the identified acoustic model to produce a plurality of hypotheses for the reply utterance. | 09-27-2012 |
20120323577 | SPEECH RECOGNITION FOR PREMATURE ENUNCIATION - Methods of automatic speech recognition for premature enunciation. In one method, a) a user is prompted to input speech, then b) a listening period is initiated to monitor audio via a microphone, such that there is no pause between the end of step a) and the beginning of step b), and then a begin-speaking audible indicator is communicated to the user during the listening period. In another method, a) at least one audio file is played including both a prompt for a user to input speech and a begin-speaking audible indicator to the user, b) a microphone is activated to monitor audio, after playing the prompt but before playing the begin-speaking audible indicator in step a), and c) speech is received from the user via the microphone. | 12-20-2012 |
20130080172 | OBJECTIVE EVALUATION OF SYNTHESIZED SPEECH ATTRIBUTES - A method of evaluating attributes of synthesized speech. The method includes processing a text input into a synthesized speech utterance using a processor of a text-to-speech system, applying a human speech utterance to a speech model to obtain a reference wherein the human speech utterance corresponds to the text input, applying the synthesized speech utterance to at least one of the speech model or another speech model to obtain a test, and calculating a difference between the test and the reference. The method can also be used in a speech synthesis method. | 03-28-2013 |
20130080173 | CORRECTING UNINTELLIGIBLE SYNTHESIZED SPEECH - A method and system of speech synthesis. A text input is received in a text-to-speech system and, using a processor of the system, the text input is processed into synthesized speech which is established as unintelligible. The text input is reprocessed into subsequent synthesized speech and output to a user via a loudspeaker to correct the unintelligible synthesized speech. In one embodiment, the synthesized speech can be established as unintelligible by predicting intelligibility of the synthesized speech, and determining that the predicted intelligibility is lower than a minimum threshold. In another embodiment, the synthesized speech can be established as unintelligible by outputting the synthesized speech to the user via the loudspeaker, and receiving an indication from the user that the synthesized speech is not intelligible. | 03-28-2013 |
20130211832 | SPEECH SIGNAL PROCESSING RESPONSIVE TO LOW NOISE LEVELS - A method of speech recognition in a vehicle. Audio including noise and a speech signal representative of an utterance from a user is received via a microphone, and a signal-to-noise ratio (SNR) for the received audio is calculated using a processor. It is determined whether the calculated SNR is greater than a predetermined SNR. If so, then a noise distribution is identified for addition to the received audio, and noise corresponding to the identified noise distribution is injected into the received audio to produce noise-injected audio including the speech signal. | 08-15-2013 |
20140019135 | SENDER-RESPONSIVE TEXT-TO-SPEECH PROCESSING - A method of speech synthesis including receiving a text input sent by a sender, processing the text input responsive to at least one distinguishing characteristic of the sender to produce synthesized speech that is representative of a voice of the sender, and communicating the synthesized speech to a recipient user of the system. | 01-16-2014 |
20140316782 | METHODS AND SYSTEMS FOR MANAGING DIALOG OF SPEECH SYSTEMS - Methods and systems are provided for managing speech dialog of a speech system. In one embodiment, a method includes: receiving a first utterance from a user of the speech system; determining a first list of possible results from the first utterance, wherein the first list includes at least two elements that each represent a possible result; analyzing the at least two elements of the first list to determine an ambiguity of the elements; and generating a speech prompt to the user based on partial orthography and the ambiguity. | 10-23-2014 |
20150056951 | VEHICLE TELEMATICS UNIT AND METHOD OF OPERATING THE SAME - A vehicle telematics unit and method of operating the same are provided. In one embodiment, a method includes storing an application access code provided from a telematics service user, initiating a call from a vehicle to the application, and receiving a request for the access code from the application during the call. Furthermore, the method includes determining that the application has requested the access code using a speech recognition function at the vehicle and sending the stored access code to the application based on the determination of the speech recognition function. | 02-26-2015 |
20150110287 | METHODS AND APPARATUS FOR PROCESSING MULTIPLE AUDIO STREAMS AT A VEHICLE ONBOARD COMPUTER SYSTEM - A method for processing a plurality of audio streams at a computer system onboard a vehicle is provided. The method receives the plurality of audio streams from a plurality of locations within a vehicle; prioritizes each of the plurality of audio streams to obtain a prioritization result; and completes a task associated with each of the plurality of audio streams, according to the prioritization result. | 04-23-2015 |
20150142428 | IN-VEHICLE NAMETAG CHOICE USING SPEECH RECOGNITION - According to an embodiment of the disclosure, there is provided a method of choosing a nametag using automatic speech recognition (ASR). The method includes receiving a spoken nametag via a microphone; performing a first speech recognition analysis on the spoken nametag; determining that the first speech recognition analysis outputs only handheld wireless device nametags; performing a second speech recognition analysis that excludes the handheld wireless device nametags stored at the handheld wireless device; and combining the results of the first speech recognition analysis and the second speech recognition analysis. | 05-21-2015 |
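
The noise-injection step summarized in application 20130211832 (SPEECH SIGNAL PROCESSING RESPONSIVE TO LOW NOISE LEVELS) can be sketched roughly as below. This is only an illustration under stated assumptions, not the claimed method: the function names, the 30 dB threshold, the use of a separate speech-free segment to estimate noise power, and the choice of zero-mean Gaussian noise are all hypothetical. The abstract does not say how the SNR is calculated or which noise distribution is identified.

```python
import math
import random
from typing import List, Optional, Sequence


def mean_power(samples: Sequence[float]) -> float:
    """Average power: the mean of the squared samples."""
    return sum(s * s for s in samples) / len(samples)


def inject_noise_if_too_clean(
    audio: Sequence[float],
    noise_only: Sequence[float],
    max_snr_db: float = 30.0,  # hypothetical "predetermined SNR"
    rng: Optional[random.Random] = None,
) -> List[float]:
    """Return the audio, with noise added only if its SNR exceeds max_snr_db.

    Assumption: `noise_only` is a speech-free stretch of the same recording,
    used here to estimate the background noise power.
    """
    rng = rng or random.Random()
    noise_power = max(mean_power(noise_only), 1e-12)
    speech_power = max(mean_power(audio) - noise_power, 1e-12)
    snr_db = 10.0 * math.log10(speech_power / noise_power)

    # At or below the threshold the audio is passed through unchanged.
    if snr_db <= max_snr_db:
        return list(audio)

    # Otherwise add enough Gaussian noise (an assumed distribution) to
    # bring the expected SNR down to the threshold.
    target_noise_power = speech_power / (10.0 ** (max_snr_db / 10.0))
    sigma = math.sqrt(target_noise_power - noise_power)
    return [s + rng.gauss(0.0, sigma) for s in audio]
```

Passing a seeded `random.Random` makes the injected noise reproducible, which is convenient when comparing recognition results across runs.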
Patent application number | Description | Published |
20080225866 | Reducing network traffic to Teredo server - Methods and systems for reducing network load on a Teredo server. | 09-18-2008 |
20080225867 | Faster NAT detection for Teredo client - Methods and systems for allowing a Teredo client behind a network address translation device to detect more rapidly the type of network address translation device. | 09-18-2008 |
20080225868 | Allowing IPv4 clients to communicate using Teredo addresses when both clients are behind a NAT - Methods and systems for allowing clients behind a network address translation device to communicate when those clients are assigned Teredo IPv6 addresses. | 09-18-2008 |
20080240132 | Teredo connectivity between clients behind symmetric NATs - Methods and systems for communicating between Teredo clients behind symmetric or restricted Network Address Translators. | 10-02-2008 |
20090006595 | Edge traversal service dormancy - A system maintains a dormant state in the host, in which no beacons (or “bubbles”) are transmitted from the host when no application or service (collectively, “processes”) of the host is accepting unsolicited traffic via the edge traversal service. When at least one application or service begins to accept unsolicited traffic via the edge traversal service, the host enters a qualified state and begins transmitting the beacons. As each additional application or service begins to accept such traffic, the number of accepting applications and services is maintained. As applications and services terminate acceptance of such traffic, the number of accepting applications and services is decremented. When the last application or service terminates acceptance of unsolicited traffic via the edge traversal service, the host re-enters the dormant state and ceases transmission of its beacons. | 01-01-2009 |
20100088418 | EDGE TRAVERSAL SERVICE DORMANCY - A system maintains a dormant state in the host, in which no beacons (or “bubbles”) are transmitted from the host when no application or service (collectively, “processes”) of the host is accepting unsolicited traffic via the edge traversal service. When at least one application or service begins to accept unsolicited traffic via the edge traversal service, the host enters a qualified state and begins transmitting the beacons. As each additional application or service begins to accept such traffic, the number of accepting applications and services is maintained. As applications and services terminate acceptance of such traffic, the number of accepting applications and services is decremented. When the last application or service terminates acceptance of unsolicited traffic via the edge traversal service, the host re-enters the dormant state and ceases transmission of its beacons. | 04-08-2010 |
20110302314 | EDGE TRAVERSAL SERVICE DORMANCY - A system maintains a dormant state in the host, in which no beacons (or “bubbles”) are transmitted from the host when no application or service (collectively, “processes”) of the host is accepting unsolicited traffic via the edge traversal service. When at least one application or service begins to accept unsolicited traffic via the edge traversal service, the host enters a qualified state and begins transmitting the beacons. As each additional application or service begins to accept such traffic, the number of accepting applications and services is maintained. As applications and services terminate acceptance of such traffic, the number of accepting applications and services is decremented. When the last application or service terminates acceptance of unsolicited traffic via the edge traversal service, the host re-enters the dormant state and ceases transmission of its beacons. | 12-08-2011 |
20120195305 | AGGREGATING ENDPOINT CAPABILITIES FOR A USER - A method and system for aggregating capabilities from multiple endpoints associated with a user are provided. The system aggregates the capabilities of the endpoints associated with a user into an aggregate view of available modes of communication for reaching the user. Then, the system publishes the aggregate view so that other users who want to send communications to the user will know the modes of communication available for that user. In addition, the system may designate certain modes of communication as preferred or as capable of reaching the user. | 08-02-2012 |
20130198383 | Network Access Based on Social-Networking Information - In one embodiment, a method includes receiving a request at a wireless access point from a client system to access a network through the wireless access point. The wireless access point sends an identifier associated with the client system to a social-networking system, the social-networking system including user profiles arranged in one or more social graphs. The social graphs store relationships between the user profiles. The method further includes receiving at the wireless access point from the social-networking system an authorization determination. The authorization determination is based upon a first user profile of the plurality of user profiles. The first user profile includes the identifier associated with the client system. The method further includes providing the client system with access to the network through the wireless access point in accordance with the authorization determination. | 08-01-2013 |
20140068038 | Maintain Persistent Connections Between Servers and Mobile Clients - In one embodiment, a method includes iteratively waiting for a time interval; sending a keep-alive message from a first computing device to a second computing device; and incrementing a current value of the time interval when the first computing device receives from the second computing device a response to a current keep-alive message sent during the current iteration, until the first computing device does not receive from the second computing device any responses to the current keep-alive message sent during the current iteration; and setting the current value of the time interval to a previous value of the time interval from an immediately-previous iteration. | 03-06-2014 |
20140114738 | Automatic Check-In Using Social-Networking Information - In one embodiment, a method includes detecting by a computer system of an entity a client system identifier. The detection is accomplished using a first wireless communication method. The method further includes sending by the computer system the client system identifier to a social-networking system. The social-networking system includes multiple user profiles arranged in at least one social graph that stores relationships between the user profiles. The method also includes receiving at the computer system from the social-networking system user profile information from a user profile of the user profiles, the user profile identified by the social-networking system as being associated with the client system identifier. The method further includes sending information associated with the entity to the client system in response to receiving the user profile information. | 04-24-2014 |
20140172973 | Mobile Push Notification - In one embodiment, a method includes sending to a mobile client computing device a first notification through a real-time push service, the first notification including content and being associated with a stateful object; the method also includes, in response to a state change of the stateful object, sending to the mobile client computing device a second notification through the real-time push service, the second notification including an update to the content based on the state change of the stateful object. | 06-19-2014 |
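
The interval search summarized in application 20140068038 (Maintain Persistent Connections Between Servers and Mobile Clients) can be sketched as a simple loop. This is a minimal illustration under assumptions, not the patented implementation: `send_keepalive` and `wait` are hypothetical callbacks, and the starting interval, the step size, and the upper bound `max_interval` are values chosen here for the example (the abstract mentions no upper bound).

```python
from typing import Callable


def probe_keepalive_interval(
    send_keepalive: Callable[[], bool],  # hypothetical: True if a response arrived
    wait: Callable[[float], None],       # hypothetical: e.g. time.sleep
    initial: float = 30.0,
    step: float = 30.0,
    max_interval: float = 3600.0,        # assumed safety cap, not in the abstract
) -> float:
    """Grow the keep-alive interval until a keep-alive goes unanswered,
    then fall back to the last interval that still got a response."""
    previous = initial
    current = initial
    while current <= max_interval:
        wait(current)
        if not send_keepalive():
            # No response this iteration: revert to the previous value.
            # If the very first probe fails there is no earlier value,
            # so `initial` is returned unchanged.
            return previous
        previous = current
        current += step
    return previous
```

Injecting `wait` rather than calling `time.sleep` directly lets the loop be exercised without real delays; in practice the caller would pass `time.sleep` and a function that sends a keep-alive over the open connection.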