Sensory, Incorporated Patent applications |
Patent application number | Title | Published |
20150317980 | ENERGY POST QUALIFICATION FOR PHRASE SPOTTING - In one embodiment, a computing device can detect an utterance of a target phrase within an acoustic input signal. The computing device can further determine a first estimate of cumulative signal and noise energy for the detected utterance in the acoustic input signal with respect to a first time period spanning the duration of the detected utterance, and a second estimate of noise energy in the acoustic input signal with respect to a second time period preceding (or following) the first time period. The computing device can then calculate a signal-to-noise ratio (SNR) for the detected utterance based on the first and second estimates and can reject the detected utterance if the SNR is below an SNR threshold. | 11-05-2015 |
20140257812 | Background Speech Recognition Assistant Using Speaker Verification - In one embodiment, a method includes receiving an acoustic input signal at a speech recognizer. A user is identified that is speaking based on the acoustic input signal. The method then determines speaker-specific information previously stored for the user and a set of responses based on the recognized acoustic input signal and the speaker-specific information for the user. It is determined if the response should be output and the response is outputted if it is determined the response should be output. | 09-11-2014 |
20140195236 | SPEAKER VERIFICATION AND IDENTIFICATION USING ARTIFICIAL NEURAL NETWORK-BASED SUB-PHONETIC UNIT DISCRIMINATION - In one embodiment, a computer system stores speech data for a plurality of speakers, where the speech data includes a plurality of feature vectors and, for each feature vector, an associated sub-phonetic class. The computer system then builds, based on the speech data, an artificial neural network (ANN) for modeling speech of a target speaker in the plurality of speakers, where the ANN is configured to discriminate between instances of sub-phonetic classes uttered by the target speaker and instances of sub-phonetic classes uttered by other speakers in the plurality of speakers. | 07-10-2014 |
20140180691 | SYSTEMS AND METHODS FOR HANDS-FREE VOICE CONTROL AND VOICE SEARCH - In one embodiment the present invention includes a method comprising receiving an acoustic input signal and processing the acoustic input signal with a plurality of acoustic recognition processes configured to recognize the same target sound. Different acoustic recognition processes start processing different segments of the acoustic input signal at different time points in the acoustic input signal. In one embodiment, initial states in the recognition processes may be configured on each time step. | 06-26-2014 |
20130183944 | Information Access and Device Control Using Mobile Phones and Audio in the Home Environment - Embodiments of the present invention are directed toward systems, methods and devices for improving information access to and device control in a home automation environment. Functionality of multiple household devices, such as lights, sound, entertainment, HVAC, and communication devices can be activated via voice commands. The voice commands are detected by a nearby control device and relayed via a network communication medium to another control device to which the desired device or system that the user wants to operate is connected. Each control device, disposed throughout the home, can detect a voice command intended for another control box and household device and relay the voice command to the intended control box. In such systems, a user can initiate a telephone call by saying a voice command to a local control box that will forward on the control signal to a mobile phone connected to another control box. | 07-18-2013 |
20130080171 | BACKGROUND SPEECH RECOGNITION ASSISTANT - In one embodiment, a method receives an acoustic input signal at a speech recognizer configured to recognize the acoustic input signal in an always on mode. A set of responses based on the recognized acoustic input signal is determined and ranked based on criteria. A computing device determines if the response should be output based on a ranking of the response. The method determines an output method in a plurality of output methods based on the ranking of the response and outputs the response using the output method if it is determined the response should be output. | 03-28-2013 |
20130080167 | Background Speech Recognition Assistant Using Speaker Verification - In one embodiment, a method includes receiving an acoustic input signal at a speech recognizer. A user is identified that is speaking based on the acoustic input signal. The method then determines speaker-specific information previously stored for the user and a set of responses based on the recognized acoustic input signal and the speaker-specific information for the user. It is determined if the response should be output and the response is outputted if it is determined the response should be output. | 03-28-2013 |
20130054242 | REDUCING FALSE POSITIVES IN SPEECH RECOGNITION SYSTEMS - Embodiments of the present invention improve methods of performing speech recognition. In one embodiment, the present invention includes a method comprising receiving a spoken utterance, processing the spoken utterance in a speech recognizer to generate a recognition result, determining consistencies of one or more parameters of component sounds of the spoken utterance, wherein the parameters are selected from the group consisting of duration, energy, and pitch, and wherein each component sound of the spoken utterance has a corresponding value of said parameter, and validating the recognition result based on the consistency of at least one of said parameters. | 02-28-2013 |
20130054235 | TRULY HANDSFREE SPEECH RECOGNITION IN HIGH NOISE ENVIRONMENTS - Embodiments of the present invention improve content manipulation systems and methods using speech recognition. In one embodiment, the present invention includes a method comprising configuring a recognizer to recognize utterances in the presence of a background audio signal having particular audio characteristics. A composite signal comprising a first audio signal and a spoken utterance of a user is received by the recognizer, where the first audio signal comprises the particular audio characteristics used to configure the recognizer so that the recognizer is desensitized to the first audio signal. The spoken utterance is recognized in the presence of the first audio signal when the spoken utterance is one of the predetermined utterances. An operation is performed on the first audio signal. | 02-28-2013 |
20110166855 | Systems and Methods for Hands-free Voice Control and Voice Search - In one embodiment the present invention includes a method comprising receiving an acoustic input signal and processing the acoustic input signal with a plurality of acoustic recognition processes configured to recognize the same target sound. Different acoustic recognition processes start processing different segments of the acoustic input signal at different time points in the acoustic input signal. In one embodiment, initial states in the recognition processes may be configured on each time step. | 07-07-2011 |
20090204410 | VOICE INTERFACE AND SEARCH FOR ELECTRONIC DEVICES INCLUDING BLUETOOTH HEADSETS AND REMOTE SYSTEMS - Systems and methods for improving the interaction between a user and a small electronic device such as a Bluetooth headset are described. The use of a voice user interface in electronic devices may be used. In one embodiment, recognition processing limitations of some devices are overcome by employing speech synthesizers and recognizers in series where one electronic device responds to simple audio commands and sends audio requests to a remote device with more significant recognition analysis capability. Embodiments of the present invention may include systems and methods for utilizing speech recognizers and synthesizers in series to provide simple, reliable, and hands-free interfaces with users. | 08-13-2009 |
20090204409 | Voice Interface and Search for Electronic Devices including Bluetooth Headsets and Remote Systems - Systems and methods for improving the interaction between a user and a small electronic device such as a Bluetooth headset are described. The use of a voice user interface in electronic devices may be used. In one embodiment, recognition processing limitations of some devices are overcome by employing speech synthesizers and recognizers in series where one electronic device responds to simple audio commands and sends audio requests to a remote device with more significant recognition analysis capability. Embodiments of the present invention may include systems and methods for utilizing speech recognizers and synthesizers in series to provide simple, reliable, and hands-free interfaces with users. | 08-13-2009 |
20090150160 | SYSTEMS AND METHODS OF PERFORMING SPEECH RECOGNITION USING GESTURES - Embodiments of the present invention improve methods of performing speech recognition using human gestures. In one embodiment, the present invention includes a speech recognition method comprising detecting a gesture, selecting a first recognition set based on the gesture, receiving a speech input signal, and recognizing the speech input signal in the context of the first recognition set. | 06-11-2009 |
20090132255 | Systems and Methods of Performing Speech Recognition with Barge-In for use in a Bluetooth System - Embodiments of the present invention improve methods of performing speech recognition with barge-in. In one embodiment, the present invention includes a speech recognition method comprising starting a synthesis of recorded speech, receiving a user speech input signal providing information regarding a user choice, detecting an initial portion of the user speech input signal, selectively altering the synthesis of recorded speech, and recognizing the user choice. | 05-21-2009 |
20090094033 | Systems and methods of performing speech recognition using historical information - Embodiments of the present invention improve speech recognition using historical information. In one embodiment, the present invention includes a method of performing speech recognition comprising receiving an identifier specifying a user of a kiosk, retrieving history information about the user using the identifier, receiving speech input, recognizing said speech input in the context of a first recognition set, resulting in first recognition results, and modifying the first recognition results using the history information. | 04-09-2009 |
20090094032 | Systems and methods of performing speech recognition using sensory inputs of human position - Embodiments of the present invention improve methods of performing speech recognition using sensory inputs of human position. In one embodiment, the present invention includes a speech recognition method comprising sensing a change in position of at least one part of a human body, selecting a recognition set based on the change of position, receiving a speech input signal, and recognizing the speech input signal in the context of the first recognition set. | 04-09-2009 |
20090043580 | System and Method for Controlling the Operation of a Device by Voice Commands - The present invention includes a speech recognition system comprising a light element, a power control switch, the power control switch varying the power delivered to the light element, a controller, a microphone, a speech recognizer coupled to the microphone for recognizing speech input signals and transmitting recognition results to the controller, and a speech synthesizer coupled to the controller for generating synthesized speech, wherein the controller varies the power to the light element in accordance with the recognition results received from the speech recognizer. Embodiments of the invention may alternatively include a low power wake up circuit. In another embodiment, the present invention is a method of controlling a device by voice commands. | 02-12-2009 |
20080304360 | Systems and Methods of Sonic Communication - In one embodiment the present invention includes a method of wireless communication. The method comprises receiving a sonic signal and determining a sequence of sonic tones from a received sonic signal. The receiving includes receiving the sonic signal at an electronic device using a microphone. The sonic signal includes a sequence of sonic tones. The receiving results in the received sonic signal. The sequence of sonic tones contains predefined timing. The timing includes the duration of each sonic tone and a set of intervals between successive sonic tones. | 12-11-2008 |
20080275699 | Systems and methods of performing speech recognition using global positioning (GPS) information - Embodiments of the present invention improve content selection systems and methods using speech recognition. In one embodiment, the present invention includes a speech recognition method comprising receiving location parameters from a global positioning system, retrieving location data using the location parameters, and configuring one or more recognition sets of a speech recognizer using the location data. | 11-06-2008 |
20080228481 | Content selection systems and methods using speech recognition - Embodiments of the present invention improve content selection systems and methods using speech recognition. In one embodiment, the present invention includes a speech recognition method comprising storing content on an electronic device, wherein the content is associated with a plurality of content attribute values, adding the content attribute values to a first recognition set of a speech recognizer, receiving a speech input signal in said speech recognizer, generating a plurality of likelihood values in response to the speech input signal, wherein each likelihood value is associated with one content attribute value in the recognition set; and accessing the stored content based on the likelihood values. | 09-18-2008 |
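
The energy post-qualification step summarized for application 20150317980 can be illustrated with a minimal Python sketch. It is not taken from the application itself: the function names, the use of mean squared amplitude as the energy estimate, the subtraction of the noise estimate from the cumulative estimate, and the 6 dB default threshold are all assumptions made for illustration. The abstract only states that an SNR is computed from the two estimates and compared with a threshold.

```python
import math


def mean_energy(samples):
    """Average squared amplitude of a sequence of samples (assumed energy measure)."""
    return sum(s * s for s in samples) / len(samples)


def snr_db(signal, utt_start, utt_end, noise_len):
    """Estimate the SNR in dB for an utterance spanning signal[utt_start:utt_end].

    The noise estimate uses up to noise_len samples immediately preceding
    the utterance. All parameter names are hypothetical.
    """
    noise = signal[max(0, utt_start - noise_len):utt_start]
    utterance = signal[utt_start:utt_end]
    if not noise or not utterance:
        raise ValueError("both the noise window and the utterance must be non-empty")

    noise_energy = mean_energy(noise)      # second estimate: noise only
    total_energy = mean_energy(utterance)  # first estimate: signal plus noise

    # Assumption: signal energy is the cumulative estimate minus the noise estimate.
    signal_energy = max(total_energy - noise_energy, 0.0)

    if signal_energy == 0.0:
        return -math.inf  # nothing rises above the noise floor
    if noise_energy == 0.0:
        return math.inf   # silent background
    return 10.0 * math.log10(signal_energy / noise_energy)


def accept_detection(signal, utt_start, utt_end, noise_len=4, threshold_db=6.0):
    """Return False (reject the detection) when the SNR falls below the threshold."""
    return snr_db(signal, utt_start, utt_end, noise_len) >= threshold_db


if __name__ == "__main__":
    quiet = [0.1, -0.1, 0.1, -0.1]
    loud = [1.0, -1.0, 1.0, -1.0]
    print(accept_detection(quiet + loud, 4, 8))   # utterance well above the noise
    print(accept_detection(quiet + quiet, 4, 8))  # utterance no louder than the noise
```

A real phrase spotter would more likely work on framed audio with a smoothed noise estimate; the sketch only shows how a threshold on the ratio of the two energy estimates can reject a detection.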