Patent application number | Description | Published |
20090055177 | APPARATUS AND METHOD FOR GENERATING NOISE ADAPTIVE ACOUSTIC MODEL FOR ENVIRONMENT MIGRATION INCLUDING NOISE ADAPTIVE DISCRIMINATIVE ADAPTATION METHOD - Provided are an apparatus and method for generating a noise adaptive acoustic model including a noise adaptive discriminative adaptation method. The method includes: generating a baseline model parameter from large-capacity speech training data including various noise environments; and receiving the generated baseline model parameter and applying a discriminative adaptation method to the generated results to generate an migrated acoustic model parameter suitable for an actually applied environment. | 02-26-2009 |
20090076813 | METHOD FOR SPEECH RECOGNITION USING UNCERTAINTY INFORMATION FOR SUB-BANDS IN NOISE ENVIRONMENT AND APPARATUS THEREOF - According to a method and apparatus for speech recognition in noise environment of the present invention using uncertainty information for sub-band, uncertainty information of each sub-band is extracted from estimated clean speech using noise modeling, and helps to extract speech features that are robust to noise using the extracted uncertainty information as a weight with respect to each sub-band. Also, an acoustic model is converted according to each sub-band weight, and speech recognition is performed based on the converted acoustic model and the extracted speech features. As a result, while the noise modeling over time is not so accurate, noise influence resulted from sub-bands having high corruption can be reduced according to the uncertainty information of the corresponding sub-band, and speech recognition performance in complex noise environments can be improved. | 03-19-2009 |
20090150146 | MICROPHONE ARRAY BASED SPEECH RECOGNITION SYSTEM AND TARGET SPEECH EXTRACTING METHOD OF THE SYSTEM - A microphone-array-based speech recognition system using a blind source separation (BBS) and a target speech extraction method in the system are provided. The speech recognition system performs an independent component analysis (ICA) to separate mixed signals input through a plurality of microphone into sound-source signals, extracts one target speech spoken for speech recognition from the separated sound-source signals by using a Gaussian mixture model (GMM) or a hidden Markov Model (HMM), and automatically recognizes a desired speech from the extracted target speech. Accordingly, it is possible to obtain a high speech recognition rate even in a noise environment. | 06-11-2009 |
20090157399 | APPARATUS AND METHOD FOR EVALUATING PERFORMANCE OF SPEECH RECOGNITION - An apparatus for evaluating the performance of speech recognition includes a speech database for storing N-number of test speech signals for evaluation. A speech recognizer is located in an actual environment and executes the speech recognition of the test speech signals reproduced using a loud speaker from the speech database in the actual environment to produce speech recognition results. A performance evaluation module evaluates the performance of the speech recognition by comparing correct recognition results answers with the speech recognition results. | 06-18-2009 |
20090265168 | NOISE CANCELLATION SYSTEM AND METHOD - A noise cancellation apparatus includes a noise estimation module for receiving a noise-containing input speech, and estimating a noise therefrom to output the estimated noise; a first Wiener filter module for receiving the input speech, and applying a first Wiener filter thereto to output a first estimation of clean speech; a database for storing data of a Gaussian mixture model for modeling clean speech; and an MMSE estimation module for receiving the first estimation of clean speech and the data of the Gaussian mixture model to output a second estimation of clean speech. The apparatus further includes a final clean speech estimation module for receiving the second estimation of clean speech from the MMSE estimation module and the estimated noise from the noise estimation module, and obtaining a final Wiener filter gain therefrom to output a final estimation of clean speech by applying the final Wiener filter gain. | 10-22-2009 |
20090280379 | Electrode binder solution composition for polymer electrolyte fuel cell - The present invention relates to an electrode binder solution composition for a polymer electrolyte fuel cell comprising a mixture of a solvent and a nonsolvent. The electrode binder solution composition can significantly improve electrode activity by maximizing formation of a three-phase interface of catalyst, binder and fuel at the electrode catalytic layer of the polymer electrolyte fuel cell. The present invention relates to a preparation method of an electrode binder solution for a polymer electrolyte fuel cell, the electrode binder solution for a polymer electrolyte fuel cell comprising a sulfonated proton exchange hydrocarbon-based polymer and a mixture of a solvent and a nonsolvent. The present invention also relates to a preparation method of an electrode catalyst slurry comprising the steps of: mixing an electrode binder solution composition for a polymer electrolyte fuel cell with a platinum catalyst and drying the mixture; and heat-treating the dried mixture to maximize interface between the electrode binder and the catalyst. | 11-12-2009 |
20100154015 | METADATA SEARCH APPARATUS AND METHOD USING SPEECH RECOGNITION, AND IPTV RECEIVING APPARATUS USING THE SAME - A metadata search apparatus using speech recognition includes a metadata processor for processing contents metadata to obtain allomorph of target vocabulary required for speech recognition and search; a metadata storage unit for storing the contents metadata; a speech recognizer for performing speech recognition on speech data uttered by a user by searching the allomorph of the target vocabulary; a query language processor for extracting a keyword from the vocabulary speech-recognized by the speech recognizer; and a search processor for searching the metadata storage unit to extract the contents metadata corresponding to the keyword. An IPTV receiving apparatus employs the metadata search apparatus to provide IPTV services through the functions of speech recognition. | 06-17-2010 |
20100158271 | METHOD FOR SEPARATING SOURCE SIGNALS AND APPARATUS THEREOF - A method for separating a sound source from a mixed signal, includes Transforming a mixed signal to channel signals in frequency domain; and grouping several frequency bands for each channel signal to form frequency clusters. Further, the method for separating the sound source from the mixed signal includes separating the frequency clusters by applying a blind source separation to signals in frequency domain for each frequency cluster; and integrating the spectrums of the separated signal to restore the sound source in a time domain wherein each of the separated signals expresses one sound source. | 06-24-2010 |
20100161326 | SPEECH RECOGNITION SYSTEM AND METHOD - A speech recognition system includes: a speed level classifier for measuring a moving speed of a moving object by using a noise signal at an initial time of speech recognition to determine a speed level of the moving object; a first speech enhancement unit for enhancing sound quality of an input speech signal of the speech recognition by using a Wiener filter, if the speed level of the moving object is equal to or lower than a specific level; and a second speech enhancement unit enhancing the sound quality of the input speech signal by using a Gaussian mixture model, if the speed level of the moving object is higher than the specific level. The system further includes an end point detection unit for detecting start and end points, an elimination unit for eliminating sudden noise components based on a sudden noise Gaussian mixture model. | 06-24-2010 |
20100161329 | VITERBI DECODER AND SPEECH RECOGNITION METHOD USING SAME - A Viterbi decoder includes: an observation vector sequence generator for generating an observation vector sequence by converting an input speech to a sequence of observation vectors; a local optimal state calculator for obtaining a partial state sequence having a maximum similarity up to a current observation vector as an optimal state; an observation probability calculator for obtaining, as a current observation probability, a probability for observing the current observation vector in the optimal state; a buffer for storing therein a specific number of previous observation probabilities; a non-linear filter for calculating a filtered probability by using the previous observation probabilities stored in the buffer and the current observation probability; and a maximum likelihood calculator for calculating a partial maximum likelihood by using the filtered probability. The filtered probability may be a maximum value, a mean value or a median value of the previous observation probabilities and the current observation probability. | 06-24-2010 |
20100161334 | UTTERANCE VERIFICATION METHOD AND APPARATUS FOR ISOLATED WORD N-BEST RECOGNITION RESULT - An utterance verification method for an isolated word N-best speech recognition result includes: calculating log likelihoods of a context-dependent phoneme and an anti-phoneme model based on an N-best speech recognition result for an input utterance; measuring a confidence score of an N-best speech-recognized word using the log likelihoods; calculating distance between phonemes for the N-best speech-recognized word; comparing the confidence score with a threshold and the distance with a predetermined mean of distances; and accepting the N-best speech-recognized word when the compared results for the confidence score and the distance correspond to acceptance. | 06-24-2010 |
20110077939 | MODEL-BASED DISTORTION COMPENSATING NOISE REDUCTION APPARATUS AND METHOD FOR SPEECH RECOGNITION - A model-based distortion compensating noise reduction apparatus for speech recognition, includes: a speech absence probability calculator for calculating the probability distribution for absence and existence of a speech using the sound absence and existence information for the frames; a noise estimation updater for estimating a more accurate noise component by updating the variance of the clean speech and noise for each frame; and a speech absence probability-based noise filter for outputting a first clean speech through the speech absence probability transmitted from the speech absence probability calculator and a first noise filter. Further, the model-based distortion compensating noise reduction apparatus includes a post probability calculator for calculating post probabilities for mixtures using a GMM containing a clean speech in the first clean speech; and a final filter designer for forming a second noise filter and outputting an improved final clean speech signal using the second noise filter. | 03-31-2011 |
20120136659 | APPARATUS AND METHOD FOR PREPROCESSING SPEECH SIGNALS - Disclosed herein are an apparatus and method for preprocessing speech signals to perform speech recognition. The apparatus includes a voiced sound interval detection unit, a preprocessing method determination unit, and a clipping signal processing unit. The voiced sound interval detection unit detects a voiced sound interval including a voiced sound signal in a voice interval. The preprocessing method determination unit detects a clipping signal present in the voiced sound interval. The clipping signal processing unit extracts signal samples adjacent to the clipping signal, and performs interpolation on the clipping signal using the adjacent signal samples. | 05-31-2012 |
20120150539 | METHOD FOR ESTIMATING LANGUAGE MODEL WEIGHT AND SYSTEM FOR THE SAME - Method of the present invention may include receiving speech feature vector converted from speech signal, performing first search by applying first language model to the received speech feature vector, and outputting word lattice and first acoustic score of the word lattice as continuous speech recognition result, outputting second acoustic score as phoneme recognition result by applying an acoustic model to the speech feature vector, comparing the first acoustic score of the continuous speech recognition result with the second acoustic score of the phoneme recognition result, outputting first language model weight when the first coustic score of the continuous speech recognition result is better than the second acoustic score of the phoneme recognition result and performing a second search by applying a second language model weight, which is the same as the output first language model, to the word lattice. | 06-14-2012 |
20120166194 | METHOD AND APPARATUS FOR RECOGNIZING SPEECH - Disclosed herein are an apparatus and method for recognizing speech. The apparatus includes a frame-based speech recognition unit, a segment division unit, a segment feature extraction unit, a segment speech recognition performance unit, and a combination and synchronization unit. The frame-based speech recognition unit extracts frame speech feature vectors from a speech signal, and performs speech recognition on frames of the speech signal using the frame speech feature vectors and a frame-based probability model. The segment division unit divides the speech signal into segments. The segment feature extraction unit extracts segment speech feature vectors around a boundary between the segments. The segment speech recognition performance unit performs speech recognition on the segments of the speech signal using the segment speech feature vectors and a segment-based probability model. The combination and synchronization unit combines results of the speech recognition for the frames with results of the speech recognition for the segments. | 06-28-2012 |
20130035938 | APPARATUS AND METHOD FOR RECOGNIZING VOICE - The present invention includes a hierarchical search process. The hierarchical search process includes three steps. In a first step, a word boundary is determined using a recognition method of determining a following word dependent on a preceding word, and a word boundary detector. In a second step, word unit based recognition is performed in each area by dividing an input voice into a plurality of areas based on the determined word boundary. Finally, in a third step, a language model is applied to induce an optimal sentence recognition result with respect to a candidate word that is determined for each area. The present invention may improve the voice recognition performance, and particularly, the sentence unit based consecutive voice recognition performance. | 02-07-2013 |
20140129233 | APPARATUS AND SYSTEM FOR USER INTERFACE - Disclosed is apparatus and system for user interface. The apparatus for user interface comprises a body unit including a groove which is corresponding to a structure of an oral cavity and operable to be mounted on upper part of the oral cavity; a user input unit receiving a signal from the user's tongue in a part of the body unit; a communication unit transmitting the signal received from the user input unit; and a charging unit supplying an electrical energy generated from vibration or pressure caused by movement of the user's tongue. | 05-08-2014 |
20140132836 | METHOD AND APPARATUS FOR GENERATING SUMMARIZED INFORMATION, AND SERVER FOR THE SAME - The present invention relates to automatic summarization so as to recognize entire contents of multimedia data. A method of generating summarized information according to the present invention includes: generating index information on a specific audio signal or a specific video signal among input signals; synchronizing text information extracted from the input signal or received for the input signal with the index information; and generating first summarized information by using the synchronized text information and index information. | 05-15-2014 |
20140163986 | VOICE-BASED CAPTCHA METHOD AND APPARATUS - Disclosed herein is a voice-based CAPTCHA method and apparatus which can perform a CAPTCHA procedure using the voice of a human being. In the voice-based CAPTCHA) method, a plurality of uttered sounds of a user are collected. A start point and an end point of a voice from each of the collected uttered sounds are detected and then speech sections are detected. Uttered sounds of the respective detected speech sections are compared with reference uttered sounds, and then it is determined whether the uttered sounds are correctly uttered sounds. It is determined whether the uttered sounds have been made by an identical speaker if it is determined that the uttered sounds are correctly uttered sounds. Accordingly, a CAPTCHA procedure is performed using the voice of a human being, and thus it can be easily checked whether a human being has personally made a response using a voice online | 06-12-2014 |
20140171149 | APPARATUS AND METHOD FOR CONTROLLING MOBILE DEVICE BY CONVERSATION RECOGNITION, AND APPARATUS FOR PROVIDING INFORMATION BY CONVERSATION RECOGNITION DURING MEETING - An apparatus for controlling a mobile device according to the present invention includes: a conversation recognition unit configured to recognize a conversation between users through mobile devices; a user intent verification unit configured to verify an intent of at least one user among the users based on the recognition result; and an additional function control unit configured to execute an additional function corresponding to the verified user's intent in a mobile device of the user. According to the present invention, great contribution may be made to improve communication between users by recognizing the conversation between the users, thereby directly providing information associated with the conversation or providing a service. | 06-19-2014 |
20140221043 | MOBILE COMMUNICATION TERMINAL AND OPERATING METHOD THEREOF - Provided is a mobile communication terminal including: a camera module which captures an image of a set area; a microphone module which, when a sound including a voice of a user is input, extracts a sound level corresponding to the sound and a sound generating position; and a control module which estimates a position of a lip of the user from the image, extracts a voice level from the sound level corresponding to the position of the lip of the user and a voice generating position from the sound generating position, and recognizes the voice of the user based on at least one of the voice level and the voice generating position. | 08-07-2014 |
20140343935 | APPARATUS AND METHOD FOR PERFORMING ASYNCHRONOUS SPEECH RECOGNITION USING MULTIPLE MICROPHONES - An apparatus and method for performing asynchronous speech recognition using multiple microphones are disclosed. The apparatus includes a microphone selection unit, a signal-to-noise ratio measurement unit, a speech recognition and verification unit, and a final recognition result output unit. The microphone selection unit selects two or more microphones responsive to a user's voice from among a plurality of microphones distributed around the user. The signal-to-noise ratio measurement unit measures the signal to noise ratios of inputs of the selected two or more microphones. The speech recognition and verification unit performs speech recognition using the input of the microphone having a highest signal to noise ratio, and verifies the speech recognition using the inputs of the remaining microphones. The final recognition result output unit outputs the final recognition results of the user's voice based on the results of the speech recognition and verification unit. | 11-20-2014 |
20140378185 | SMART WATCH - A smart watch in accordance with an embodiment of the present invention comprises: a first smart member configured to receive a voice signal sent from a mobile terminal, transform the input voice of a user to a voice signal, and send the voice signal to the mobile terminal while in talk mode; and a second smart member configured to input a control command about the talk mode into the first smart member, and transform the voice signal to voice and output the voice. | 12-25-2014 |