Patent application number | Description | Published |
20080201147 | Distributed speech recognition system and method and terminal and server for distributed speech recognition - Provided are a distributed speech recognition system, a distributed speech recognition method, and a terminal and a server for distributed speech recognition. The distributed speech recognition system includes a terminal which decodes a feature vector that is extracted from an input speech signal into a sequence of phonemes and generates the final recognition result by rescoring a candidate list provided from the outside; and a server which generates the candidate list by performing symbol matching on the recognized sequence of phonemes provided from the terminal and transmits the candidate list for the rescoring to the terminal. | 08-21-2008 |
20080249770 | Method and apparatus for searching for music based on speech recognition - Provided are a method and apparatus for searching for music based on speech recognition. By calculating search scores with respect to a speech input using an acoustic model, calculating preferences in music using a user preference model, reflecting the preferences in the search scores, and extracting a music list according to the search scores in which the preferences are reflected, a personal expression of a search result using speech recognition can be achieved, and an error or imperfection of a speech recognition result can be compensated for. | 10-09-2008 |
20090055174 | Method and apparatus for automatically completing text input using speech recognition - Provided are a method and apparatus for automatically completing a text input using speech recognition. The method includes: receiving a first part of a text from a user through a text input device; recognizing a speech of the user, which corresponds to the text; and completing a remaining part of the text based on the first part of the text and the recognized speech. Therefore, accuracy of the text input and convenience of the speech recognition can be ensured, and a non-input part of the text can be easily input based on the input part of the text and the recognized speech at a high speed. | 02-26-2009 |
20090055179 | Method, medium and apparatus for providing mobile voice web service - Provided are a method and apparatus for providing a mobile voice web service in a mobile terminal. The method includes analyzing a web history of a user from web search logs of the user and generating a voice access list based on the analysis results, and performing voice recognition by dynamically generating a voice recognition syntax according to the generated voice access list. Accordingly, by limiting the syntax required for voice recognition to one suited to the web context of the user, efficient voice recognition that can be performed in a terminal rather than a server can be implemented. | 02-26-2009 |
20090123021 | System, method, and medium indexing photos semantically - A system, method and medium indexing a plurality of photos semantically based on a user's annotation. The method includes analyzing the user's annotation and extracting a shared index from the user's annotation, detecting a situation change in the plurality of photos, and indexing the plurality of photos according to the situation change based on the shared index. | 05-14-2009 |
20090157383 | VOICE QUERY EXTENSION METHOD AND SYSTEM - A voice query extension method and system. The voice query extension method includes: detecting voice activity of a user from an input signal and extracting a feature vector from the voice activity; converting the feature vector into at least one phoneme sequence and generating the at least one phoneme sequence; matching the at least one phoneme sequence with words registered in a dictionary, extracting a string of the matched words with a linguistic meaning, and selecting the string of the matched words as a query; determining whether the query is in a predetermined first language, and when the query is not in the first language as a result of the determining, converting the query using a phoneme to grapheme rule, and generating a query in the first language; and searching using the query in the first language. | 06-18-2009 |
20090157398 | Method and apparatus for detecting noise - A method of and apparatus for detecting noise are provided. The method of detecting noise includes: receiving an input of a voice frame and converting the voice frame into a filter bank vector; converting the converted filter bank vector into band data; calculating a weight Gaussian mixture model (GMM) for each band by using the converted band data; and detecting noise in the voice frame based on the calculation result. | 06-18-2009 |
20100286987 | APPARATUS AND METHOD FOR GENERATING AVATAR BASED VIDEO MESSAGE - An apparatus and method for generating an avatar based video message are provided. The apparatus and method are capable of generating an avatar based video message based on speech of a user. The avatar based video message apparatus and method displays information that corresponds to input user speech. The avatar based video message apparatus and method edits the input user speech according to a user input signal with reference to the displayed information, generates avatar animation according to the edited speech, and generates an avatar based video message based on the edited speech and the avatar animation. | 11-11-2010 |
20110029301 | METHOD AND APPARATUS FOR RECOGNIZING SPEECH ACCORDING TO DYNAMIC DISPLAY - A speech recognition apparatus and method that can improve speech recognition rate and recognition speed by reflecting information for dynamic display, are provided. The speech recognition apparatus generates a display variation signal indicating that variations have occurred on a screen and creates display information about the varied screen. The speech recognition apparatus adjusts a word weight for at least one word related to the varied screen and a domain weight for at least one domain included in the varied screen, according to the display variation signal and the display information. The adjusted word weight and the adjusted domain weight are dynamically reflected in a language model that is used for speech recognition. | 02-03-2011 |
20120095766 | SPEECH RECOGNITION APPARATUS AND METHOD - A speech recognition apparatus is provided. The speech recognition apparatus includes a primary speech recognition unit configured to perform speech recognition on input speech and thus to generate word lattice information, a word string generation unit configured to generate one or more word strings based on the word lattice information, a language model score calculation unit configured to calculate bidirectional language model scores of the generated word strings selectively using forward and backward language models for each of words in each of the generated word strings, and a sentence output unit configured to output one or more of the generated word strings with high scores as results of the speech recognition of the input speech based on the calculated bidirectional language model scores. | 04-19-2012 |
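
The bidirectional language model scoring described in the last entry (20120095766) can be illustrated with a minimal sketch. The abstract does not say how the forward and backward models are selected for each word, so this example assumes the simplest rule: take the higher of the two bigram probabilities per word. The function names, the toy probability tables, and the floor value for unseen bigrams are all hypothetical and are not taken from the application.

```python
import math

FLOOR = 1e-4  # assumed probability for bigrams missing from a table

# Toy bigram tables (hypothetical values, for illustration only).
# forward[(previous, word)]  approximates P(word | previous word)
# backward[(next, word)]     approximates P(word | next word)
forward = {
    ("<s>", "recognize"): 0.5, ("recognize", "speech"): 0.6,
    ("<s>", "wreck"): 0.1, ("wreck", "a"): 0.3,
    ("a", "nice"): 0.2, ("nice", "beach"): 0.1,
}
backward = {
    ("speech", "recognize"): 0.7, ("</s>", "speech"): 0.4,
    ("</s>", "beach"): 0.2,
}


def bidirectional_score(words, forward, backward, floor=FLOOR):
    """Sum per-word log-probabilities, using for each word whichever of the
    forward or backward estimate is higher (an assumed selection rule)."""
    padded = ["<s>"] + list(words) + ["</s>"]
    total = 0.0
    for i in range(1, len(padded) - 1):
        fwd = forward.get((padded[i - 1], padded[i]), floor)
        bwd = backward.get((padded[i + 1], padded[i]), floor)
        total += math.log(max(fwd, bwd))
    return total


def rank_word_strings(candidates, forward, backward, top_n=1):
    """Return the top_n candidate word strings by bidirectional score."""
    scored = [(bidirectional_score(c, forward, backward), c) for c in candidates]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [c for _, c in scored[:top_n]]


# Candidate word strings, as might be read off a word lattice.
candidates = [["recognize", "speech"], ["wreck", "a", "nice", "beach"]]
best = rank_word_strings(candidates, forward, backward)
```

In a real recognizer the candidate strings would come from the word lattice produced by the primary recognition pass, and the language model scores would normally be combined with acoustic scores; both are omitted here to keep the sketch short.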