Patent application number | Description | Published |
20090259461 | Gain Control System, Gain Control Method, and Gain Control Program - Disclosed is a gain control system in which a speech model consisting of a sound pressure and a feature is stored in a speech model storage unit for each of a plurality of phonemes or for each of the clusters into which speech is divided. When an input signal is given, a feature conversion unit calculates a feature and a sound pressure of the input signal. A sound pressure comparison unit determines a sound pressure ratio between the input signal and each of the speech models. A distance calculation unit calculates a distance between the feature of the input signal and the feature of each of the speech models. A gain calculation unit calculates a gain value from the sound pressure ratio and information on the distance. A sound pressure compensation unit thereby compensates for the sound pressure of the input signal. | 10-15-2009 |
20100070277 | VOICE RECOGNITION DEVICE, VOICE RECOGNITION METHOD, AND VOICE RECOGNITION PROGRAM - A voice recognition device that recognizes a voice of an input voice signal comprises a voice model storage unit that stores in advance a predetermined voice model having a plurality of detail levels, the plurality of detail levels being information indicating a feature property of a voice for the voice model; a detail level selection unit that selects a detail level, closest to a feature property of an input voice signal, from the detail levels of the voice model stored in the voice model storage unit; and a parameter setting unit that sets parameters for recognizing the voice of an input voice according to the detail level selected by the detail level selection unit. | 03-18-2010 |
20100268532 | SYSTEM, METHOD AND PROGRAM FOR VOICE DETECTION - A system for voice detection includes a feature value calculation unit that calculates a feature value from an input signal sliced on a per frame basis, a provisional voice/non-voice decision unit that provisionally decides a voiced interval and a non-voiced interval from the feature value calculated on a per frame basis, and a voice/non-voice decision unit that determines a voiced interval duration threshold value or a non-voiced interval duration threshold value, using a ratio of the feature value found on a per frame basis to a threshold value for the feature value, and that re-decides the voiced interval and the non-voiced interval, using the voiced interval duration threshold value determined and the non-voiced interval duration threshold value determined. By determining the voiced interval duration threshold value and the non-voiced interval duration threshold value, using the feature value found on a per frame basis and the threshold value for the feature value, the constraint of the shaping rule may be made weaker or stronger depending on whether the feature value found on a per frame basis can be regarded as reliable, thereby allowing voice detection to be made without dependency upon a noise environment. | 10-21-2010 |
20110071825 | DEVICE, METHOD AND PROGRAM FOR VOICE DETECTION AND RECORDING MEDIUM - A voice detection device includes a band-based power calculation unit that calculates, for each preset frequency width (sub-band), a total of the signal power values (sub-band power) of signals entered from the microphones. The voice detection device also includes a band-based noise estimation unit that estimates the sub-band based noise power, and a sub-band based SNR calculation unit. The sub-band based SNR calculation unit calculates an SNR for each sub-band and outputs the largest of the sub-band SNRs as the SNR for a microphone of interest. The voice detection device further includes a voice/non-voice decision unit that determines the voice/non-voice using the SNR for the microphone of interest. | 03-24-2011 |
20110202339 | SPEECH SOUND DETECTION APPARATUS - A speech sound detection apparatus receives an input audio signal (as a sound reception unit), and computes input power that indicates a magnitude of the sound represented by the audio signal (as an input power computation unit). The apparatus estimates a correction function that is a continuous function defining a relation between a certain frequency and a correction coefficient used to approximate the input power computed at that frequency to the reference power predetermined for that frequency (as a correction function estimation unit). The apparatus corrects the input power at every frequency, based upon the correction coefficient that is obtained in accordance with the relation defined by the estimated correction function (as an input power correcting unit). The apparatus further determines whether or not the sound represented by the received audio signal is speech sound, based upon the corrected input power (as a speech sound detection unit). | 08-18-2011 |
20110225439 | SIGNAL CORRECTION APPARATUS - A signal correction apparatus receives an input audio signal (serving as a first sound reception means). The signal correction apparatus computes, at every frequency, first power that indicates magnitude of sound represented by the input audio signal (serving as a first power computation means). The signal correction apparatus estimates a correction function that is a continuous function defining a relation between each frequency and a correction coefficient used to approximate the first power computed at that frequency to the reference power predetermined for that frequency (serving as a correction function estimation means). The signal correction apparatus multiplies the computed first power by the correction coefficient acquired in accordance with the relation defined by the estimated correction function so as to correct the first power at every frequency (serving as a power correcting means). | 09-15-2011 |
20110246185 | VOICE ACTIVITY DETECTOR, VOICE ACTIVITY DETECTION PROGRAM, AND PARAMETER ADJUSTING METHOD - A frame extracting means | 10-06-2011 |
20120029915 | METHOD FOR PROCESSING MULTICHANNEL ACOUSTIC SIGNAL, SYSTEM THEREFOR, AND PROGRAM - A method for processing multichannel acoustic signals which processes input signals of a plurality of channels including the voices of a plurality of speaking persons. The method is characterized by detecting the voice section of each speaking person or each channel, detecting overlapped sections wherein the detected voice sections are common between channels, determining a channel to be subjected to crosstalk removal and the section thereof by use of at least voice sections not including the detected overlapped sections, and removing crosstalk in the sections of the channel to be subjected to the crosstalk removal. | 02-02-2012 |
20120029916 | METHOD FOR PROCESSING MULTICHANNEL ACOUSTIC SIGNAL, SYSTEM THEREFOR, AND PROGRAM - A method for processing multichannel acoustic signals which is characterized by calculating the feature quantity of each channel from the input signals of a plurality of channels, calculating similarity between the channels in the feature quantity of each channel, selecting channels having high similarity, and separating signals using the input signals of the selected channels. | 02-02-2012 |
20120046940 | METHOD FOR PROCESSING MULTICHANNEL ACOUSTIC SIGNAL, SYSTEM THEREOF, AND PROGRAM - A method for processing multichannel acoustic signals, whereby input signals of a plurality of channels including the voices of a plurality of speaking persons are processed. The method is characterized by comprising: calculating the first feature quantity of the input signals of the multichannels for each channel; calculating similarity of the first feature quantity of each channel between the channels; selecting channels having high similarity; separating signals using the input signals of the selected channels; inputting the input signals of the channels having low similarity and the signals after the signal separation; and detecting a voice section of each speaking person or each channel. | 02-23-2012 |
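
As an illustration of the sub-band SNR decision summarized in entry 20110071825 above, the following is a minimal sketch only, not the patented implementation. It assumes the per-bin signal power and a per-bin noise power estimate are already available for one microphone and one frame. The function names, the sub-band width of 4 bins, and the 6 dB threshold are hypothetical choices made for this example.

```python
import math


def subband_powers(bin_powers, width):
    """Sum per-bin power over consecutive sub-bands of `width` bins."""
    return [sum(bin_powers[i:i + width])
            for i in range(0, len(bin_powers), width)]


def max_subband_snr_db(signal_bins, noise_bins, width, eps=1e-12):
    """Return the largest sub-band SNR in dB for one microphone.

    `noise_bins` is an assumed, externally supplied noise power estimate;
    `eps` avoids division by zero and log of zero.
    """
    sig = subband_powers(signal_bins, width)
    noi = subband_powers(noise_bins, width)
    return max(10.0 * math.log10((s + eps) / (n + eps))
               for s, n in zip(sig, noi))


def is_voice(signal_bins, noise_bins, width=4, threshold_db=6.0):
    """Decide voice/non-voice by comparing the largest sub-band SNR
    with a threshold (hypothetical value, chosen for illustration)."""
    return max_subband_snr_db(signal_bins, noise_bins, width) > threshold_db


# Usage: flat noise floor, with extra energy only in the upper sub-band.
noise = [1.0] * 8
frame = [1.0] * 4 + [10.0] * 4
print(is_voice(frame, noise))   # energy concentrated in one sub-band
print(is_voice(noise, noise))   # frame identical to the noise estimate
```

Taking the maximum over sub-bands, rather than a full-band SNR, lets a frame whose speech energy is concentrated in a narrow band still exceed the threshold when the other bands are dominated by noise.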