Patent application number | Description | Published |
20080270131 | METHOD, PREPROCESSOR, SPEECH RECOGNITION SYSTEM, AND PROGRAM PRODUCT FOR EXTRACTING TARGET SPEECH BY REMOVING NOISE - The present invention relates to a method, preprocessor, speech recognition system, and program product for extracting a target speech by removing noise. In an embodiment of the invention target speech is extracted from two input speeches, which are obtained through at least two speech input devices installed in different places in a space, applies a spectrum subtraction process by using a noise power spectrum (Uω) estimated by one or both of the two speech input devices (Xω(T)) and an arbitrary subtraction constant (α) to obtain a resultant subtracted power spectrum (Yω(T)). The invention further applies a gain control based on the two speech input devices to the resultant subtracted power spectrum to obtain a gain-controlled power spectrum (Dω(T)). The invention further applies a flooring process to said resultant gain-controlled power spectrum on the basis of arbitrary Flooring factor (β) to obtain a power spectrum for speech recognition (Zω(T)). | 10-30-2008 |
20090043570 | METHOD FOR PROCESSING SPEECH SIGNAL DATA - Method for processing speech signal data. A speech signal is divided into frames. Each frame is characterized by a frame number T representing a unique interval of time. Each speech signal is characterized by a power spectrum with respect to frame T and frequency band ω. A speech segment and a reverberation segment of the speech signal is determined. L filter coefficients W(k) (k=1, 2, . . . , L) respectively corresponding to L frames immediately preceding frame T are computed such that the L filter coefficients minimize a function Φ that is a linear combination of sum of squares of a residual speech power in the reverberation segment and a sum of squares of a subtracted speech power in the speech segment. The computed L filter coefficients are stored within storage media of the computing apparatus. | 02-12-2009 |
20090210224 | SYSTEM, METHOD AND PROGRAM FOR SPEECH PROCESSING - The present invention relates to a system, method and program for speech recognition. In an embodiment of the invention a method for processing a speech signal consists of receiving a power spectrum of a speech signal and generating a log power spectrum signal of the power spectrum. The method further consists of performing discrete cosine transformation on the log power spectrum signal and cutting off cepstrum upper and lower terms of the discrete cosine transformed signal. The method further consists of performing inverse discrete cosine transformation on the signal from which the cepstrum upper and lower terms are cut off. The method further consists of converting the inverse discrete cosine transformed signal so as to bring the signal back to a power spectrum domain and filtering the power spectrum of the speech signal by using, as a filter, the signal which is brought back to the power spectrum domain. | 08-20-2009 |
20090222258 | VOICE ACTIVITY DETECTION SYSTEM, METHOD, AND PROGRAM PRODUCT - A voice activity detection method in a low SNR environment. The voice activity detection is performed by extracting a long-term spectrum variation component and a harmonic structure as feature vectors from a speech signal and increasing difference in feature vectors between speech and non-speech (i) using the long-term spectrum variation component feature or (ii) using a long-term spectrum variation component extraction and a harmonic structure feature extraction. A correct rate and an accuracy rate of the voice activity detection is improved over conventional methods by using a long-term spectrum variation component having a window length over an average phoneme duration of an utterance in the speech signal. The voice activity detection system and method provides speech processing, automatic speech recognition, and speech output capable of very accurate voice activity detection. | 09-03-2009 |
20100033711 | METHOD OF MEASURING PHYSICAL QUANTITY OF OBJECT TO BE MEASURED, AND METHOD OF CONTROLLING THE SAME - This invention relates to optical sensing technology to measure and control a physical quantity of an object that exists on or within a microstructure object, utilizing Brillouin scattering decreases. The measurement method prepares an optical waveguide one-, two- or three-dimensionally, on or within a micro-chemical chip, IC chip, or other element, and measures a physical quantity of the object on the basis of a property variation of light attributed to Brillouin scattering occurring in the optical waveguide. | 02-11-2010 |
20100040108 | METHOD FOR MEASURING TEMPERATURE DISTRIBUTION OF OBJECT, AND SENSOR UNIT - The present invention relates to a measuring method and a sensor unit of measuring temperature distribution of an object by using an optical fiber sensing technology of BOCDA system. In the measuring method, an optical fiber that functions as a BOCDA-type optical fiber sensor is disposed two-dimensionally or three-dimensionally with respect to a predetermined measurement region of the object, and thereby the temperature distribution of the object can be measured at a high speed and a high accuracy, in the predetermined measurement region configuring a surface or space where the optical fiber is disposed. | 02-18-2010 |
20110131044 | TARGET VOICE EXTRACTION METHOD, APPARATUS AND PROGRAM PRODUCT - An apparatus, program product and method is provided for separating a target voice from a plurality of other voices having different directions of arrival. The method comprises the steps of disposing a first and a second voice input device at a predetermined distance from one another and upon receipt of voice signals at said devices calculating discrete Fourier transforms for the signals and calculating a CSP (cross-power spectrum phase) coefficient by superpositioning multiple frequency-bin components based on correlation of the two spectra signals received and then calculating a weighted CSP coefficient from said two discrete Fourier-transformed speech signals. A target voice is separated when received by said devices from other voice signals in a spectrum by using the calculated weighted CSP coefficient. | 06-02-2011 |
Patent application number | Description | Published |
20080306742 | APPARATUS, METHOD, AND PROGRAM FOR SUPPORTING SPEECH INTERFACE DESIGN - For design of a speech interface accepting speech control options, speech samples are stored on a computer-readable medium. A similarity calculating unit calculates a certain indication of similarity of first and second sets of ones of the speech samples, the first set of speech samples being associated with a first speech control option and the second set of speech samples being associated with a second speech control option. A display unit displays the similarity indication. | 12-11-2008 |
20090076815 | Speech Recognition Apparatus, Speech Recognition Apparatus and Program Thereof - Provided is a method for canceling background noise of a sound source other than a target direction sound source in order to realize highly accurate speech recognition, and a system using the same. In terms of directional characteristics of a microphone array, due to a capability of approximating a power distribution of each angle of each of possible various sound source directions by use of a sum of coefficient multiples of a base form angle power distribution of a target sound source measured beforehand by base form angle by using a base form sound, and power distribution of a non-directional background sound by base form, only a component of the target sound source direction is extracted at a noise suppression part. In addition, when the target sound source direction is unknown, at a sound source localization part, a distribution for minimizing the approximate residual is selected from base form angle power distributions of various sound source directions to assume a target sound source direction. Further, maximum likelihood estimation is executed by using voice data of the component of the sound source direction passed through these processes, and a voice model obtained by predetermined modeling of the voice data, and speech recognition is carried out based on an obtained assumption value. | 03-19-2009 |
20100008516 | METHOD AND SYSTEM FOR POSITION DETECTION OF A SOUND SOURCE - A position detection method, system, and computer readable article of manufacture tangibly embodying computer readable instructions for executing the method for detecting the position of a sound source using at least two microphones. The method includes the steps of: emitting a reproduced sound from the sound source; observing the reproduced sound and an observed sound at the microphones; converting the reproduced sound and the observed sound into electrical signals; transforming the signals of the reproduced sound and of the observed sound into frequency spectra by a frequency spectrum transformer apparatus; calculating Crosspower Spectrum Phase (CSP) coefficients of the frequency spectra of the signals by a CSP coefficient calculator apparatus; and calculating distances between the position of the sound source and the positions of the microphones based on the calculated CSP coefficients by a distance calculating apparatus, thereby detecting the position of the sound source. | 01-14-2010 |
20110301945 | SPEECH SIGNAL PROCESSING SYSTEM, SPEECH SIGNAL PROCESSING METHOD AND SPEECH SIGNAL PROCESSING PROGRAM PRODUCT FOR OUTPUTTING SPEECH FEATURE - A speech signal processing system which outputs a speech feature, divides an input speech signal into frames so that each pair of consecutive frames have a frame shift length equal to at least one period of the speech signal and have an overlap equal to at least a predetermined length, applies discrete Fourier transform to each of the frames, calculates a CSP coefficient for the pair, searches a predetermined search range in which a speech wave lags a period equal to at least one period to obtain the maximum value of the CSP coefficient for the pair, and generates time-series data of the maximum CSP coefficient values arranged in the order in which the frames appear. A method and a computer readable article of manufacture for the implementing the same are also provided. | 12-08-2011 |
20120185243 | SPEECH FEATURE EXTRACTION APPARATUS, SPEECH FEATURE EXTRACTION METHOD, AND SPEECH FEATURE EXTRACTION PROGRAM - A speech feature extraction apparatus, speech feature extraction method, and speech feature extraction program. A speech feature extraction apparatus includes: first difference calculation module to: (i) receive, as an input, a spectrum of a speech signal segmented into frames for each frequency bin; and (ii) calculate a delta spectrum for each of the frame, where the delta spectrum is a difference of the spectrum within continuous frames for the frequency bin; and first normalization module to normalize the delta spectrum of the frame for the frequency bin by dividing the delta spectrum by a function of an average spectrum; where the average spectrum is an average of spectra through all frames that are overall speech for the frequency bin; and where an output of the first normalization module is defined as a first delta feature. | 07-19-2012 |
20120330657 | SPEECH FEATURE EXTRACTION APPARATUS, SPEECH FEATURE EXTRACTION METHOD, AND SPEECH FEATURE EXTRACTION PROGRAM - A speech feature extraction apparatus, speech feature extraction method, and speech feature extraction program. A speech feature extraction apparatus includes: first difference calculation module to: (i) receive, as an input, a spectrum of a speech signal segmented into frames for each frequency bin; and (ii) calculate a delta spectrum for each of the frame, where the delta spectrum is a difference of the spectrum within continuous frames for the frequency bin; and first normalization module to normalize the delta spectrum of the frame for the frequency bin by dividing the delta spectrum by a function of an average spectrum; where the average spectrum is an average of spectra through all frames that are overall speech for the frequency bin; and where an output of the first normalization module is defined as a first delta feature. | 12-27-2012 |
20140337026 | METHOD, APPARATUS, AND PROGRAM FOR GENERATING TRAINING SPEECH DATA FOR TARGET DOMAIN - A method and system for generating training data for a target domain using speech data of a source domain. The training data generation method including: reading out a Gaussian mixture model (GMM) of a target domain trained with a clean speech data set of the target domain; mapping, by referring to the GMM of the target domain, a set of source domain speech data received as an input to the set of target domain speech data on a basis of a channel characteristic of the target domain speech data; and adding a noise of the target domain to the mapped set of source domain speech data to output a set of pseudo target domain speech data. | 11-13-2014 |
Patent application number | Description | Published |
20090098349 | DECORATED SHEET AND MOLDED ARTICLE INTEGRATED WITH THE SAME - There is provided a decorated sheet having a good adhesiveness with a molding resin regardless of a pattern of a colored part. A decorated sheet comprising: a transparent layer coating on the entire surface of a thermoplastic substrate sheet or at least in the areas of said substrate sheet having no coloring layer formed, said transparent layer coated by a transparent ink with as common varnish composition as the ink for coloring layer formation, and an adhesive layer formed as the outermost layer of said decorated sheet on the side provided with the colored layers of the substrate sheet. A molded article is fabricated by integrating the decorated sheet with a molding resin. | 04-16-2009 |
20130185067 | NOISE REDUCTION METHOD. PROGRAM PRODUCT AND APPARATUS - A probability model represented as the product of the probability distribution of a mismatch vector g (or clean speech x) with an observed value y as a factor and the probability distribution of a mismatch vector g (or clean speech x) with a confidence index β for each band as a factor, executes MMSE estimation on the probability model, and estimates a clean speech estimated value x̂. As a result, each band influences the result of MMSE estimation, with a degree of contribution in accordance with the level of its confidence. Further, the higher the S/N ratio of observation speech, the more the output value becomes shifted to the observed value. As a result, the output of a front-end is optimized. | 07-18-2013 |
20130238324 | LOCAL PEAK WEIGHTED-MINIMUM MEAN SQUARE ERROR (LPW-MMSE) ESTIMATION FOR ROBUST SPEECH - A system and method for noise reduction applied to a speech recognition front-end. An output of a front-end is optimized by giving, as a weight to the output for each band, a confidence index representing the remarkableness of the harmonic structure of observation speech. In a first method, when clean speech is estimated by executing MMSE estimation on a model that gives a probability distribution of noise-removed speech generated from observation speech, the posterior probability of the MMSE estimation is weighted using the confidence index as a weight. In a second method, linear interpolation is executed, for each band, between an observed value of observation speech and an estimated value of clean speech, with the confidence index serving as a weight. The first method and the second method can be combined. | 09-12-2013 |
Patent application number | Description | Published |
20080294430 | Noise reduction device, program and method - A noise reduction device is configured by use of: means for calculating a predetermined constant, and a predetermined reference signal Rω(T) in the frequency domain, respectively by use of adaptive coefficients Wω(m), and for thereby obtaining estimated values Nω and Qω(T) respectively of stationary noise components, and non-stationary noise components corresponding to the reference signal, which are included in a predetermined observed signal Xω(T) in the frequency domain; means and for applying a noise reduction process to the observed signal on the basis of each of the estimated values, and for updating each of the adaptive coefficients on the basis of a result of the process; and an adaptive learning means and for repeating the obtaining of the estimated values and the updating of the adaptive coefficients, and for thereby learning each of the adaptive coefficients. | 11-27-2008 |
20080306734 | Signal Noise Reduction - Provision to reduce production of musical noise. A noise reduction device includes: means for calculating a rank for each element included in a first region having predetermined sizes in the time axis direction and in the frequency axis direction, depending on a value of the element, in a noise section of an observed signal indicating variation of a frequency spectrum with time; means for calculating a rank for each element included in a second region, depending on a value of the element, the second region having predetermined sizes in the time axis direction and in the frequency axis direction in the observed signal; and means for subtracting, from the values of the respective elements in the second region, values based on the values of the respective elements in the first region whose ranks correspond to ranks of respective elements in the second region. | 12-11-2008 |
Patent application number | Description | Published |
20120025268 | SEMICONDUCTOR WAFER, METHOD OF PRODUCING SEMICONDUCTOR WAFER, AND ELECTRONIC DEVICE - There is provided a compound semiconductor wafer that is suitably used as a semiconductor wafer to form a plurality of different types of devices such as an HBT and an FET thereon. The semiconductor wafer includes a first semiconductor, a carrier-trapping layer that is formed on the first semiconductor and has an electron-trapping center or a hole-trapping center, a second semiconductor that is epitaxially grown on the carrier-trapping layer and serves as a channel in which a free electron or a free hole moves, and a third semiconductor including a stack represented by n-type semiconductor/p-type semiconductor/n-type semiconductor or represented by p-type semiconductor/n-type semiconductor/p-type semiconductor, where the stack epitaxially grown on the second semiconductor. | 02-02-2012 |
20120104462 | SEMICONDUCTOR WAFER, METHOD OF PRODUCING SEMICONDUCTOR WAFER, AND ELECTRONIC DEVICE - A semiconductor wafer includes a first semiconductor, and a second semiconductor formed directly or indirectly on the first semiconductor. The second semiconductor contains a first impurity atom exhibiting p-type or n-type conductivity, and a second impurity atom selected such that the Fermi level of the second semiconductor containing both the first and second impurity atoms is closer to the Fermi level of the second semiconductor containing neither the first impurity atom nor the second impurity atom, than the Fermi level of the second semiconductor containing the first impurity atom is. For example, the majority carrier of the second semiconductor is an electron, and the Fermi level of the second semiconductor containing the first and second impurity atoms is lower than the Fermi level of the second semiconductor containing the first impurity atom. | 05-03-2012 |
20140091433 | METHOD OF PRODUCING SEMICONDUCTOR WAFER, AND SEMICONDUCTOR WAFER - There is provided a method of producing a semiconductor wafer, including: forming a compound semiconductor layer on a base wafer by epitaxial growth; cleansing a surface of the compound semiconductor layer by means of a cleansing agent containing a selenium compound; and forming an insulating layer on the surface of the compound semiconductor layer. Examples of the selenium compound include a selenium oxide. Examples of the selenium oxide include H | 04-03-2014 |