Follow us on Twitter
twitter icon@FreshPatents


Speech Recognition patents

      

This page is updated frequently with new Speech Recognition-related patent applications.

SALE: 220+ Speech Recognition-related patent PDFs



 Apparatuses and methods for enhanced speech recognition in variable environments patent thumbnailnew patent Apparatuses and methods for enhanced speech recognition in variable environments
Systems, apparatuses, and methods are described to increase a signal-to-noise ratio difference between a main channel and reference channel. The increased signal-to-noise ratio difference is accomplished with an adaptive threshold for a desired voice activity detector (dvad) and shaping filters.
Kopin Corporation


 Speech recognition circuit using parallel processors patent thumbnailnew patent Speech recognition circuit using parallel processors
A speech recognition circuit comprises an input buffer for receiving processed speech parameters. A lexical memory contains lexical data for word recognition.
Zentian Limited


 Mixed speech recognition patent thumbnailnew patent Mixed speech recognition
The claimed subject matter includes a system and method for recognizing mixed speech from a source. The method includes training a first neural network to recognize the speech signal spoken by the speaker with a higher level of a speech characteristic from a mixed speech sample.
Microsoft Technology Licensing, Llc


 Apparatus and  normalizing input data of acoustic model and speech recognition apparatus patent thumbnailnew patent Apparatus and normalizing input data of acoustic model and speech recognition apparatus
An apparatus for normalizing input data of an acoustic model includes a window extractor configured to extract windows of frame data to be input to an acoustic model from frame data of a speech to be recognized, and a normalizer configured to normalize the frame data to be input to the acoustic model in units of the extracted windows.. .
Samsung Electronics Co., Ltd.


 Speech interaction apparatus and method patent thumbnailSpeech interaction apparatus and method
According to one embodiment, a speech interaction apparatus for performing an interaction with a user based on a scenario includes a speech recognition unit, a determination unit, a selection unit and an execution unit. The speech recognition unit recognizes a speech of the user and generates a recognition result text.
Kabushiki Kaisha Toshiba


 Information processing system, and vehicle-mounted device patent thumbnailInformation processing system, and vehicle-mounted device
This invention can enhance the convenience of a user. An information processing system 1 includes: a vehicle-mounted device 3 which has a sound pickup unit 36 that picks up a speech sound, and a transmitting unit that transmits speech data that is generated based on the speech sound that is picked up to a control server 8; and the control server 8 which has a server storage unit 82 that stores a pictogram correspondence table 82a in which recognition keywords and pictogram ids indicating a plurality of pictograms that correspond to the recognition keywords are associated, and a server control unit 81 which executes pictogram processing that selects a recognition keyword that corresponds to text representing a speech sound that is generated by speech recognition based on speech data from among the recognition keywords included in the pictogram correspondence table 82a, and in accordance with a predetermined condition, selects a single pictogram id from among a plurality of pictogram ids that are associated with the selected recognition keyword..
Clarion Co., Ltd.


 Flexible schema for language model customization patent thumbnailFlexible schema for language model customization
The customization of language modeling components for speech recognition is provided. A list of language modeling components may be made available by a computing device.
Microsoft Technology Licensing, Llc


 Dynamically adding or removing functionality to speech recognition systems patent thumbnailDynamically adding or removing functionality to speech recognition systems
A system and method of changing features of an existing automatic speech recognition (asr) system includes: monitoring speech received from a vehicle occupant for one or more keywords identifying a feature to remove from or add to the asr system; detecting the keywords in the monitored speech; and adding the identified feature to or removing the identified feature from from the asr system.. .
Gm Global Technology Operations Llc


 Techniques to provide a standard interface to a speech recognition platform patent thumbnailTechniques to provide a standard interface to a speech recognition platform
Techniques and systems to provide speech recognition services over a network using a standard interface are described. In an embodiment, a technique includes accepting a speech recognition request that includes at least audio input, via an application program interface (api).
Microsoft Technology Licensing, Llc


 Speech recognition apparatus and method with acoustic modelling patent thumbnailSpeech recognition apparatus and method with acoustic modelling
Provided is a speech recognition apparatus. The apparatus includes a preprocessor configured to extract select frames from all frames of a first speech of a user, and a score calculator configured to calculate an acoustic score of a second speech, made up of the extracted select frames, by using a deep neural network (dnn)-based acoustic model, and to calculate an acoustic score of frames, of the first speech, other than the select frames based on the calculated acoustic score of the second speech..
Samsung Electronics Co., Ltd.


Voice language communication device and system

A voice language communication device and system that includes: a speaker; a microphone; a display panel; a control panel; a power button; a record button; software stored on a hard drive; a language database, where software accesses the language database during operation; a plurality of languages stored on the language database; speech recognition functions related to the software, where the speech recognition functions recognizes a user's language as an input language; and an output language, where the output language is a translation of the input language and the output language is instantaneously emitted to the speaker.. .

Streamlined navigational speech recognition

A system and method of performing automatic speech recognition (asr) includes: receiving speech at a vehicle microphone; communicating the received speech to an asr system; measuring an amount of time that elapses while speech is received; selecting a point-of-interest (poi) context or an address context based on the measured amount of received time; and processing the received speech using a poi context-based grammar when a poi context is selected or an address-based grammar when an address context is selected.. .
Gm Global Technology Operations Llc

Speech recognition system and gain setting system

When an instruction to start voice input is received from the user, a gain controller acquires, from a gain table which defines a correspondence between vehicle speed ranges and gains, a gain corresponding to a vehicle speed range including the vehicle speed of a vehicle detected by a vehicle speed detector, and sets the acquired gain as the gain of an input amplifier that amplifies an input audio signal output by a microphone. As a gain corresponding to each vehicle speed range, the gain table records a gain of the input amplifier corresponding, in an experimentally determined frequency distribution of peak values in the vehicle speed range, to a maximum frequency in the range of magnitude of voice output as an input audio signal by the microphone and to be input to a speech recognition engine as voice having a magnitude within the input range of the speech recognition engine..
Alpine Electronics, Inc.

Incremental utterance decoder combination for efficient and accurate decoding

An incremental speech recognition system. The incremental speech recognition system incrementally decodes a spoken utterance using an additional utterance decoder only when the additional utterance decoder is likely to add significant benefit to the combined result.
Microsoft Technology Licensing, Llc

System and determining recipient of spoken command in a control system

Disclosed is an apparatus and method for determining which controllable device an audible command is directed towards, the method comprising: receiving at each of two or more controlling devices the audible command signal, the audible command being directed to control at least one of two or more controllable devices controlled by a respective one of the two or more controlling devices; digitizing each of the received audible command signals; attaching a unique identifier to each digitized audible command so as to uniquely correlate it to a respective controlling device; determining a magnitude of each of the digitized audible command; determining a digitized audible command with the greatest magnitude, and further determining to which controlling device the audible command is directed to on the basis of the unique identifier associated with the digitized audible command with the greatest magnitude; performing speech recognition on the digitized audible command with the greatest magnitude; and forwarding a command to the controlling device corresponding to the digitized audible command with the greatest magnitude, the command corresponding to the audible command that can be implemented on the controllable device controlled by the controlling device.. .
Crestron Electronics, Inc.

Semiconductor device, system, electronic device, and speech recognition method

A semiconductor device is provided with a data storage unit configured to store speech reproduction data that includes transition destination information or speech recognition option data that includes transition destination information, and a processor configured to perform processing for generating an output speech signal using speech reproduction data read out from the data storage unit or perform speech recognition processing on an input speech signal using speech recognition option data read out from the data storage unit, and to read out, based on the transition destination information included in speech reproduction data or speech recognition option data used in the processing, speech recognition option data or speech reproduction data to be used in the next processing from the data storage unit.. .
Seiko Epson Corporation

Methods for speech enhancement and speech recognition using neural networks

The present invention relates to implementing a system and method to improve speech recognition and speech enhancement of noisy speech. The present invention discloses a way to improve the noise robustness of a speech recognition system by providing additional input to a neural network speech classifier.

Dynamic adaptation of language models and semantic tracking for automatic speech recognition

Generally, this disclosure provides systems, devices, methods and computer readable media for adaptation of language models and semantic tracking to improve automatic speech recognition (asr). A system for recognizing phrases of speech from a conversation may include an asr circuit configured to transcribe a user's speech to a first estimated text sequence, based on a generalized language model.
Intel Corporation

Multichannel raw-waveform neural networks

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for using neural networks. One of the methods includes receiving, by a neural network in a speech recognition system, first data representing a first raw audio signal and second data representing a second raw audio signal, the first raw audio signal and the second raw audio signal for the same period of time, generating, by a spatial filtering convolutional layer in the neural network, a spatial filtered output the first data and the second data, generating, by a spectral filtering convolutional layer in the neural network, a spectral filtered output using the spatial filtered output, and processing, by one or more additional layers in the neural network, the spectral filtered output to predict sub-word units encoded in both the first raw audio signal and the second raw audio signal..
Google Inc.

Hotword detection on multiple devices

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for hotword detection on multiple devices are disclosed. In one aspect, a method includes the actions of receiving, by a first computing device, audio data that corresponds to an utterance.
Google Inc.

Apparatus and speech recognition, and training transformation parameter

Provided are a method and an apparatus for speech recognition, and a method and an apparatus for training transformation parameter. A speech recognition apparatus includes an acoustic score calculator configured to use an acoustic model to calculate an acoustic score of a speech input, an acoustic score transformer configured to transform the calculated acoustic score into an acoustic score corresponding to standard pronunciation by using a transformation parameter, and a decoder configured to decode the transformed acoustic score to output a recognition result of the speech input..
Samsung Electronics Co., Ltd.

Automatic speech recognition confidence classifier

The described technology provides normalization of speech recognition confidence classifier (cc) scores that maintains the accuracy of acceptance metrics. A speech recognition cc scores quantitatively represents the correctness of decoded utterances in a defined range (e.g., [0,1]).
Microsoft Technology Licensing, Llc

Automatic speech recognition with detection of at least one contextual element, and application management and maintenance of aircraft

An automatic speech recognition with detection of at least one contextual element, and application to aircraft flying and maintenance are provided. The automatic speech recognition device comprises a unit for acquiring an audio signal, a device for detecting the state of at least one contextual element, and a language decoder for determining an oral instruction corresponding to the audio signal.
Dassault Aviation

Apparatus and generating acoustic model, and speech recognition

Described are an apparatus and method for generating to generate an acoustic model. The apparatus and method include a processor a processor configured to calculate a noise representation that represents noise data by using a noise model, and generate the acoustic model through training using training noisy speech data, which comprises speech data and the noise data, a string of phonemes corresponding to the speech data, and the noise representation..
Samsung Electronics Co., Ltd.

Methods and speech recognition using a garbage model

Methods and apparatus for performing speech recognition using a garbage model. The method comprises receiving audio comprising speech and processing at least some of the speech using a garbage model to produce a garbage speech recognition result.
Nuance Communication, Inc

Microphone placement for sound source direction estimation

Architectures of numbers of microphones and their positioning in a device for sound source direction estimation and source separation are presented. The directions of sources are front, back, left, right, top, and bottom of the device, and can be determined by amplitude and phase differences of microphone signals with proper microphone positioning.
Microsoft Technology Licensing, Llc

Method and device for speech recognition

Embodiments of the present disclosure provide a method and device for speech recognition. The solution comprises: receiving a first speech signal issued by a user; performing analog to digital conversion on the first speech signal to generate a first digital signal after the analog to digital conversion; extracting a first speech parameter from the first digital signal, the first speech parameter describing a speech feature of the first speech signal; if the first speech parameter coincides with a first prestored speech parameter in a sample library, executing control signalling instructed by the first digital signal, the sample library prestoring prestored speech parameters of n users, n≧1.
Beijing Boe Multimedia Technology Co., Ltd.

Speech recognition apparatus and method

An apparatus includes a language model group identifier configured to identify a language model group based on determined characteristic data of a user, and a language model generator configured to generate a user-based language model by interpolating a general language model for speech recognition based on the identified language model group.. .
Samsung Electronics Co., Ltd.

Method and system for remotely training and commanding the speech recognition system on a cockpit via a carry-on-device in a connected aircraft

A method for implementing a speaker-independent speech recognition system with reduced latency is provided. The method includes capturing voice data at a carry-on-device from a user during a pre-flight check-in performed by the user for an upcoming flight; extracting features associated with the user from the captured voice data at the carry-on-device; uplinking the extracted features to the speaker-independent speech recognition system onboard the aircraft; and adapting the extracted features with an acoustic feature model of the speaker-independent speech recognition system..
Honeywell International Inc.

Adapting a speech system to user pronunciation

A system and method of adapting a speech system includes the steps of: receiving confirmation of a phonetic transcription of one or more names, receiving confirmation of a selected stored text result, and storing the phonetic transcription with the selected stored text result using an automatic speech recognition (asr) system, a text-to-speech (tts) system, or both.. .
Gm Global Technology Operations Llc

Enhanced speech endpointing

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for receiving audio data including an utterance, obtaining context data that indicates one or more expected speech recognition results, determining an expected speech recognition result based on the context data, receiving an intermediate speech recognition result generated by a speech recognition engine, comparing the intermediate speech recognition result to the expected speech recognition result for the audio data based on the context data, determining whether the intermediate speech recognition result corresponds to the expected speech recognition result for the audio data based on the context data, and setting an end of speech condition and providing a final speech recognition result in response to determining the intermediate speech recognition result matches the expected speech recognition result, the final speech recognition result including the one or more expected speech recognition results indicated by the context data.. .
Google Inc.

Enhanced speech endpointing

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for receiving audio data including an utterance, obtaining context data that indicates one or more expected speech recognition results, determining an expected speech recognition result based on the context data, receiving an intermediate speech recognition result generated by a speech recognition engine, comparing the intermediate speech recognition result to the expected speech recognition result for the audio data based on the context data, determining whether the intermediate speech recognition result corresponds to the expected speech recognition result for the audio data based on the context data, and setting an end of speech condition and providing a final speech recognition result in response to determining the intermediate speech recognition result matches the expected speech recognition result, the final speech recognition result including the one or more expected speech recognition results indicated by the context data.. .
Google Inc.

Systems, computer-implemented methods, and tangible computer-readable storage media for transcription alignment

Disclosed herein are systems, computer-implemented methods, and tangible computer-readable storage media for captioning a media presentation. The method includes receiving automatic speech recognition (asr) output from a media presentation and a transcription of the media presentation.
At&t Intellectual Property I, L.p.

Audio-visual speech recognition with scattering operators

Aspects described herein are directed towards methods, computing devices, systems, and computer-readable media that apply scattering operations to extracted visual features of audiovisual input to generate predictions regarding the speech status of a subject. Visual scattering coefficients generated according to one or more aspects described herein may be used as input to a neural network operative to generate the predictions regarding the speech status of the subject.
Nuance Communications, Inc.

Building of n-gram language model for automatic speech recognition (asr)

A method, a system, and a computer program product for building an n-gram language model for an automatic speech recognition. The method includes reading training text data and additional text data both for the n-gram language model from a storage, and building the n-gram language model by a smoothing algorithm having discount parameters for n-gram counts.
International Business Machines Corporation

Method and improving a neural network language model, and speech recognition method and apparatus

According to one embodiment, an apparatus for improving a neural network language model of a speech recognition system includes a word classifying unit, a language model training unit and a vector incorporating unit. The word classifying unit classifies words in a lexicon of the speech recognition system.
Kabushiki Kaisha Toshiba

Method and improving a language model, and speech recognition method and apparatus

According to one embodiment, an apparatus for improving a language model of a speech recognition system includes an extracting unit, a classifying unit, and a setting unit. The extracting unit extracts user words from a user document provided by a user.
Kabushiki Kaisha Toshiba

Topic shift detector

Aspects detect or recognize shifts in topics in computer implemented speech recognition processes as a function of mapping keywords to non-verbal cues. An initial topic is mapped to one or more keywords extracted from a first spoken query within a user keyword ontology mapping.
International Business Machines Corporation

Speech recognition apparatus and method

A speech recognition apparatus and method. The speech recognition apparatus includes a first recognizer configured to generate a first recognition result of an audio signal, in a first linguistic recognition unit, by using an acoustic model, a second recognizer configured to generate a second recognition result of the audio signal, in a second linguistic recognition unit, by using a language model, and a combiner configured to combine the first recognition result and the second recognition result to generate a final recognition result in the second linguistic recognition unit and to reflect the final recognition result in the language model.
Samsung Electronics Co., Ltd.

Speech recognition apparatus, vehicle having the speech recognition apparatus, and controlling the vehicle

Disclosed herein are speech recognition apparatuses, vehicles having the speech recognition apparatuses, and methods for controlling vehicles. According to an aspect, a speech recognition apparatus includes a speech input unit configured to receive a speech command from a user, a communication unit configured to receive the result of processing for speech recognition acquired by at least one user terminal located near the user, and a controller configured to compare the result of processing for speech recognition acquired from the speech command received by the speech input unit to the result of processing for speech recognition acquired by the at least one user terminal, thus processing the speech command according to the result of the comparison..
Hyundai Motor Company

Speech recognition system with abbreviated training

A method of adapting a speech recognition system to its user includes gathering information about a user of a speech recognition system, selecting at least a part of a speech model reflecting estimated speech attributes of the user based on the information about the user, running, in the speech recognition system, a speech model including the selected at least a part of a speech model, and training, in the speech recognition system, other parts of the speech model to reflect identified speech attributes of the user.. .
Toyota Motor Engineering & Manufacturing North America, Inc.

Order statistic techniques for neural networks

According to some aspects, a method of classifying speech recognition results is provided, using a neural network comprising a plurality of interconnected network units, each network unit having one or more weight values, the method comprising using at least one computer, performing acts of providing a first vector as input to a first network layer comprising one or more network units of the neural network, transforming, by a first network unit of the one or more network units, the input vector to produce a plurality of values, the transformation being based at least in part on a plurality of weight values of the first network unit, sorting the plurality of values to produce a sorted plurality of values, and providing the sorted plurality of values as input to a second network layer of the neural network.. .
Nuance Communications, Inc.

Adaptation of speech recognition

A method, computer program product, and system for adapting speech recognition of a user's speech is provided. The method includes receiving a first utterance from a user having a duration below a predetermined threshold, identifying at least one further utterance from the user that provides additional information, generating a concatenated utterance by concatenating the first utterance with the at least one further utterance, transmitting the concatenated utterance to a speech recognition server, receiving a transcription of the concatenated utterance from the speech recognition server that includes a transcription of the first utterance, and extracting the transcription of the first utterance from the transcription of the concatenated utterance.
International Business Machines Corporation

Computer-implemented performing distributed speech recognition

A computer-implemented system and method for performing distributed speech recognition is provided. Audio data is collected.
Intellisist, Inc.

Information processing apparatus, control method, and program

There is provided an information processing apparatus, control method, and program capable of notifying a user of a candidate for a response, from the middle of a speech, through a voice u1, the information processing apparatus including: a semantic analysis unit configured to perform semantic analysis on speech text recognized by a speech recognition unit in the middle of a speech; a score calculation unit configured to calculate a score for a response candidate on the basis of a result of the analysis performed by the semantic analysis unit; and a notification control unit configured to perform control to notify of the response candidate, in the middle of the speech, according to the score calculated by the score calculation unit.. .
Sony Corporation

Speech recognition using an operating system hooking component for context-aware recognition models

Inputs provided into user interface elements of an application are observed. Records are made of the inputs and the state(s) the application was in while the inputs were provided.
Mmodal Ip Llc

Data augmentation method based on stochastic feature mapping for automatic speech recognition

A method of augmenting training data includes converting a feature sequence of a source speaker determined from a plurality of utterances within a transcript to a feature sequence of a target speaker under the same transcript, training a speaker-dependent acoustic model for the target speaker for corresponding speaker-specific acoustic characteristics, estimating a mapping function between the feature sequence of the source speaker and the speaker-dependent acoustic model of the target speaker, and mapping each utterance from each speaker in a training set using the mapping function to multiple selected target speakers in the training set.. .
International Business Machines Corporation

Speech recognition support for remote applications and desktops

An application may be hosted for utilization by a remote computing platform. User interface (ui) elements of a ui generated by the hosted application may be identified.
Citrix Systems, Inc.

Frequency warping in a speech recognition system

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for receiving a sequence representing an utterance, the sequence comprising a plurality of audio frames; determining one or more warping factors for each audio frame in the sequence using a warping neural network; applying, for each audio frame, the one or more warping factors for the audio frame to the audio frame to generate a respective modified audio frame, wherein the applying comprises using at least one of the warping factors to scale a respective frequency of the audio frame to a new respective frequency in the respective modified audio frame; and decoding the modified audio frames using a decoding neural network, wherein the decoding neural network is configured to output a word sequence that is a transcription of the utterance.. .
Google Inc.

Computer-implemented efficient voice transcription

A computer-implemented system and method for efficient voice transcription is provided. A verbal message is processed by splitting the verbal message into segments and generating text for each of the segments via automated speech recognition.
Intellisist, Inc.

Insertion of characters in speech recognition

One embodiment provides a method, including: receiving, from an audio capture device, speech input; converting, using a processor, the speech input to machine text; receiving, from an alternate input source, an input comprising at least one character; identifying, using a processor, a location associated with the machine text to insert the at least one character; and inserting, using a processor, the at least one character at the location identified. Other aspects are described and claimed..
Lenovo (singapore) Pte. Ltd.

System and learning alternate pronunciations for speech recognition

A system and method for learning alternate pronunciations for speech recognition is disclosed. Alternative name pronunciations may be covered, through pronunciation learning, that have not been previously covered in a general pronunciation dictionary.
Interactive Intelligence Group, Inc.

Method and device for updating language model and performing speech recognition based on language model

A method of updating a grammar model used during speech recognition includes obtaining a corpus including at least one word, obtaining the at least one word from the corpus, splitting the at least one obtained word into at least one segment, generating a hint for recombining the at least one segment into the at least one word, and updating the grammar model by using at least one segment comprising the hint.. .
Samsung Electronics Co., Ltd.



Speech Recognition topics:
  • Speech Recognition
  • Communications
  • Computing Device
  • Heterogeneous
  • Conditional
  • Transcription
  • False Positive
  • Application Control
  • Natural Language
  • Embedded System
  • Electronic Device
  • Constraints
  • Central Processing Unit
  • Demultiplex
  • Interactive


  • Follow us on Twitter
    twitter icon@FreshPatents

    ###

    This listing is a sample listing of patent applications related to Speech Recognition for is only meant as a recent sample of applications filed, not a comprehensive history. There may be associated servicemarks and trademarks related to these patents. Please check with patent attorney if you need further assistance or plan to use for business purposes. This patent data is also published to the public by the USPTO and available for free on their website. Note that there may be alternative spellings for Speech Recognition with additional patents listed. Browse our RSS directory or Search for other possible listings.


    0.4726

    file did exist - 2859

    2 - 1 - 53