Images List Premium Download Classic

Speech Recognition

Speech Recognition-related patent applications - as published by the U.S. Patent and Trademark Office (USPTO).


loading
Systems and method for performing speech recognition
Honeywell International Inc.
October 12, 2017 - N°20170294187

A system and method for performing speech recognition. A speech recognition engine includes a plurality of grammar paths each defining a recognized phrase. The grammar paths each have at least two nodes that are connected by a recognized word. An input device receives a user specified input that corresponds to the recognized word. A microphone receives a user phrase and ...
Method for scoring in an automatic speech recognition system
Nuance Communications, Inc.
October 12, 2017 - N°20170294186

A system and method for speech recognition is provided. Embodiments may include receiving an audio signal at a first deep neural network (“dnn”) associated with a computing device. Embodiments may further include receiving the audio signal at a second deep neural network (“dnn”) associated with a computing device, wherein the second deep ...
Noise suppressing apparatus, speech recognition apparatus, and noise suppressing method
Fujitsu Limited
October 05, 2017 - N°20170287501

A noise suppressing apparatus calculates a phase difference on the basis of a first and second sound signal obtained by a microphone array; calculates a first sound arrival rate on the basis of a first phase difference area and the phase difference and a second sound arrival rate on the basis of a second phase difference area and the phase ...
Speech Recognition Patent Pack
Download 299+ patent application PDFs
Speech Recognition Patent Applications
Download 299+ Speech Recognition-related PDFs
For professional research & prior art discovery
inventor
  • 299+ full patent PDF documents of Speech Recognition-related inventions.
  • Exact USPTO filing data with full-text, images, drawings & claims.
  • Index pages: Table View and Image-Grid View layouts. All images in each PDF.
Improving automatic speech recognition of multilingual named entities
Nuance Communications, Inc.
October 05, 2017 - N°20170287474

Methods and systems are provided for improving speech recognition of multilingual named entities. In some embodiments, a list comprising a plurality of named entities may be accessed by a computing device. A first named entity represented in the native language may be compared with the first named entity represented in the foreign language. One or more words that appear in ...
Speech recognition apparatus and speech recognition method
Mitsubishi Electric Corporation
October 05, 2017 - N°20170287472

An apparatus includes a lip image recognition unit 103 to recognize a user state from image data which is information other than speech; a non-speech section deciding unit 104 to decide from the recognized user state whether the user is talking; a speech section detection threshold learning unit 106 to set a first speech section detection threshold (ssdt) from speech data when decided ...
Acoustic model training
International Business Machines Corporation
October 05, 2017 - N°20170287469

A method, executed by a computer, includes receiving a channel recording corresponding to a conversation, receiving a transcription for the conversation, generating a conversation-specific language model for the conversation using the transcription, and conducting speech recognition on the channel recording using the conversation-specific language model to provide time boundaries and written language corresponding to utterances within the channel recording. The ...
Speech Recognition Patent Pack
Download 299+ patent application PDFs
Speech Recognition Patent Applications
Download 299+ Speech Recognition-related PDFs
For professional research & prior art discovery
inventor
  • 299+ full patent PDF documents of Speech Recognition-related inventions.
  • Exact USPTO filing data with full-text, images, drawings & claims.
  • Index pages: Table View and Image-Grid View layouts. All images in each PDF.
Secure nonscheduled video visitation system
Global Tel *link Corporation
September 28, 2017 - N°20170280100

Described are methods and systems in which the censorship and supervision tasks normally performed by secured facility personnel are augmented or automated entirely by a secure nonscheduled video visitation system. In embodiments, the secure nonscheduled video visitation system performs voice biometrics, speech recognition, non-verbal audio classification, fingerprint and other biometric authentication, image object classification, facial recognition, body joint location determination ...
Characterizing, selecting and adapting audio and acoustic training data for automatic speech recognition systems
Nuance Communications, Inc.
September 28, 2017 - N°20170278527

A system for and method of characterizing a target application acoustic domain analyzes one or more speech data samples from the target application acoustic domain to determine one or more target acoustic characteristics, including a codec type and bit-rate associated with the speech data samples. The determined target acoustic characteristics may also include other aspects of the target speech data ...
Technologies for automatic speech recognition using articulatory parameters
Intel Corporation
September 28, 2017 - N°20170278517

Technologies for automatic speech recognition using articulatory parameters are disclosed. An automatic speech recognition device may capture speech data from a speaker and also capture an image of the speaker. The automatic speech recognition device may determine one or more articulatory parameters based on the image, such as such as a jaw angle, a lip protrusion, or a lip height, ...
Adaptive audio enhancement for multichannel speech recognition
Google Inc.
September 28, 2017 - N°20170278513

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for neural network adaptive beamforming for multichannel speech recognition are disclosed. In one aspect, a method includes the actions of receiving a first channel of audio data corresponding to an utterance and a second channel of audio data corresponding to the utterance. The actions further include generating ...
Server-side asr adaptation to speaker, device and noise condition via non-asr audio transmission
Nuance Communications, Inc.
September 28, 2017 - N°20170278511

A mobile device is adapted for automatic speech recognition (asr). A user interface for interaction with a user includes an input microphone for obtaining speech inputs from the user for automatic speech recognition, and an output interface for system output to the user based on asr results that correspond to the speech input. A local controller obtains a sample of ...
Testing words in a pronunciation lexicon
International Business Machines Corporation
September 28, 2017 - N°20170278509

A method, for testing words defined in a pronunciation lexicon used in an automatic speech recognition (asr) system, is provided. The method includes: obtaining test sentences which can be accepted by a language model used in the asr system. The test sentences cover words defined in the pronunciation lexicon. The method further includes obtaining variations of speech data corresponding to ...
Finding of a target document in a spoken language processing
International Business Machines Corporation
September 28, 2017 - N°20170278508

Methods and systems are provided for finding a target document in spoken language processing. One of the methods includes calculating a score of each document in a document set, in response to a receipt of first n words of output of an automatic speech recognition (asr) system, n being equal or greater than zero. The method further includes reading a ...
Speech Recognition Patent Pack
Download 299+ patent application PDFs
Speech Recognition Patent Applications
Download 299+ Speech Recognition-related PDFs
For professional research & prior art discovery
inventor
  • 299+ full patent PDF documents of Speech Recognition-related inventions.
  • Exact USPTO filing data with full-text, images, drawings & claims.
  • Index pages: Table View and Image-Grid View layouts. All images in each PDF.
Digital video synthesis
Al Levy Technologies Ltd.
September 21, 2017 - N°20170270950

A method which includes: detecting phrases in a transcript of an audiovisual file; applying a speech recognition algorithm to the audiovisual file and to a list of words of the phrase, to output a temporal location of each of the words that are uttered in the audio channel; compiling a list of sub-phrases of each of the phrases; creating a ...
Speech recognition
Cirrus Logic International Semiconductor Ltd.
September 21, 2017 - N°20170270920

A speech recognition system comprises: an input, for receiving an input signal from at least one microphone; a first buffer, for storing the input signal; a noise reduction block, for receiving the input signal and generating a noise reduced input signal; a speech recognition engine, for receiving either the input signal output from the first buffer or the noise reduced ...
Anchored speech detection and speech recognition
Amazon Technologies, Inc.
September 21, 2017 - N°20170270919

A system configured to process speech commands may classify incoming audio as desired speech, undesired speech, or non-speech. Desired speech is speech that is from a same speaker as reference speech. The reference speech may be obtained from a configuration session or from a first portion of input speech that includes a wakeword. The reference speech may be encoded using ...
Negative n-gram biasing
Google Inc.
September 21, 2017 - N°20170270918

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for performing dynamic, stroke-based alignment of touch displays. In one aspect, a method includes obtaining a candidate transcription that an automated speech recognizer generates for an utterance, determining a particular context associated with the utterance, determining that a particular n-gram that is included in the candidate transcription ...
Pre-training apparatus and method for speech recognition
Electronics And Telecommunications Research Institute
September 21, 2017 - N°20170270910

A pre-training apparatus and method for recognition speech, which initialize, by layers, a deep neural network to correct a node connection weight. The pre-training apparatus for speech recognition includes an input unit configured to receive speech data, a model generation unit configured to initialize a connection weight of a deep neural network, based on the speech data, and an output ...
Root cause analysis and recovery systems and methods
Gm Global Technology Operations Llc
September 21, 2017 - N°20170270908

Methods and systems are provided for recovering from an error in a speech recognition system. In one embodiment, a method includes: receiving, by a processor, a first command recognized from a first speech utterance by a first language model; receiving, by the processor, a second command recognized from the first speech utterance by a second language model; determining, by the ...
Apparatus, method, and computer program product for correcting speech recognition error
Kabushiki Kaisha Toshiba
September 21, 2017 - N°20170270086

An apparatus for correcting a character string in a text of an embodiment includes a first converter, a first output unit, a second converter, an estimation unit, and a second output unit. The first converter recognizes a first speech of a first speaker, and converts the first speech to a first text. The first output unit outputs a first caption ...
Multi-pass speech activity detection strategy to improve automatic speech recognition
International Business Machines Corporation
September 14, 2017 - N°20170263269

An automatic speech recognition system and a method performed by an automatic speech recognition system are provided. The method includes performing at least two passes of speech activity detection on an acoustic utterance uttered by a speaker. The at least two passes include an initial pass and a subsequent pass. The method further includes estimating at least one of feature ...
Loading