Follow us on Twitter
twitter icon@FreshPatents


Speech Recognition patents

      

This page is updated frequently with new Speech Recognition-related patent applications.

SALE: 220+ Speech Recognition-related patent PDFs



 Automatic speech recognition confidence classifier patent thumbnailAutomatic speech recognition confidence classifier
The described technology provides normalization of speech recognition confidence classifier (cc) scores that maintains the accuracy of acceptance metrics. A speech recognition cc scores quantitatively represents the correctness of decoded utterances in a defined range (e.g., [0,1]).
Microsoft Technology Licensing, Llc


 Automatic speech recognition with detection of at least one contextual element, and application management and maintenance of aircraft patent thumbnailAutomatic speech recognition with detection of at least one contextual element, and application management and maintenance of aircraft
An automatic speech recognition with detection of at least one contextual element, and application to aircraft flying and maintenance are provided. The automatic speech recognition device comprises a unit for acquiring an audio signal, a device for detecting the state of at least one contextual element, and a language decoder for determining an oral instruction corresponding to the audio signal.
Dassault Aviation


 Apparatus and  generating acoustic model, and  speech recognition patent thumbnailApparatus and generating acoustic model, and speech recognition
Described are an apparatus and method for generating to generate an acoustic model. The apparatus and method include a processor a processor configured to calculate a noise representation that represents noise data by using a noise model, and generate the acoustic model through training using training noisy speech data, which comprises speech data and the noise data, a string of phonemes corresponding to the speech data, and the noise representation..
Samsung Electronics Co., Ltd.


 Methods and  speech recognition using a garbage model patent thumbnailMethods and speech recognition using a garbage model
Methods and apparatus for performing speech recognition using a garbage model. The method comprises receiving audio comprising speech and processing at least some of the speech using a garbage model to produce a garbage speech recognition result.
Nuance Communication, Inc


 Microphone placement for sound source direction estimation patent thumbnailMicrophone placement for sound source direction estimation
Architectures of numbers of microphones and their positioning in a device for sound source direction estimation and source separation are presented. The directions of sources are front, back, left, right, top, and bottom of the device, and can be determined by amplitude and phase differences of microphone signals with proper microphone positioning.
Microsoft Technology Licensing, Llc


 Method and device for speech recognition patent thumbnailMethod and device for speech recognition
Embodiments of the present disclosure provide a method and device for speech recognition. The solution comprises: receiving a first speech signal issued by a user; performing analog to digital conversion on the first speech signal to generate a first digital signal after the analog to digital conversion; extracting a first speech parameter from the first digital signal, the first speech parameter describing a speech feature of the first speech signal; if the first speech parameter coincides with a first prestored speech parameter in a sample library, executing control signalling instructed by the first digital signal, the sample library prestoring prestored speech parameters of n users, n≧1.
Beijing Boe Multimedia Technology Co., Ltd.


 Speech recognition apparatus and method patent thumbnailSpeech recognition apparatus and method
An apparatus includes a language model group identifier configured to identify a language model group based on determined characteristic data of a user, and a language model generator configured to generate a user-based language model by interpolating a general language model for speech recognition based on the identified language model group.. .
Samsung Electronics Co., Ltd.


 Method and system for remotely training and commanding the speech recognition system on a cockpit via a carry-on-device in a connected aircraft patent thumbnailMethod and system for remotely training and commanding the speech recognition system on a cockpit via a carry-on-device in a connected aircraft
A method for implementing a speaker-independent speech recognition system with reduced latency is provided. The method includes capturing voice data at a carry-on-device from a user during a pre-flight check-in performed by the user for an upcoming flight; extracting features associated with the user from the captured voice data at the carry-on-device; uplinking the extracted features to the speaker-independent speech recognition system onboard the aircraft; and adapting the extracted features with an acoustic feature model of the speaker-independent speech recognition system..
Honeywell International Inc.


 Adapting a speech system to user pronunciation patent thumbnailAdapting a speech system to user pronunciation
A system and method of adapting a speech system includes the steps of: receiving confirmation of a phonetic transcription of one or more names, receiving confirmation of a selected stored text result, and storing the phonetic transcription with the selected stored text result using an automatic speech recognition (asr) system, a text-to-speech (tts) system, or both.. .
Gm Global Technology Operations Llc


 Enhanced speech endpointing patent thumbnailEnhanced speech endpointing
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for receiving audio data including an utterance, obtaining context data that indicates one or more expected speech recognition results, determining an expected speech recognition result based on the context data, receiving an intermediate speech recognition result generated by a speech recognition engine, comparing the intermediate speech recognition result to the expected speech recognition result for the audio data based on the context data, determining whether the intermediate speech recognition result corresponds to the expected speech recognition result for the audio data based on the context data, and setting an end of speech condition and providing a final speech recognition result in response to determining the intermediate speech recognition result matches the expected speech recognition result, the final speech recognition result including the one or more expected speech recognition results indicated by the context data.. .
Google Inc.


Enhanced speech endpointing

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for receiving audio data including an utterance, obtaining context data that indicates one or more expected speech recognition results, determining an expected speech recognition result based on the context data, receiving an intermediate speech recognition result generated by a speech recognition engine, comparing the intermediate speech recognition result to the expected speech recognition result for the audio data based on the context data, determining whether the intermediate speech recognition result corresponds to the expected speech recognition result for the audio data based on the context data, and setting an end of speech condition and providing a final speech recognition result in response to determining the intermediate speech recognition result matches the expected speech recognition result, the final speech recognition result including the one or more expected speech recognition results indicated by the context data.. .
Google Inc.

Systems, computer-implemented methods, and tangible computer-readable storage media for transcription alignment

Disclosed herein are systems, computer-implemented methods, and tangible computer-readable storage media for captioning a media presentation. The method includes receiving automatic speech recognition (asr) output from a media presentation and a transcription of the media presentation.
At&t Intellectual Property I, L.p.

Audio-visual speech recognition with scattering operators

Aspects described herein are directed towards methods, computing devices, systems, and computer-readable media that apply scattering operations to extracted visual features of audiovisual input to generate predictions regarding the speech status of a subject. Visual scattering coefficients generated according to one or more aspects described herein may be used as input to a neural network operative to generate the predictions regarding the speech status of the subject.
Nuance Communications, Inc.

Building of n-gram language model for automatic speech recognition (asr)

A method, a system, and a computer program product for building an n-gram language model for an automatic speech recognition. The method includes reading training text data and additional text data both for the n-gram language model from a storage, and building the n-gram language model by a smoothing algorithm having discount parameters for n-gram counts.
International Business Machines Corporation

Method and improving a neural network language model, and speech recognition method and apparatus

According to one embodiment, an apparatus for improving a neural network language model of a speech recognition system includes a word classifying unit, a language model training unit and a vector incorporating unit. The word classifying unit classifies words in a lexicon of the speech recognition system.
Kabushiki Kaisha Toshiba

Method and improving a language model, and speech recognition method and apparatus

According to one embodiment, an apparatus for improving a language model of a speech recognition system includes an extracting unit, a classifying unit, and a setting unit. The extracting unit extracts user words from a user document provided by a user.
Kabushiki Kaisha Toshiba

Topic shift detector

Aspects detect or recognize shifts in topics in computer implemented speech recognition processes as a function of mapping keywords to non-verbal cues. An initial topic is mapped to one or more keywords extracted from a first spoken query within a user keyword ontology mapping.
International Business Machines Corporation

Speech recognition apparatus and method

A speech recognition apparatus and method. The speech recognition apparatus includes a first recognizer configured to generate a first recognition result of an audio signal, in a first linguistic recognition unit, by using an acoustic model, a second recognizer configured to generate a second recognition result of the audio signal, in a second linguistic recognition unit, by using a language model, and a combiner configured to combine the first recognition result and the second recognition result to generate a final recognition result in the second linguistic recognition unit and to reflect the final recognition result in the language model.
Samsung Electronics Co., Ltd.

Speech recognition apparatus, vehicle having the speech recognition apparatus, and controlling the vehicle

Disclosed herein are speech recognition apparatuses, vehicles having the speech recognition apparatuses, and methods for controlling vehicles. According to an aspect, a speech recognition apparatus includes a speech input unit configured to receive a speech command from a user, a communication unit configured to receive the result of processing for speech recognition acquired by at least one user terminal located near the user, and a controller configured to compare the result of processing for speech recognition acquired from the speech command received by the speech input unit to the result of processing for speech recognition acquired by the at least one user terminal, thus processing the speech command according to the result of the comparison..
Hyundai Motor Company

Speech recognition system with abbreviated training

A method of adapting a speech recognition system to its user includes gathering information about a user of a speech recognition system, selecting at least a part of a speech model reflecting estimated speech attributes of the user based on the information about the user, running, in the speech recognition system, a speech model including the selected at least a part of a speech model, and training, in the speech recognition system, other parts of the speech model to reflect identified speech attributes of the user.. .
Toyota Motor Engineering & Manufacturing North America, Inc.

Order statistic techniques for neural networks

According to some aspects, a method of classifying speech recognition results is provided, using a neural network comprising a plurality of interconnected network units, each network unit having one or more weight values, the method comprising using at least one computer, performing acts of providing a first vector as input to a first network layer comprising one or more network units of the neural network, transforming, by a first network unit of the one or more network units, the input vector to produce a plurality of values, the transformation being based at least in part on a plurality of weight values of the first network unit, sorting the plurality of values to produce a sorted plurality of values, and providing the sorted plurality of values as input to a second network layer of the neural network.. .
Nuance Communications, Inc.

Adaptation of speech recognition

A method, computer program product, and system for adapting speech recognition of a user's speech is provided. The method includes receiving a first utterance from a user having a duration below a predetermined threshold, identifying at least one further utterance from the user that provides additional information, generating a concatenated utterance by concatenating the first utterance with the at least one further utterance, transmitting the concatenated utterance to a speech recognition server, receiving a transcription of the concatenated utterance from the speech recognition server that includes a transcription of the first utterance, and extracting the transcription of the first utterance from the transcription of the concatenated utterance.
International Business Machines Corporation

Computer-implemented performing distributed speech recognition

A computer-implemented system and method for performing distributed speech recognition is provided. Audio data is collected.
Intellisist, Inc.

Information processing apparatus, control method, and program

There is provided an information processing apparatus, control method, and program capable of notifying a user of a candidate for a response, from the middle of a speech, through a voice u1, the information processing apparatus including: a semantic analysis unit configured to perform semantic analysis on speech text recognized by a speech recognition unit in the middle of a speech; a score calculation unit configured to calculate a score for a response candidate on the basis of a result of the analysis performed by the semantic analysis unit; and a notification control unit configured to perform control to notify of the response candidate, in the middle of the speech, according to the score calculated by the score calculation unit.. .
Sony Corporation

Speech recognition using an operating system hooking component for context-aware recognition models

Inputs provided into user interface elements of an application are observed. Records are made of the inputs and the state(s) the application was in while the inputs were provided.
Mmodal Ip Llc

Data augmentation method based on stochastic feature mapping for automatic speech recognition

A method of augmenting training data includes converting a feature sequence of a source speaker determined from a plurality of utterances within a transcript to a feature sequence of a target speaker under the same transcript, training a speaker-dependent acoustic model for the target speaker for corresponding speaker-specific acoustic characteristics, estimating a mapping function between the feature sequence of the source speaker and the speaker-dependent acoustic model of the target speaker, and mapping each utterance from each speaker in a training set using the mapping function to multiple selected target speakers in the training set.. .
International Business Machines Corporation

Speech recognition support for remote applications and desktops

An application may be hosted for utilization by a remote computing platform. User interface (ui) elements of a ui generated by the hosted application may be identified.
Citrix Systems, Inc.

Frequency warping in a speech recognition system

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for receiving a sequence representing an utterance, the sequence comprising a plurality of audio frames; determining one or more warping factors for each audio frame in the sequence using a warping neural network; applying, for each audio frame, the one or more warping factors for the audio frame to the audio frame to generate a respective modified audio frame, wherein the applying comprises using at least one of the warping factors to scale a respective frequency of the audio frame to a new respective frequency in the respective modified audio frame; and decoding the modified audio frames using a decoding neural network, wherein the decoding neural network is configured to output a word sequence that is a transcription of the utterance.. .
Google Inc.

Computer-implemented efficient voice transcription

A computer-implemented system and method for efficient voice transcription is provided. A verbal message is processed by splitting the verbal message into segments and generating text for each of the segments via automated speech recognition.
Intellisist, Inc.

Insertion of characters in speech recognition

One embodiment provides a method, including: receiving, from an audio capture device, speech input; converting, using a processor, the speech input to machine text; receiving, from an alternate input source, an input comprising at least one character; identifying, using a processor, a location associated with the machine text to insert the at least one character; and inserting, using a processor, the at least one character at the location identified. Other aspects are described and claimed..
Lenovo (singapore) Pte. Ltd.

System and learning alternate pronunciations for speech recognition

A system and method for learning alternate pronunciations for speech recognition is disclosed. Alternative name pronunciations may be covered, through pronunciation learning, that have not been previously covered in a general pronunciation dictionary.
Interactive Intelligence Group, Inc.

Method and device for updating language model and performing speech recognition based on language model

A method of updating a grammar model used during speech recognition includes obtaining a corpus including at least one word, obtaining the at least one word from the corpus, splitting the at least one obtained word into at least one segment, generating a hint for recombining the at least one segment into the at least one word, and updating the grammar model by using at least one segment comprising the hint.. .
Samsung Electronics Co., Ltd.

Communication a smart phone with a text recognition module

A portable device can transmit information through one of a mobile phone network and an internet, wherein the portable device includes a text-based communication module to allow a user may synchronously transmit or receive data through a local area network, wherein the data is text, audio, video or the combination thereof. The text-based communication module of the portable device includes a text-to-speech recognition module used to convert a text data for outputting the text data by vocal, and a read determination module for determining read target terminals and unread target terminals when a user of the portable phone device activates the read determination module..

Business listing search

A method of operating a voice-enabled business directory search system includes receiving category-business pairs, each category-business pair including a business category and a specific business, and establishing a data structure having nodes based on the category-business pairs. Each node of the data structure is associated with one or more business categories and a speech recognition language model for recognizing specific businesses associated with the one or more businesses categories..
Google Inc.

Speech recognition method and mobile terminal

A speech recognition method and a mobile terminal relate to the field of electronic and information technologies, and can flexibly perform speech collection and improve a speech recognition rate. The method includes acquiring, by a mobile terminal, an orientation/motion status of the mobile terminal, and determining, according to the orientation/motion status, a voice collection apparatus for voice collection; acquiring, by the mobile terminal, a speech signal from the voice collection apparatus; and recognizing, by the mobile terminal, the speech signal.
Huawei Technologies Co., Ltd.

Apparatus and acoustic score calculation and speech recognition

An apparatus for calculating acoustic score, a method of calculating acoustic score, an apparatus for speech recognition, a method of speech recognition, and an electronic device including the same are provided. An apparatus for calculating acoustic score includes a preprocessor configured to sequentially extract audio frames into windows and a score calculator configured to calculate an acoustic score of a window by using a deep neural network (dnn)-based acoustic model..
Samsung Electronics Co., Ltd.

Unsupervised training method, training apparatus, and training program for an n-gram language model based upon recognition reliability

A computer-based, unsupervised training method for an n-gram language model includes reading, by a computer, recognition results obtained as a result of speech recognition of speech data; acquiring, by the computer, a reliability for each of the read recognition results; referring, by the computer, to the recognition result and the acquired reliability to select an n-gram entry; and training, by the computer, the n-gram language model about selected one of more of the n-gram entries using all recognition results.. .
International Business Machines Corporation

Speech recognition apparatus and method

A speech recognition apparatus includes a processor configured to recognize a user's speech using any one or combination of two or more of an acoustic model, a pronunciation dictionary including primitive words, and a language model including primitive words; and correct word spacing in a result of speech recognition based on a word-spacing model.. .
Samsung Electronics Co., Ltd.

System and natural language driven search and discovery in large data sources

In some natural language understanding (nlu) applications, results may not be tailored to the user's query. In an embodiment of the present invention, a method includes tagging elements of automated speech recognition (asr) data based on an ontology stored in a memory.
Nuance Communications, Inc.

Vehicle and control method thereof

A vehicle includes: an input unit configured to receive an execution command for speech recognition; a calculator configured to calculate a time in which the vehicle is expected to arrive at an obstacle existing on a road on which the vehicle travels; and a speech recognition controller configured to compare the calculated time in which the vehicle is expected to arrive at the obstacle to a time in which a voice command input is expected to be completed to determine whether to perform dynamic noise removal pre-processing.. .
Hyundai Motor Company

Real-time adaptation of in-vehicle speech recognition systems

A system and method of controlling an automatic speech recognition (asr) system includes: detecting changes in ambient noise via a microphone in a vehicle equipped with the asr system; determining an environmental noise compensation value and a channel bias compensation value based on the detected changes; and applying the environmental noise compensation value and a channel bias compensation value to speech received by the asr system.. .
Gm Global Technology Operations Llc

Interest notification apparatus and method

An apparatus for notification of speech of interest to a user includes a voice analyzer configured to recognize speech, evaluate a relevance between a result of the speech recognition and a determined user's topic of interest, and determine whether to provide a notification; and an outputter configured to, in response to the voice analyzer determining to provide the notification, generate and output a notification message.. .
Samsung Electronics Co., Ltd.

Speech recognition apparatus and method

A speech recognition apparatus includes a converter configured to convert a captured user speech signal into a standardized speech signal format, one or more processing devices configured to apply the standardized speech signal to an acoustic model, and recognize the user speech signal based on a result of application to the acoustic model.. .
Samsung Electronics Co., Ltd.

Layered contextual configuration management system and method and minimized input speech recognition user interface interactions experience

In an effort to customize or enhance software applications, configuration data is often used. Configuration settings that are editable by users need not to be limited to a simple flat entry that can be taken out of context anymore.

Multiple parallel dialogs in smart phone applications

An arrangement is described for conducting natural language dialogs with a user on a mobile device using automatic speech recognition (asr) and multiple different dialog applications. A user interface provides for user interaction with the dialogue applications in natural language dialogs.
Nuance Communications, Inc.

Using word confidence score, insertion and substitution thresholds for selected words in speech recognition

A method and system for improving the accuracy of a speech recognition system using were confidence score (wcs) processing is introduced. Parameters in a decoder are selected to minimize a weighted total error rate, such that deletion errors are weighted more heavily than substitution and insertion errors.
Adacel, Inc.

Speech recognition system and method

A system and a method of speech recognition which enable a spoken language to be automatically identified while recognizing speech of a person who vocalize to effectively process multilingual speech recognition without a separate process for user registration or recognized language setting such as use of a button for allowing a user to manually select a language to be vocalized and support speech recognition of each language to be automatically performed even though persons who speak different languages vocalize by using one terminal to increase convenience of the user.. .
Electronics And Telecommunications Research Institute

Methods employing phase state analysis for use in speech synthesis and recognition

A computer-implemented method for automatically analyzing, predicting, and/or modifying acoustic units of prosodic human speech utterances for use in speech synthesis or speech recognition. Possible steps include: initiating analysis of acoustic wave data representing the human speech utterances, via the phase state of the acoustic wave data; using one or more phase state defined acoustic wave metrics as common elements for analyzing, and optionally modifying, pitch, amplitude, duration, and other measurable acoustic parameters of the acoustic wave data, at predetermined time intervals; analyzing acoustic wave data representing a selected acoustic unit to determine the phase state of the acoustic unit; and analyzing the acoustic wave data representing the selected acoustic unit to determine at least one acoustic parameter of the acoustic unit with reference to the determined phase state of the selected acoustic unit.
Lessac Technologies, Inc.

System and three-way call detection

A system for detecting three-way calls in a monitored telephone conversation includes a speech recognition processor that transcribes the monitored telephone conversation and associates characteristics of the monitored telephone conversation with a transcript thereof, a database to store the transcript and the characteristics associated therewith, and a three-way call detection processor to analyze the characteristics of the conversation and to detect therefrom the addition of one or more parties to the conversation. The system preferably includes at least one domain-specific language model that the speech recognition processor utilizes to transcribe the conversation.
Dsi-iti, Llc

Corrective feedback loop for automated speech recognition

A method for facilitating the updating of a language model includes receiving, at a client device, via a microphone, an audio message corresponding to speech of a user; communicating the audio message to a first remote server; receiving, that the client device, a result, transcribed at the first remote server using an automatic speech recognition system (“asr”), from the audio message; receiving, at the client device from the user, an affirmation of the result; storing, at the client device, the result in association with an identifier corresponding to the audio message; and communicating, to a second remote server, the stored result together with the identifier.. .
Amazon Technologies, Inc.

Method for controlling operation of an agricultural machine and system thereof

A method for controlling operation of an agricultural machine and system thereof are disclosed. The method may comprise providing a portable device that has an input device, a processing unit, a storage unit, an output device, and a transceiver device configured for wireless data transmission; receiving a voice control command over a microphone device of the input device of the portable device; determining command text data from the voice control command by processing the voice control command by a speech recognition application running on the processing unit of the portable device; providing machine control signals assigned to a machine control function in a control device of an agricultural machine located remotely from the portable device; and controlling the operation of the agricultural machine according to the machine control signals..
Kverneland Group Mechatronics B.v.

Speech recognition apparatus, speech recognition method, and electronic device

A speech recognition apparatus includes a probability calculator configured to calculate phoneme probabilities of an audio signal using an acoustic model; a candidate set extractor configured to extract a candidate set from a recognition target list; and a result returner configured to return a recognition result of the audio signal based on the calculated phoneme probabilities and the extracted candidate set.. .
Samsung Electronics Co., Ltd.

Testing words in a pronunciation lexicon

A method, for testing words defined in a pronunciation lexicon used in an automatic speech recognition (asr) system, is provided. The method includes: obtaining test sentences which can be accepted by a language model used in the asr system.
International Business Machines Corporation



Speech Recognition topics:
  • Speech Recognition
  • Communications
  • Computing Device
  • Heterogeneous
  • Conditional
  • Transcription
  • False Positive
  • Application Control
  • Natural Language
  • Embedded System
  • Electronic Device
  • Constraints
  • Central Processing Unit
  • Demultiplex
  • Interactive


  • Follow us on Twitter
    twitter icon@FreshPatents

    ###

    This listing is a sample listing of patent applications related to Speech Recognition for is only meant as a recent sample of applications filed, not a comprehensive history. There may be associated servicemarks and trademarks related to these patents. Please check with patent attorney if you need further assistance or plan to use for business purposes. This patent data is also published to the public by the USPTO and available for free on their website. Note that there may be alternative spellings for Speech Recognition with additional patents listed. Browse our RSS directory or Search for other possible listings.


    0.2174

    file did exist - 2859

    2 - 1 - 53