
Speech Recognition

Speech Recognition-related patent applications - as published by the U.S. Patent and Trademark Office (USPTO).


Scripting support for data identifiers, voice recognition and speech in a telnet session
Crimson Corporation
February 15, 2018 - N°20180048699

Methods of adding data identifiers and speech/voice recognition functionality are disclosed. A telnet client runs one or more scripts that add data identifiers to data fields in a telnet session. The input data is inserted in the corresponding fields based on data identifiers. Scripts run only on the telnet client without modifications to the server applications. Further disclosed are ...
Training deep neural network for acoustic modeling in speech recognition
International Business Machines Corporation
February 15, 2018 - N°20180047413

A method is provided for training a deep neural network (DNN) for acoustic modeling in speech recognition. The method includes reading central frames and side frames as input frames from a memory. The side frames are preceding side frames preceding the central frames and/or succeeding side frames succeeding the central frames. The method further includes executing pre-training for only ...
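As a rough illustration of the input arrangement this abstract mentions (a central frame combined with preceding and succeeding side frames), the following is a minimal sketch of generic context-frame splicing. It is not taken from the patent application: the function name `splice_frames`, the `left`/`right` parameters, and the edge-padding rule are all assumptions made for the example.

```python
from typing import List, Sequence


def splice_frames(
    frames: Sequence[Sequence[float]], left: int, right: int
) -> List[List[float]]:
    """Build one input vector per central frame by concatenating `left`
    preceding side frames, the central frame, and `right` succeeding
    side frames.

    Assumption: at the edges of the utterance, missing side frames are
    filled by repeating the first or last frame.
    """
    if not frames:
        return []
    last = len(frames) - 1
    spliced: List[List[float]] = []
    for t in range(len(frames)):
        window: List[float] = []
        for offset in range(-left, right + 1):
            # Clamp the index so edge frames reuse the nearest real frame.
            idx = min(max(t + offset, 0), last)
            window.extend(frames[idx])
        spliced.append(window)
    return spliced


# Three toy 2-dimensional feature frames (hypothetical values).
frames = [[0.0, 0.1], [1.0, 1.1], [2.0, 2.1]]
inputs = splice_frames(frames, left=1, right=1)
```

Each entry of `inputs` has `(left + 1 + right)` times the original feature dimension, which is the usual shape fed to a frame-level acoustic model.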
Voice print identification portal
February 15, 2018 - N°20180047397

Systems and methods providing for a dual use voice analysis system are disclosed herein. Speech recognition is achieved by comparing characteristics of words spoken by a speaker to one or more templates of human language words. Speaker identification is achieved by comparing characteristics of a speaker's speech to one or more templates, or voice prints. The system is adapted to ...
Speech Recognition Patent Pack
Download 299+ patent application PDFs
Speech Recognition Patent Applications
Download 299+ Speech Recognition-related PDFs
For professional research & prior art discovery
  • 299+ full patent PDF documents of Speech Recognition-related inventions.
  • Exact USPTO filing data with full-text, images, drawings & claims.
  • Index pages: Table View and Image-Grid View layouts. All images in each PDF.
Hybrid phoneme, diphone, morpheme, and word-level deep neural networks
Apptek, Inc.
February 15, 2018 - N°20180047385

An approach of hybrid frame, phone, diphone, morpheme, and word-level deep neural networks (DNN) in model training and applications is described. The approach can be applied to many applications. The approach is based on a regular ASR system, which can be based on Gaussian mixture models (GMM) or DNN. In the first step, a regular ASR model is trained. All ...
Data processing method and live broadcasting method and device
Alibaba Group Holding Limited
February 08, 2018 - N°20180041783

Data processing methods, live broadcasting methods and devices are disclosed. An example data processing method may comprise converting audio and video data into broadcast data in a predetermined format, and performing speech recognition on audio data in the audio and video data, and adding the text information obtained from speech recognition into the broadcast data. In real time, text information ...
Method and apparatus for identifying acoustic background environments based on time and speed to enhance ...
Nuance Communications, Inc.
February 08, 2018 - N°20180040318

Disclosed are systems, methods, and computer readable media for identifying an acoustic environment of a caller. The method embodiment comprises analyzing acoustic features of a received audio signal from a caller, receiving meta-data information based on a previously recorded time and speed of the caller, classifying a background environment of the caller based on the analyzed acoustic features and the ...
Knowledge sharing based on meeting information
Audible, Inc.
February 08, 2018 - N°20180039634

Features are disclosed for automatically facilitating knowledge sharing using information collected during meetings. Collected information may include both the content and context of a meeting. The meeting content may comprise text collected by automatic speech recognition or entered manually at a user device. The meeting context may comprise information such as the identities of meeting participants and subject matter from ...
System and method for speech-enabled access to media content by a ranked normalized weighted graph ...
Nuance Communications, Inc.
February 08, 2018 - N°20180039481

Disclosed herein are systems, methods, and computer-readable storage media for generating a speech recognition model for a media content retrieval system. The method causes a computing device to retrieve information describing media available in a media content retrieval system, construct a graph that models how the media are interconnected based on the retrieved information, rank the information describing the media ...
Distinguishing user speech from background speech in speech-dense environments
Vocollect, Inc.
February 01, 2018 - N°20180033454

A device, system, and method whereby a speech-driven system can distinguish speech obtained from users of the system from other speech spoken by background persons, as well as from background speech from public address systems. In one aspect, the present system and method prepares, in advance of field-use, a voice-data file which is created in a training environment. The training ...
Speech recognition method, speech wakeup apparatus, speech recognition apparatus, and terminal
Huawei Technologies Co., Ltd.
February 01, 2018 - N°20180033436

Embodiments of the present invention provide a speech recognition method and a terminal. The method includes: listening, by a speech wakeup apparatus, to speech information in a surrounding environment; when determining that the speech information obtained by listening matches a speech wakeup model, buffering, by the speech wakeup apparatus, speech information, of first preset duration, obtained by listening, and sending ...
Information processing system and information processing method
Sony Corporation
February 01, 2018 - N°20180033430

[Object] It is desirable to provide a technology capable of flexibly starting speech recognition processing in accordance with a situation. [Solution] Provided is an information processing system including: an output controller that causes an output portion to output a start condition for speech recognition processing to be performed by a speech recognition portion on sound information input from a sound ...
Speech recognition transformation system
Samsung Electronics Co., Ltd.
February 01, 2018 - N°20180033427

A speech recognition method may include preprocessing a first signal to generate a second signal, where the first signal corresponds to an audio signal that includes at least one voice audio signal generated by a speaker, extracting a feature point associated with the second signal and converting the second signal into a third signal by converting the feature point using ...
Acoustic model training using corrected terms
Google Inc.
February 01, 2018 - N°20180033426

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for speech recognition. One of the methods includes receiving first audio data corresponding to an utterance; obtaining a first transcription of the first audio data; receiving data indicating (i) a selection of one or more terms of the first transcription and (ii) one or more of replacement terms; ...
Circuit and method for speech recognition
Dolphin Integration
January 25, 2018 - N°20180025730

The invention concerns a circuit for speech recognition comprising: a voice detection circuit configured to detect, based on at least one input parameter, the presence of a voice signal in an input audio signal and to generate an activation signal on each voice detection event; a speech recognition circuit configured to be activated by the activation signal and to perform ...
Audio-visual speech recognition with scattering operators
Nuance Communications, Inc.
January 25, 2018 - N°20180025729

Aspects described herein are directed towards methods, computing devices, systems, and computer-readable media that apply scattering operations to extracted visual features of audiovisual input to generate predictions regarding the speech status of a subject. Visual scattering coefficients generated according to one or more aspects described herein may be used as input to a neural network operative to generate the predictions ...
System and method for enhancing speech recognition accuracy using weighted grammars based on user profile ...
Nuance Communications, Inc.
January 25, 2018 - N°20180025722

Disclosed herein are systems, computer-implemented methods, and computer-readable media for enhancing speech recognition accuracy. The method includes dividing a system dialog turn into segments based on timing of probable user responses, generating a weighted grammar for each segment, exclusively activating the weighted grammar generated for a current segment of the dialog turn during the current segment of the dialog turn, ...
Automatic speech recognition using multi-dimensional models
Google Inc.
January 25, 2018 - N°20180025721

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for automatic speech recognition using multi-dimensional models. In some implementations, audio data that describes an utterance is received. A transcription for the utterance is determined using an acoustic model that includes a neural network having first memory blocks for time information and second memory blocks for frequency ...
Optimizations to decoding of WFST models for automatic speech recognition
Intel Corporation
January 25, 2018 - N°20180025720

A method in a computing device for decoding a weighted finite state transducer (WFST) for automatic speech recognition is described. The method includes sorting a set of one or more WFST arcs based on their arc weight in ascending order. The method further includes iterating through each arc in the sorted set of arcs according to the ascending order until ...
LED light bulb, lamp fixture with self-networking intercom, system and method therefore
Athena Patent Development LLC.
January 18, 2018 - N°20180020530

A networked light for illumination and intercom for communications in a single housing, with hands-free voice command and control. The system is in a housing configured as a conventional-looking lamp, bulb, fixture, or lighting device, suitable for direct replacement of the conventional illuminating devices typically found in homes or buildings. A network of such voice command and control systems may be further ...
Call forwarding to unavailable party based on artificial intelligence
Circle River, Inc.
January 18, 2018 - N°20180018969

A called party indicates that he or she is unavailable to receive a call. However, by way of a combination or any one of determining aspects of who the caller is, where the caller is located, what he is speaking about, or the like as well as comparing this to prior calls, the call might be sent to a ...
Techniques to provide a standard interface to a speech recognition platform
Microsoft Technology Licensing, LLC
January 18, 2018 - N°20180018968

Techniques and systems to provide speech recognition services over a network using a standard interface are described. In an embodiment, a technique includes accepting a speech recognition request that includes at least audio input, via an application program interface (API). The speech recognition request may also include additional parameters. The technique further includes performing speech recognition on the audio according ...