
Corpus

Corpus-related patent applications - as published by the U.S. Patent and Trademark Office (USPTO).


Method for speaker recognition and apparatus for speaker recognition
Fujitsu Limited
October 12, 2017 - N°20170294191

The present invention discloses a method for speaker recognition and an apparatus for speaker recognition. The method for speaker recognition comprises: extracting, from a speaker-to-be-recognized corpus, voice characteristics of a speaker to be recognized; obtaining a speaker-to-be-recognized model based on the extracted voice characteristics of the speaker to be recognized, a universal background model (UBM) reflecting distribution of the voice ...
Evaluating text classifier parameters based on semantic features
ABBYY InfoPoisk LLC
October 12, 2017 - N°20170293687

Systems and methods for evaluating text classifier parameters based on semantic features. An example method comprises: performing a semantico-syntactic analysis of a natural language text of a corpus of natural language texts to produce a semantic structure representing a set of semantic classes; identifying a natural language text feature to be extracted using a set of values of a plurality ...
Aggregating results from named entity recognition services
SAP SE
October 12, 2017 - N°20170293682

An aggregation service aggregates extraction results from diverse named entity recognition (“NER”) services, which can help improve the quality of extracted information. In some cases, the aggregation service considers differences in entity type classifications when aggregating extraction results from different NER services. The aggregation service can also consider performance characteristics (e.g., error rates) for the ...
Corpus Patent Pack
Download 184+ patent application PDFs
Corpus Patent Applications
Download 184+ Corpus-related PDFs
For professional research & prior art discovery
  • 184+ full patent PDF documents of Corpus-related inventions.
  • Exact USPTO filing data with full-text, images, drawings & claims.
  • Index pages: Table View and Image-Grid View layouts. All images in each PDF.
Systems and methods to develop training set of data based on resume corpus
Facebook, Inc.
October 05, 2017 - N°20170286914

Systems, methods, and non-transitory computer readable media are configured to acquire a resume corpus. The resume corpus is processed to generate resume tokens. A machine learning model is trained based on the resume tokens. The machine learning model is applied to recommend a job classification based on evaluation data.
Method of automated discovery of new topics
Qbase, LLC
October 05, 2017 - N°20170286837

The present disclosure relates to a method for performing automated discovery of new topics from unlimited documents related to any subject domain, employing a multi-component extension of latent Dirichlet allocation (MC-LDA) topic models, to discover related topics in a corpus. The resulting data may contain millions of term vectors from any subject domain identifying the most distinguished co-occurring topics that ...
Analyzing concepts over time
International Business Machines Corporation
October 05, 2017 - N°20170286833

A method and apparatus are provided for automatically generating and processing first and second concept vector sets extracted, respectively, from a first set of concept sequences and from a second, temporally separated set of concept sequences by performing a natural language processing (NLP) analysis of the first concept vector set and second concept vector set to detect changes in the corpus over ...
Analyzing concepts over time
International Business Machines Corporation
October 05, 2017 - N°20170286831

A method and apparatus are provided for automatically generating and processing first and second concept vector sets extracted, respectively, from a first set of concept sequences and from a second, temporally separated set of concept sequences by performing a natural language processing (NLP) analysis of the first concept vector set and second concept vector set to detect changes in the corpus over ...
Real time video summarization
Intel Corporation
October 05, 2017 - N°20170286777

System, apparatus, method, and computer readable media for on-the-fly captured video summarization. A video stream is incrementally summarized in concurrence with generation of the stream by a camera module. Saliency of the video stream summary is maintained as the stream evolves by updating the summary to include only the most significant frames. In one exemplary embodiment, saliency is determined by ...
Automatic generation of an executive summary for a medical event in an electronic medical record
Microsoft Technology Licensing, LLC
October 05, 2017 - N°20170286601

Described herein are technologies pertaining to automatic generation of an executive summary (explanation) of a medical event in an electronic medical record (EMR) of a patient. A medical event in the EMR is automatically identified, and a search is conducted over a document corpus based upon the identified medical event. A document retrieved as a result of the search is ...
Scalable mining of trending insights from text
LinkedIn Corporation
October 05, 2017 - N°20170286531

A system and method for identifying trending topics in a document corpus are provided. First, multiple topics are identified, some of which topics may be filtered or removed based on co-occurrence. Then, for each remaining topic, a frequency of the topic in the document corpus is determined, one or more frequencies of the topic in one or more other document ...
System, method, and recording medium for natural language learning
International Business Machines Corporation
October 05, 2017 - N°20170286403

A natural language learning method, system, and non-transitory computer readable medium include analyzing a corpus of sentences stored in a database to identify an internal structure of words in the corpus of sentences, creating a plurality of new words that are a combination of the internal structure of a word of the words in the corpus of sentences and the ...
System, method, and recording medium for regular rule learning
International Business Machines Corporation
October 05, 2017 - N°20170286400

A regular rule learning method, system, and non-transitory computer readable medium, include an analyzing circuit configured to analyze a corpus of sentences stored in a database to discover lexical features and conjunctively create a regular set of rules based on the discovered lexical features and syntactical features.
System, method, and recording medium for corpus pattern paraphrasing
International Business Machines Corporation
October 05, 2017 - N°20170286399

A corpus pattern paraphrasing method, system, and non-transitory computer readable medium, include an analyzing circuit configured to analyze a corpus of sentences stored in a database to determine regular structures including a plurality of substitute words for verbs expressed as patterns and apply deep learning of the regular structures over the patterns, a representative word determining circuit configured to determine ...
Language modeling based on spoken and unspeakable corpuses
Microsoft Technology Licensing, LLC
September 21, 2017 - N°20170270912

A computer system for language modeling may collect training data from one or more information sources, generate a spoken corpus containing text of transcribed speech, and generate a typed corpus containing typed text. The computer system may derive feature vectors from the spoken corpus, analyze the typed corpus to determine feature vectors representing items of typed text, and generate an ...
Using paraphrase metrics for answering questions
International Business Machines Corporation
September 21, 2017 - N°20170270191

A mechanism is provided in a data processing system for using paraphrase metrics for answering questions. The mechanism receives an input question and generates a candidate answer from a corpus of information. The candidate answer has a supporting passage from the corpus of information. The mechanism divides the input question into a first sequence of tokens and divides the supporting ...
Dialog corpus collecting apparatus, method and program
Kabushiki Kaisha Toshiba
September 21, 2017 - N°20170270094

According to one embodiment, a dialog corpus collecting apparatus includes an assigning unit, an extractor, and a controller. The assigning unit assigns a worker to a second role in accordance with the execution state with respect to a task in which a worker is assigned to a first role. The extractor extracts current candidate responses in reply to a choice ...
Textual information extraction, parsing, and inferential analysis
Inferlink Corporation
September 14, 2017 - N°20170262430

Textual information extraction, parsing, and inferential analysis systems and methods are provided herein. An example method includes extracting content for each of a plurality of types from a corpus of textual information, the plurality of types corresponding to segments of an inference scheme, the inference scheme including a dependency that orders the segments together so as to create a summation ...
Cosmetic element and a method for making such a cosmetic element
Chromavis S.p.A.
September 14, 2017 - N°20170258200

A cosmetic element (3), in particular a make-up product, formed by a first cosmetic product (4) and at least a second cosmetic product (5) different from the first one; the first cosmetic product (4) is a matrix inside of which a plurality of macroscopic corpuscular units (6) are sunk, formed by said at least one second cosmetic product (5).
Domain-specific negative media search techniques
Giant Oak, Inc.
September 07, 2017 - N°20170255700

In some implementations, systems and methods that are capable of customizing negative media searches using domain-specific search indexes are described. Data indicating a search query associated with a negative media search for an entity and a corpus of documents to be searched are obtained. Content from a particular collection of documents from among the corpus of documents is obtained and ...
Location specific content visualizations
Google Inc.
September 07, 2017 - N°20170255594

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for evaluating digital content. In one aspect, a system includes a distributed computing system that accesses the third-party corpus database to evaluate the various third-party content and transmit, to a user device, digital data that produce visualizations of at least a portion of a set of the ...
Realtime data stream cluster summarization and labeling system
Uda, LLC
September 07, 2017 - N°20170255536

A method is provided for automatically discovering topics in electronic posts, such as social media posts. The method includes receiving a corpus that includes a plurality of electronic posts. The method further includes identifying a plurality of candidate terms within the corpus and selecting, as a trimmed lexicon, a subset of the plurality of candidate terms using predefined criteria. The ...