
Web Crawler

Web Crawler-related patent applications - as published by the U.S. Patent and Trademark Office (USPTO).


Systems and methods for discovery and tracking of web-based advertisements
Pathmatics, Inc.
December 28, 2017 - N°20170372357

Systems and methods are provided for discovering advertisements on publisher web pages and for identifying placement pathways by which discovered advertisements have been placed on the publisher web pages. An advertisement tracking and discovery system may use multiple web crawler applications to explore multiple publisher websites. The web crawler applications may gather advertisement data that includes times associated with each ...
Geospatial web crawler architecture
National Central University
November 23, 2017 - N°20170337205

Architecture for searching geospatial resources is provided. Geospatial web crawlers are used. The architecture comprises a database, a plurality of computers (workers) and a server (master). The master is connected with the database and the workers. By using the concept of web crawler and parallel processing, geospatial resources shared on the internet can be automatically and quickly found in a ...
Systems and methods for generating and maintaining internet user profile data
Pathmatics, Inc.
August 17, 2017 - N°20170236157

Systems and methods are provided for automatically generating and maintaining user profile cookie sets. The user profile cookie sets may be used by a web crawler when gathering data such as advertisement data associated with one or more websites. The cookie sets may be generated by choosing a user profile with a set of user traits, selecting a set of ...
Web Crawler Patent Pack
Download + patent application PDFs
Web Crawler Patent Applications
Download + Web Crawler-related PDFs
For professional research & prior art discovery
inventor
  • + full patent PDF documents of Web Crawler-related inventions.
  • Exact USPTO filing data with full-text, images, drawings & claims.
  • Index pages: Table View and Image-Grid View layouts. All images in each PDF.
Detection of coordinated cyber-attacks
F-Secure Corporation
June 22, 2017 - N°20170180402

A method of detecting coordinated attacks on computers and computer networks via the internet. The method includes using a web crawler to crawl the World Wide Web to identify domains and subdomains and their associated IP addresses, and to identify links between domains and subdomains, and storing the results in a database. When an IP address is identified as malicious ...
Developing an item data model for an item
Wal-Mart Stores, Inc.
June 22, 2017 - N°20170177725

The present invention extends to methods, systems, and computer program products for developing an item data model for an item. Aspects of the invention can automate the process of data collection of “facts” for “items” that information is needed about. Facts can be organized and normalized to eliminate redundant facts, and interpret ...
Detecting disclosed content sources using dynamic steganography
Box, Inc.
May 11, 2017 - N°20170134344

Systems for forensic steganography. A server is interfaced with storage facilities that store an object accessible by two or more users, each of which users are associated with respective profiles comprising one or more user-specific attributes. A method detects a user request to view the object. User-specific attributes are encoded into a steganographic message, which is formatted for saving into ...
Providing cloud-based health-related data analytics services
Fujitsu Limited
April 20, 2017 - N°20170109443

A method to provide health-related data analytics services via a web service may include crawling, via a web crawler, the internet to identify multiple websites with content related to human health. The method may also include obtaining, using text classification, multiple words associated with an occurrence in lives of people and multiple words associated with a health outcome in the ...
Interactive web crawler
Microsoft Technology Licensing, LLC
March 02, 2017 - N°20170061029

The claimed subject matter provides a system or method for web crawling hidden files. An exemplary method comprises loading a web page with a browser agent, and executing any dynamic elements hosted on the web page using the browser agent to insert pre-determined values. A list of form controls may be retrieved from the web page using the browser agent, ...
Identifying search friendly web pages
Bloomreach, Inc.
February 16, 2017 - N°20170046763

A system for evaluating web pages for searchable content can be utilized to make an e-commerce search engine more effective by identifying pages with searchable value. In embodiments, a web page exhibiting “searchable value” is a page that provides useful information responsive to a user's query on an e-commerce search engine. One embodiment of a page ...
Avoiding masked web page content indexing errors for search engines
Bloomreach, Inc.
November 24, 2016 - N°20160342703

Multiple non-host client sites provide cached user copies of web pages and/or web content, or summaries thereof, to a server. Obtaining data from non-host sources for indexing purposes avoids masked web page content indexing errors for search engines. The server aggregates, summarizes and indexes the web pages and/or web content in an index of cached content, in conjunction ...
Method and system for scheduling web crawlers according to keyword search
Beijing Jingdong Century Trading Co., Ltd.
November 10, 2016 - N°20160328475

A method and a system for scheduling web crawlers according to keyword search. The method comprises: a scheduling end receiving a task request command sent by a crawling node; the scheduling end acquiring a secondary download link address from a priority bucket, generating tasks, adding the generated tasks into a task list, acquiring keyword link addresses from a dynamic bucket, ...
Anchor tag indexing in a web crawler system
Google Inc.
November 03, 2016 - N°20160321252

Provided is a method and system for indexing documents in a collection of linked documents. A link log, including one or more pairings of source documents and target documents is accessed. A sorted anchor map, containing one or more target document to source document pairings, is generated. The pairings in the sorted anchor map are ordered based on target document ...
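The inversion this abstract describes, turning source-to-target link pairings into target-to-source pairings ordered by target, can be sketched roughly as follows. This is only an illustration under stated assumptions, not the method claimed in the application: the function name `build_sorted_anchor_map`, the tuple layout, and the example URLs are all hypothetical.

```python
def build_sorted_anchor_map(link_log):
    """Invert (source, target) pairings and order them by target document.

    `link_log` is assumed to be an iterable of (source_url, target_url)
    tuples; this layout is a hypothetical stand-in for the link log
    mentioned in the abstract.
    """
    # Swap each pairing so the target comes first, then sort. Tuples sort
    # by their first element (the target), with the source as tie-breaker.
    return sorted((target, source) for source, target in link_log)


link_log = [
    ("a.html", "c.html"),
    ("b.html", "a.html"),
    ("b.html", "c.html"),
]
anchor_map = build_sorted_anchor_map(link_log)
```

Grouping all sources under the same target next to each other is what lets an indexer read every inbound link for a document in one sequential pass.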
Crawling of M2M devices
Convida Wireless, LLC
September 22, 2016 - N°20160275190

In accordance with various example embodiments, an M2M crawler service may support capabilities to enable M2M devices to be efficiently and effectively crawled by web crawlers. As a result, M2M devices may be indexed and searched by web search engines, and thus by web users making use of web search engines. Thus, the described-herein M2M crawler ...
Method and apparatus to throttle media access by web crawlers
The Nielsen Company (US), LLC
May 05, 2016 - N°20160127262

Methods, apparatus, systems and articles of manufacture are disclosed to throttle resource access by web crawlers. An example method disclosed herein includes obtaining, at a server, a media request message for media hosted by the server, the media request message requesting access to the media, characterizing a media-requesting source associated with the media request message, and inserting a time delay ...
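The steps named in this abstract (obtain a media request, characterize its source, insert a time delay) might look something like the sketch below. Everything specific here is an assumption made for illustration: the abstract is truncated before it says where the delay goes, so the sketch simply delays before serving; the User-Agent token list, the 2-second default, and all function names are hypothetical.

```python
import time

# Hypothetical substrings that suggest an automated client.
CRAWLER_TOKENS = ("bot", "crawler", "spider")


def characterize_source(user_agent):
    """Classify the requesting source from its User-Agent string (assumed heuristic)."""
    ua = user_agent.lower()
    return "crawler" if any(token in ua for token in CRAWLER_TOKENS) else "other"


def delay_seconds(source_kind, crawler_delay=2.0):
    """Return the delay to insert; only sources classified as crawlers are throttled."""
    return crawler_delay if source_kind == "crawler" else 0.0


def handle_media_request(user_agent, serve, sleep=time.sleep):
    """Characterize the source, wait if required, then serve the media.

    `serve` is a zero-argument callable that returns the media payload;
    `sleep` is injectable so the behavior can be checked without waiting.
    """
    delay = delay_seconds(characterize_source(user_agent))
    if delay > 0:
        sleep(delay)
    return serve()
```

Passing a recording function as `sleep` (for example `delays.append` on a list) makes it possible to confirm that a crawler-like User-Agent is delayed while a browser-like one is not.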
System and method for a cyber intelligence hub
Comsec Consulting Ltd.
April 28, 2016 - N°20160119365

A method for defining and forming a cyber intelligence channel for communicating with consumers facing cyber threats in real time. The method includes collecting information, such as by web crawlers and scrapers. The method also includes filtering the collected information, by filtering mechanisms founded on advanced algorithms. The method goes on to categorize the information into groups based on their unique ...
Community authoring content generation and navigation
Microsoft Technology Licensing, LLC
April 28, 2016 - N°20160117321

One or more techniques and/or systems are provided for creating socially authored, or community authored, summaries of documents and/or for navigating a forum comprising such summaries. In one embodiment, at least some of the summaries are generated automatically when a document is written and/or discovered (e.g., by a web crawler), for example. In another embodiment, the ...
Interactive web crawler
Microsoft Technology Licensing, LLC
April 21, 2016 - N°20160110456

The claimed subject matter provides a system or method for web crawling hidden files. An exemplary method comprises loading a web page with a browser agent, and executing any dynamic elements hosted on the web page using the browser agent to insert pre-determined values. A list of form controls may be retrieved from the web page using the browser agent, ...
System and method to identify machine-readable codes
eBay Inc.
March 24, 2016 - N°20160085874

A method and a system to identify machine-readable codes using a web crawler are provided. Machine-readable codes include, but are not limited to, universal product codes (UPC), quick response (QR) codes, stock-keeping units (SKUs) and international standard book number (ISBN) codes. A web crawler downloads pages from the World Wide Web. A determination module accesses the downloaded pages and identifies ...
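As a rough illustration of identifying one kind of machine-readable code in downloaded page text, the sketch below looks for ISBN-13 candidates and keeps those whose check digit is valid. It is not the determination module described in the application, whose details are truncated here; the function names and the regular expression are hypothetical, and only unhyphenated 13-digit numbers are handled.

```python
import re

# Candidate ISBN-13s: 13 consecutive digits beginning with the 978 or 979 prefix.
CANDIDATE = re.compile(r"\b97[89]\d{10}\b")


def is_valid_isbn13(digits):
    """Check the ISBN-13 checksum: digits weighted 1, 3, 1, 3, ... must sum to a multiple of 10."""
    if len(digits) != 13 or not digits.isdigit():
        return False
    total = sum(int(d) * (1 if i % 2 == 0 else 3) for i, d in enumerate(digits))
    return total % 10 == 0


def find_isbn13(page_text):
    """Return the checksum-valid ISBN-13 strings found in the page text, in order of appearance."""
    return [m for m in CANDIDATE.findall(page_text) if is_valid_isbn13(m)]


page = "Hardcover ISBN 9780306406157; mistyped as 9780306406158 elsewhere."
codes = find_isbn13(page)
```

The checksum step filters out arbitrary 13-digit numbers (order IDs, phone-like strings) that happen to match the pattern.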
Web crawler for acquiring content
UT-Battelle, LLC
February 25, 2016 - N°20160055243

An adaptive web crawling system generates a first utility measurement based on web page snippets associated with individual search result items by crawling from a collection of web page crawling seeds and according to a specific user web crawling criteria. The system generates a second utility measurement based on features extracted from the full webpages downloaded according to the guidance ...
Release date notification system
UT-Battelle, LLC
November 12, 2015 - N°20150324853

Application software for smartphones and tablets works with a cooperating website. The app will compile a calendar and notify the user of the dates of release or availability of selected events, media and/or products, based on a targeted item database compiled by the user and stored locally on the smart device as well as remotely accessible through the ...
Web crawler scheduler that utilizes sitemaps from websites
Google Inc.
August 27, 2015 - N°20150242508

Systems and methods for scheduling documents for crawling are disclosed in which sitemap information is updated for a first website identified by a sitemap by downloading updated sitemap information for the first website and scheduling documents for crawling in accordance with the updated sitemap information for the first website. The sitemap information includes one or more sitemap indexes, where each ...