Images List Premium Download Classic

Web Crawler

Web Crawler-related patent applications - as published by the U.S. Patent and Trademark Office (USPTO).


loading
Detection of coordinated cyber-attacks
F-secure Corporation
June 22, 2017 - N°20170180402

A method of detecting coordinated attacks on computer and computer networks via the internet. The method includes using a web crawler to crawl the world wide web to identify domains and subdomains and their associated ip addresses, and to identify links between domains and subdomains, and storing the results in a database. When an ip address is identified as malicious ...
Developing an item data model for an item
Wal-mart Stores, Inc.
June 22, 2017 - N°20170177725

The present invention extends to methods, systems, and computer program products for developing an item data model for an item. Aspects of the invention can automate the process of data collection of “facts” for “items” that information is needed about. Facts can be organized and normalized to eliminate redundant facts, and interpret ...
Detecting disclosed content sources using dynamic steganography
Box, Inc.
May 11, 2017 - N°20170134344

Systems for forensic steganography. A server is interfaced with storage facilities that store an object accessible by two or more users, each of which users are associated with respective profiles comprising one or more user-specific attributes. A method detects a user request to view the object. User-specific attributes are encoded into a steganographic message, which is formatted for saving into ...
Web Crawler Patent Pack
Download + patent application PDFs
Web Crawler Patent Applications
Download + Web Crawler-related PDFs
For professional research & prior art discovery
inventor
  • + full patent PDF documents of Web Crawler-related inventions.
  • Exact USPTO filing data with full-text, images, drawings & claims.
  • Index pages: Table View and Image-Grid View layouts. All images in each PDF.
Providing cloud-based health-related data analytics services
Fujitsu Limited
April 20, 2017 - N°20170109443

A method to provide health-related data analytics services via a web service may include crawling, via a web crawler, the internet to identify multiple websites with content related to human health. The method may also include obtaining, using text classification, multiple words associated with an occurrence in lives of people and multiple words associated with a health outcome in the ...
Interactive web crawler
Microsoft Technology Licensing, Llc
March 02, 2017 - N°20170061029

The claimed subject matter provides a system or method for web crawling hidden files. An exemplary method comprises loading a web page with a browser agent, and executing any dynamic elements hosted on the web page using the browser agent to insert pre-determined values. A list of form controls may be retrieved from the web page using the browser agent, ...
Identifying search friendly web pages
Bloomreach, Inc.
February 16, 2017 - N°20170046763

A system for evaluating web pages for searchable content can be utilized to make an e-commerce search engine more effective by identifying pages with searchable value. In embodiments, a web page exhibiting “searchable value” is a page that provides useful information responsive to a user's query on an e-commerce search engine. One embodiment of a page ...
Web Crawler Patent Pack
Download + patent application PDFs
Web Crawler Patent Applications
Download + Web Crawler-related PDFs
For professional research & prior art discovery
inventor
  • + full patent PDF documents of Web Crawler-related inventions.
  • Exact USPTO filing data with full-text, images, drawings & claims.
  • Index pages: Table View and Image-Grid View layouts. All images in each PDF.
Method and system for scheduling web crawlers according to keyword search
Beijing Jingdong Century Trading Co., Ltd.
November 10, 2016 - N°20160328475

A method and a system for scheduling web crawlers according to keyword search. The method comprises: a scheduling end receiving a task request command sent by a crawling node; the scheduling end acquiring a secondary download link address from a priority bucket, generating tasks, adding the generated tasks into a task list, acquiring keyword link addresses from a dynamic bucket, ...
Anchor tag indexing in a web crawler system
Google Inc.
November 03, 2016 - N°20160321252

Provided is a method and system for indexing documents in a collection of linked documents. A link log, including one or more pairings of source documents and target documents is accessed. A sorted anchor map, containing one or more target document to source document pairings, is generated. The pairings in the sorted anchor map are ordered based on target document ...
Crawling of m2m devices
Convida Wireless, Llc
September 22, 2016 - N°20160275190

In accordance with various example embodiments, an m2m crawler service may support capabilities to enable m2m devices to be efficiently and effectively crawled by web crawlers. As a result, m2m devices may be indexed and searched by web search engines, and thus by web users making use of web search engines. Thus, the described-herein m2m crawler ...
High precision internet local search
Uber Technologies, Inc.
June 02, 2016 - N°20160154807

High-precision local search is performed on the internet. A map image-rendering software provider embeds spatial keys into maps, which are then provided to producers of internet content such as map providers. For example, a homeowner may post a message on a web bulletin board advertising his house for sale, and including a map showing the location of the house. When ...
Method and apparatus to throttle media access by web crawlers
The Nielsen Company (us), Llc
May 05, 2016 - N°20160127262

Methods, apparatus, systems and articles of manufacture are disclosed to throttle resource access by web crawlers. An example method disclosed herein includes obtaining, at a server, a media request message for media hosted by the server, the media request message requesting access to the media, characterizing a media-requesting source associated with the media request message, and inserting a time delay ...
System and method for a cyber intelligence hub
Comsec Consulting Ltd.
April 28, 2016 - N°20160119365

A method for defining and forming a cyber intelligence channel communicating with consumers is facing cyber threats in real time. The method includes collecting information, such that web crawlers and scrapers. The method also includes filtering the collected information, by filtering mechanisms founded on advanced algorithms. The method goes on to categorize the information into groups based on their unique ...
Community authoring content generation and navigation
Microsoft Technology Licensing, Llc
April 28, 2016 - N°20160117321

One or more techniques and/or systems are provided for creating socially authored, or community authored, summaries of documents and/or for navigating a forum comprising such summaries. In one embodiment, at least some of the summaries are generated automatically when a document is written and/or discovered (e. G., by a web crawler), for example. In another embodiment, the ...
Web Crawler Patent Pack
Download + patent application PDFs
Web Crawler Patent Applications
Download + Web Crawler-related PDFs
For professional research & prior art discovery
inventor
  • + full patent PDF documents of Web Crawler-related inventions.
  • Exact USPTO filing data with full-text, images, drawings & claims.
  • Index pages: Table View and Image-Grid View layouts. All images in each PDF.
System and method to identify machine-readable codes
Ebay Inc.
March 24, 2016 - N°20160085874

A method and a system to identify machine-readable codes using a web crawler are provided. Machine-readable codes include, but are not limited to, universal product codes (upc), quick response (qr) codes, stock-keeping units (skus) and international standard book number (isbn) codes. A web crawler downloads pages from the world wide web. A determination module accesses the downloaded pages and identifies ...
Web crawler for acquiring content
Ut Battelle, Llc
February 25, 2016 - N°20160055243

An adaptive web crawling system generates a first utility measurement based on web page snippets associated with individual search result items by crawling from a collection of web page crawling seeds and according to a specific user web crawling criteria. The system generates a second utility measurement based on features extracted from the full webpages downloaded according to the guidance ...
Release date notification system
Ut Battelle, Llc
November 12, 2015 - N°20150324853

An application software for smartphones and tablets works with a cooperating website. The app will compile a calendar and notify the user of the dates of release or availability of selected events, media and/or products, based on a targeted item database compiled by the user and stored locally on the smart device as well as remotely accessible through the ...
Web crawler scheduler that utilizes sitemaps from websites
Google Inc.
August 27, 2015 - N°20150242508

Systems and methods for scheduling documents for crawling are disclosed in which sitemap information is updated for a first website identified by a sitemap by downloading updated sitemap information for the first website and scheduling documents for crawling in accordance with the updated sitemap information for the first website. The sitemap information includes one or more sitemap indexes, where each ...
System and method for preventing web crawler access
Alibaba Group Holding Limited
July 09, 2015 - N°20150195305

Preventing web crawler access includes receiving a request for a webpage that includes web content that is to be protected from a web crawler, encrypting the web content to be protected to generate encrypted content and responding to the request, including sending the encrypted content and a decryption instruction. The decryption instruction is configured to allow a web browser to ...
Web crawler optimization system
Ebay Inc.
June 11, 2015 - N°20150161257

Techniques for optimizing the performance of a webpage crawler are described. According to various embodiments, historical web crawler performance data is accessed, the data describing a performance of a web crawler during various time periods in one or more prior days. A capacity of the web crawler to fulfil uniform resource locator (url) crawl requests for an upcoming given time ...
Method for correlating data
Ebay Inc.
April 30, 2015 - N°20150120694

A method for correlating data stored in a database implements a web crawler element and an analyzer element to discover data correlations between a first set of data and a second set of data. The web crawler element searches online for a plurality of electronic files, and inspects said electronic files in order to determine a file type for each ...
Method, device, and system for acquiring user behavior
Huawei Technologies Co., Ltd.
April 30, 2015 - N°20150120692

Embodiments of the present invention provide a method, a device, and a system for acquiring a user behavior. In the embodiments of the present invention, an acquired url request matches a database, and the database stores a url actively initiated by a user recognized by adopting a web crawler technology. If a url contained in the url request matches a ...
Loading