
Web Crawler

Web Crawler-related patent applications - as published by the U.S. Patent and Trademark Office (USPTO).


Systems and methods for generating and maintaining internet user profile data
Pathmatics, Inc.
August 17, 2017 - N°20170236157

Systems and methods are provided for automatically generating and maintaining user profile cookie sets. The user profile cookie sets may be used by a web crawler when gathering data such as advertisement data associated with one or more websites. The cookie sets may be generated by choosing a user profile with a set of user traits, selecting a set of ...
Three dimensional web crawler
Pathmatics, Inc.
July 06, 2017 - N°20170193569

The invention is a computer process in which a web crawler or other type of search process views visual media files on the World Wide Web, and initially scans or downloads the files contained in HTML or otherwise backend internet web site files. The process identifies objects in the visual media content as being mass production items, and places the ...
Detection of coordinated cyber-attacks
F-Secure Corporation
June 22, 2017 - N°20170180402

A method of detecting coordinated attacks on computer and computer networks via the internet. The method includes using a web crawler to crawl the World Wide Web to identify domains and subdomains and their associated IP addresses, and to identify links between domains and subdomains, and storing the results in a database. When an IP address is identified as malicious ...
Web Crawler Patent Pack
Download + patent application PDFs
Web Crawler Patent Applications
Download + Web Crawler-related PDFs
For professional research & prior art discovery
inventor
  • + full patent PDF documents of Web Crawler-related inventions.
  • Exact USPTO filing data with full-text, images, drawings & claims.
  • Index pages: Table View and Image-Grid View layouts. All images in each PDF.
Detecting disclosed content sources using dynamic steganography
Box, Inc.
May 11, 2017 - N°20170134344

Systems for forensic steganography. A server is interfaced with storage facilities that store an object accessible by two or more users, each of which users are associated with respective profiles comprising one or more user-specific attributes. A method detects a user request to view the object. User-specific attributes are encoded into a steganographic message, which is formatted for saving into ...
Securing shared documents using dynamic natural language steganography
Box, Inc.
May 04, 2017 - N°20170126631

Systems for secure cloud-based collaboration over shared objects. Embodiments operate within systems in a cloud-based environment, wherein one or more servers are configured to interface with storage devices that store objects accessible by one or more users. A process receives an electronic message comprising a user request to access an object. Before providing user access to the object, the system ...
Providing cloud-based health-related data analytics services
Fujitsu Limited
April 20, 2017 - N°20170109443

A method to provide health-related data analytics services via a web service may include crawling, via a web crawler, the internet to identify multiple websites with content related to human health. The method may also include obtaining, using text classification, multiple words associated with an occurrence in lives of people and multiple words associated with a health outcome in the ...
Identifying search friendly web pages
Bloomreach, Inc.
February 16, 2017 - N°20170046763

A system for evaluating web pages for searchable content can be utilized to make an e-commerce search engine more effective by identifying pages with searchable value. In embodiments, a web page exhibiting “searchable value” is a page that provides useful information responsive to a user's query on an e-commerce search engine. One embodiment of a page ...
Avoiding masked web page content indexing errors for search engines
Bloomreach, Inc.
November 24, 2016 - N°20160342703

Multiple non-host client sites provide cached user copies of web pages and/or web content, or summaries thereof, to a server. Obtaining data from non-host sources for indexing purposes avoids masked web page content indexing errors for search engines. The server aggregates, summarizes and indexes the web pages and/or web content in an index of cached content, in conjunction ...
Method and system for scheduling web crawlers according to keyword search
Beijing Jingdong Century Trading Co., Ltd.
November 10, 2016 - N°20160328475

A method and a system for scheduling web crawlers according to keyword search. The method comprises: a scheduling end receiving a task request command sent by a crawling node; the scheduling end acquiring a secondary download link address from a priority bucket, generating tasks, adding the generated tasks into a task list, acquiring keyword link addresses from a dynamic bucket, ...
Anchor tag indexing in a web crawler system
Google Inc.
November 03, 2016 - N°20160321252

Provided is a method and system for indexing documents in a collection of linked documents. A link log, including one or more pairings of source documents and target documents is accessed. A sorted anchor map, containing one or more target document to source document pairings, is generated. The pairings in the sorted anchor map are ordered based on target document ...
Crawling of M2M devices
Convida Wireless, LLC
September 22, 2016 - N°20160275190

In accordance with various example embodiments, an M2M crawler service may support capabilities to enable M2M devices to be efficiently and effectively crawled by web crawlers. As a result, M2M devices may be indexed and searched by web search engines, and thus by web users making use of web search engines. Thus, the described-herein M2M crawler ...
High precision internet local search
Uber Technologies, Inc.
June 02, 2016 - N°20160154807

High-precision local search is performed on the internet. A map image-rendering software provider embeds spatial keys into maps, which are then provided to producers of internet content such as map providers. For example, a homeowner may post a message on a web bulletin board advertising his house for sale, and including a map showing the location of the house. When ...
Method and apparatus to throttle media access by web crawlers
The Nielsen Company (US), LLC
May 05, 2016 - N°20160127262

Methods, apparatus, systems and articles of manufacture are disclosed to throttle resource access by web crawlers. An example method disclosed herein includes obtaining, at a server, a media request message for media hosted by the server, the media request message requesting access to the media, characterizing a media-requesting source associated with the media request message, and inserting a time delay ...
Community authoring content generation and navigation
Microsoft Technology Licensing, LLC
April 28, 2016 - N°20160117321

One or more techniques and/or systems are provided for creating socially authored, or community authored, summaries of documents and/or for navigating a forum comprising such summaries. In one embodiment, at least some of the summaries are generated automatically when a document is written and/or discovered (e.g., by a web crawler), for example. In another embodiment, the ...
Interactive web crawler
Microsoft Technology Licensing, LLC
April 21, 2016 - N°20160110456

The claimed subject matter provides a system or method for web crawling hidden files. An exemplary method comprises loading a web page with a browser agent, and executing any dynamic elements hosted on the web page using the browser agent to insert pre-determined values. A list of form controls may be retrieved from the web page using the browser agent, ...
System and method to identify machine-readable codes
eBay Inc.
March 24, 2016 - N°20160085874

A method and a system to identify machine-readable codes using a web crawler are provided. Machine-readable codes include, but are not limited to, universal product codes (UPC), quick response (QR) codes, stock-keeping units (SKUs) and international standard book number (ISBN) codes. A web crawler downloads pages from the World Wide Web. A determination module accesses the downloaded pages and identifies ...
Web crawler for acquiring content
UT-Battelle, LLC
February 25, 2016 - N°20160055243

An adaptive web crawling system generates a first utility measurement based on web page snippets associated with individual search result items by crawling from a collection of web page crawling seeds and according to a specific user web crawling criteria. The system generates a second utility measurement based on features extracted from the full webpages downloaded according to the guidance ...
Release date notification system
UT-Battelle, LLC
November 12, 2015 - N°20150324853

An application software for smartphones and tablets works with a cooperating website. The app will compile a calendar and notify the user of the dates of release or availability of selected events, media and/or products, based on a targeted item database compiled by the user and stored locally on the smart device as well as remotely accessible through the ...
Web crawler scheduler that utilizes sitemaps from websites
Google Inc.
August 27, 2015 - N°20150242508

Systems and methods for scheduling documents for crawling are disclosed in which sitemap information is updated for a first website identified by a sitemap by downloading updated sitemap information for the first website and scheduling documents for crawling in accordance with the updated sitemap information for the first website. The sitemap information includes one or more sitemap indexes, where each ...
System and method for preventing web crawler access
Alibaba Group Holding Limited
July 09, 2015 - N°20150195305

Preventing web crawler access includes receiving a request for a webpage that includes web content that is to be protected from a web crawler, encrypting the web content to be protected to generate encrypted content and responding to the request, including sending the encrypted content and a decryption instruction. The decryption instruction is configured to allow a web browser to ...
Web crawler optimization system
eBay Inc.
June 11, 2015 - N°20150161257

Techniques for optimizing the performance of a webpage crawler are described. According to various embodiments, historical web crawler performance data is accessed, the data describing a performance of a web crawler during various time periods in one or more prior days. A capacity of the web crawler to fulfil uniform resource locator (URL) crawl requests for an upcoming given time ...