laggingreflex / scrapers-benchmarksLinks
☆14Updated 8 years ago
Alternatives and similar repositories for scrapers-benchmarks
Users that are interested in scrapers-benchmarks are comparing it to the libraries listed below
Sorting:
- Automatically extracts structured information from webpages☆112Updated 3 years ago
- Simhash implementation in Javascript☆39Updated 8 years ago
- a puppeteer walker 🕷 🕸☆79Updated 5 years ago
- A web crawler. Supercrawler automatically crawls websites. Define custom handlers to parse content. Obeys robots.txt, rate limits and con…☆382Updated 3 years ago
- Easily generate correct user-agent strings for popular browsers☆74Updated 4 years ago
- Puppeteer(Chrome headless node API) based web page renderer☆329Updated last month
- Email automation driven by headless chrome.☆167Updated 5 years ago
- A command-line tool to crawl websites using puppeteer.☆104Updated 3 years ago
- Chromium / Puppeteer site crawler☆48Updated 5 years ago
- Vanilla JavaScript implementation of the Weighted PageRank Algorithm☆34Updated 6 years ago
- PageRank calculation for ngraph.graph☆29Updated 2 months ago
- A simple browser/client-side web scraper.☆241Updated 8 years ago
- Search domain names if registered or not in terminal☆167Updated 7 years ago
- Experimental Nightmare plugin for real mouse events☆69Updated 5 years ago
- Get n-grams from text☆84Updated 3 years ago
- Some tiny hash functions in javascript☆123Updated 5 years ago
- ☆138Updated 3 years ago
- Simple abstraction to use Chrome as a Headless Browser with Node JS☆214Updated 8 years ago
- sandcrawler.js - the server-side scraping companion.☆109Updated 10 years ago
- JavaScript arbitrary-precision arithmetic library. Built for speed.☆145Updated 6 years ago
- PhantomJS resource pool based on generic-pool☆106Updated 6 years ago
- Retrieve real (with Javascript executed) HTML code from an URL, ultra fast and supports multiple parallel loading of webs☆22Updated 7 years ago
- Advanced Node proxy checker (node proxy verifier, node proxy tester) with socks and https support☆107Updated 3 years ago
- Randomly generates User-Agent strings based on actual usage statistics from Wikipedia and StatOwl.com as of July 2012.☆35Updated 9 years ago
- Parse WARC (Web Archive Files) as a node.js stream☆23Updated 11 years ago
- NodeJS real threads with shared memory☆40Updated 6 years ago
- Robust text renderer using headless chrome.☆66Updated 2 years ago
- REST API for scraping dynamic websites using Node.js, headless Chrome and Cheerio.☆64Updated last year
- A Better Scraper, with Puppeteer☆43Updated last month
- Creates screencasting-like gifs of page-scrolls☆16Updated 3 years ago