danielnieto / scrapmanLinks
Retrieve real (with Javascript executed) HTML code from an URL, ultra fast and supports multiple parallel loading of webs
☆22Updated 7 years ago
Alternatives and similar repositories for scrapman
Users that are interested in scrapman are comparing it to the libraries listed below
Sorting:
- Friendly web crawler for x-ray☆44Updated 2 years ago
- Fastest way to fetch the web content(HTML stream) from server, supports:redirects, auto decode(e.g.:Chinese), gzip, cookie, proxy...☆33Updated 5 years ago
- Yagnus Javascript Libraries☆22Updated 12 years ago
- Entity relationship, role, and permissions API for Node.js☆61Updated 10 years ago
- Fast and extendible Node.js/Javascript fulltext search engine.☆72Updated 2 weeks ago
- given two streams of newline delimited JSON data perform a merge/extend on each object in the stream☆50Updated 8 years ago
- 🙋 Fast, lightweight and transparent http(s) proxy that supports dynamic hostnames.☆16Updated 8 years ago
- A nodejs Scraping Utility for lazy people. MIT Licensed☆44Updated 3 years ago
- Job Queue in LevelDB☆86Updated 2 years ago
- 📉 JavaScript Text Statistics that counts lines, words, chars, and spaces.☆36Updated 3 years ago
- a puppeteer walker 🕷 🕸☆79Updated 5 years ago
- Naive Bayes Text Classifier☆40Updated 8 months ago
- Event sourcing JavaScript entity class☆11Updated 8 years ago
- A `htmlparser2` handler for parsing rich metadata from HTML. Includes HTML metadata, JSON-LD, RDFa, microdata, OEmbed, Twitter cards and …☆57Updated last year
- Stargazerz - export repository 🌟 stargazers 🌟 profile as JSON☆10Updated 8 years ago
- sandcrawler.js - the server-side scraping companion.☆108Updated 9 years ago
- Small module wrapper for the AWS sdk that allows you to easily use s3 or the local file system☆79Updated 6 years ago
- Create HTML snippets/embeds from URLs using info from oEmbed, Open Graph, meta tags.☆66Updated 2 years ago
- Serializes any DOM node into a String☆37Updated 6 years ago
- streaming parser for the ZIM aka OpenZIM file format http://www.openzim.org/wiki/ZIM_file_format☆14Updated 8 years ago
- FIFO queue in Javascript☆59Updated 4 years ago
- x-ray's selector parser.☆16Updated 9 years ago
- transform streaming html using css selectors☆48Updated 2 years ago
- A 2nd generation spider to crawl any article site, automatic read title and article.☆43Updated 9 years ago
- A simple CRUD based persistence abstraction for storing objects to any backend data store. eg. Memory, MongoDB, Redis, CouchDB, Postgres,…☆144Updated 2 years ago
- Vanilla JavaScript implementation of the Weighted PageRank Algorithm☆34Updated 6 years ago
- in memory mocking engine for mongo db☆25Updated 2 years ago
- A cross-platform scanner for wireless networks☆13Updated 10 years ago
- Node wrapper around FastText Library☆57Updated 2 years ago
- Super simple and fast html page meta data extractor with low memory footprint☆36Updated 2 years ago