danielnieto / scrapmanLinks
Retrieve real (with Javascript executed) HTML code from an URL, ultra fast and supports multiple parallel loading of webs
☆22Updated 7 years ago
Alternatives and similar repositories for scrapman
Users that are interested in scrapman are comparing it to the libraries listed below
Sorting:
- Friendly web crawler for x-ray☆44Updated 3 years ago
- A nodejs Scraping Utility for lazy people. MIT Licensed☆44Updated 3 years ago
- x-ray's selector parser.☆16Updated 10 years ago
- 🕴Elegant command execution for Node.☆37Updated 5 years ago
- given two streams of newline delimited JSON data perform a merge/extend on each object in the stream☆50Updated 9 years ago
- sandcrawler.js - the server-side scraping companion.☆109Updated 10 years ago
- 🙋 Fast, lightweight and transparent http(s) proxy that supports dynamic hostnames.☆16Updated 9 years ago
- a puppeteer walker 🕷 🕸☆79Updated 5 years ago
- Kanban board that just works in your browser (even when you have no internet)☆39Updated last year
- Naive Bayes Text Classifier☆41Updated 11 months ago
- A simple function that prints objects as ASCII tables. Supports ANSI styling and weird Unicode 💩 emojis – they won't break the layout.☆65Updated 2 years ago
- Serializes any DOM node into a String☆37Updated 6 years ago
- Stargazerz - export repository 🌟 stargazers 🌟 profile as JSON☆10Updated 9 years ago
- Returns a `stream.Readable` from a URI string☆49Updated 2 years ago
- Super simple and fast html page meta data extractor with low memory footprint☆36Updated 3 years ago
- Job Queue in LevelDB☆86Updated 3 years ago
- Small module wrapper for the AWS sdk that allows you to easily use s3 or the local file system☆79Updated 6 years ago
- A WIP CSV viewer element.☆50Updated 9 years ago
- Yagnus Javascript Libraries☆22Updated 12 years ago
- Single file write-once database that is valid JSON with efficient random access on bigger datasets☆111Updated 7 years ago
- Fastest way to fetch the web content(HTML stream) from server, supports:redirects, auto decode(e.g.:Chinese), gzip, cookie, proxy...☆33Updated 5 years ago
- EventSource implemented in node and the browser as a readable stream☆47Updated 7 years ago
- Filters (removes) objects from document based on passed json-schema☆37Updated 3 years ago
- [WIP] Web Crawler in Node.js to spider dynamically whole websites.☆35Updated 6 years ago
- Ease the implementation of multi processing accross your microservices☆48Updated 3 years ago
- Automatically extracts structured information from webpages☆112Updated 3 years ago
- Vanilla JavaScript implementation of the Weighted PageRank Algorithm☆34Updated 6 years ago
- File system cache for Node.JS☆25Updated 4 years ago
- Create HTML snippets/embeds from URLs using info from oEmbed, Open Graph, meta tags.☆67Updated 2 years ago
- Detect the polarity (sentiment) of text☆54Updated 3 years ago