medialab / sandcrawler
sandcrawler.js - the server-side scraping companion.
☆107 Updated 8 years ago
Related projects
Alternatives and complementary repositories for sandcrawler
- phantom driver for x-ray. ☆111 Updated 8 years ago
- The selection parser for x-ray. Aiming to bring structure to the web. ☆20 Updated 9 years ago
- Friendly web crawler for x-ray ☆44 Updated last year
- Capture screenshots in multiple browsers! ☆416 Updated last year
- Nightmare plugin for LinkedIn. ☆64 Updated 9 years ago
- Parser for robots.txt for node.js ☆66 Updated 3 years ago
- A simple Elasticsearch CSV importer node.js library ☆48 Updated 8 years ago
- Simple query builder for elasticsearch ☆56 Updated 8 years ago
- A handy terminal dashboard plugin for sandcrawler. ☆20 Updated 8 years ago
- Simple bridge to phantomjs for Node ☆201 Updated 4 years ago
- Utility to crawl and diff websites for node.js ☆112 Updated 7 years ago
- Image analysis and comparison ☆66 Updated 8 years ago
- ☆128 Updated 6 years ago
- Web scraping and HTML-reprocessing. The easy way. ☆393 Updated 5 years ago
- Convert a GIF image into an HTML5-ready video for considerably better file sizes ☆67 Updated 10 years ago
- node.js wrapper for the Diffbot API (article and frontpage) ☆35 Updated 8 years ago
- A suite of modules for text analysis, including simple analysis, nGrams, and TFIDF analysis ☆49 Updated 3 years ago
- A simple-but-useful kNN library for NodeJS, comparing JSON Objects using Euclidean distances ☆215 Updated 9 years ago
- A 2nd generation spider to crawl any article site, automatic read title and article. ☆43 Updated 8 years ago
- Martin Porter's stemmer for node.js ☆100 Updated 4 years ago
- A simple node.js wrapper for Stanford CoreNLP. ☆75 Updated 2 years ago
- 💱 Advanced node.js wrapper for the Open Exchange Rates API ☆73 Updated last year
- Node.js wrapper for the DuckDuckGo Instant Answers API. ☆64 Updated last year
- More verbose and readable regular expressions ☆61 Updated 9 years ago
- A Serverless - Node.js project to create reports from database queries, and send those reports out in pretty emails. Motivation: ☆51 Updated 2 months ago
- Chrome extension that acts as terminal and task runner ☆269 Updated 5 years ago
- This is the pure Node API for reading MaxMind DB files. MaxMind DB is a binary file format that stores data indexed by IP address subnets… ☆88 Updated 6 years ago