ContentMine / thresher
Headless scraperJSON scraping for Node.js
☆27 Updated 9 years ago
Alternatives and similar repositories for thresher
Users that are interested in thresher are comparing it to the libraries listed below
- The scraperJSON standard for defining web scrapers as JSON objects ☆33 Updated 11 years ago
- [DEPRECATED] Please use https://datahub.io/docs/features/data-cli ☆109 Updated 7 years ago
- A command line tutorial to learn dat ☆70 Updated 7 years ago
- A compile-to-JSON data pipeline scripting language [ DEPRECATED - More info on active projects and modules at https://dat-ecosystem.org/ … ☆43 Updated 3 years ago
- Archive and make discoverable data and links with schema.org metadata. ☆37 Updated 10 years ago
- agent has moved to https://lab.allmende.io/valueflows/agent ☆10 Updated 5 years ago
- a repo for discussions and other non-code organizing stuff [ DEPRECATED - More info on active projects and modules at https://dat-ecosyst… ☆65 Updated 3 years ago
- 'Git for Tabular Data' ☆46 Updated 9 years ago
- Node.js module/CLI tool for semantic analysis of text using the OpenCalais web service. ☆44 Updated 9 years ago
- A DSL for building JSON-LD resources ☆15 Updated 10 years ago
- Build consistent and versioned styleguides by including and running consistent lint files across projects. ☆26 Updated 8 years ago
- Build cross platform data pipelines [ DEPRECATED - More info on active projects and modules at https://dat-ecosystem.org/ ] ☆191 Updated 3 years ago
- To JSON what Excel is to CSV ☆144 Updated 8 years ago
- Render a hyperdrive in the browser. ☆41 Updated 9 years ago
- A tool for editing CSV & JSON files from your computer & from GitHub. ☆48 Updated 8 years ago
- A CouchDB powered registry for linked data. ☆31 Updated 10 years ago
- Manage your dataset downloads. ☆43 Updated 8 years ago
- minimal module for launching compute clusters ☆24 Updated 9 years ago
- Dat Project Website [ DEPRECATED - More info on active projects and modules at https://dat-ecosystem.org/ ] ☆23 Updated 3 years ago
- Dulcimer is a Node.js ORM for keystores. ☆79 Updated 5 years ago
- [RETIRED] Webmaker Filesystem ☆352 Updated 7 years ago
- Githulk smash API. Githulk strong. ☆30 Updated 5 years ago
- SVM Classifier to Detect Sentiment of Tweets ☆16 Updated 10 years ago
- Toolbox for deep, resilient, markup-invariant linking into HTML documents without their cooperation ☆26 Updated 2 years ago
- WebGL Globe ☆22 Updated 10 years ago
- a little nodejs server and script that extracts letters from images via tesseract ☆19 Updated 10 years ago
- Collaborative Innovation Class Project ☆14 Updated 10 years ago
- Scholar Ninja - Chrome extension. A distributed open search engine for scholarly content, based on a WebRTC DHT network ☆115 Updated 9 years ago
- A multi-transport job queue for Seneca ☆12 Updated 4 years ago
- LevelGraph plugin for storing N3/Turtle/RDF data ☆36 Updated 5 years ago