ContentMine / thresher
Headless scraperJSON scraping for Node.js
☆27 · Updated 8 years ago
Alternatives and similar repositories for thresher:
Users interested in thresher compare it to the libraries listed below.
- The scraperJSON standard for defining web scrapers as JSON objects ☆33 · Updated 10 years ago
- A compile-to-JSON data pipeline scripting language [ DEPRECATED - More info on active projects and modules at https://dat-ecosystem.org/ … ☆43 · Updated 3 years ago
- Render a hyperdrive in the browser. ☆40 · Updated 8 years ago
- LevelGraph.io Playground ☆11 · Updated 3 years ago
- Session notes, data, instructions and examples for a hands-on workshop on using a diverse set of tools and practices for journalistic dat… ☆15 · Updated 8 years ago
- Dat Project Website [ DEPRECATED - More info on active projects and modules at https://dat-ecosystem.org/ ] ☆23 · Updated 3 years ago
- minimal module for launching compute clusters ☆24 · Updated 9 years ago
- A command line tutorial to learn dat ☆69 · Updated 6 years ago
- a repo for discussions and other non-code organizing stuff [ DEPRECATED - More info on active projects and modules at https://dat-ecosyst… ☆65 · Updated 3 years ago
- Render templates from any engine. Make custom template types, use layouts on pages, partials or any custom template type, custom delimite… ☆52 · Updated 8 years ago
- Manage your dataset downloads. ☆43 · Updated 8 years ago
- a resource for creating and configuring express http servers ☆17 · Updated 4 years ago
- agent has moved to https://lab.allmende.io/valueflows/agent ☆10 · Updated 4 years ago
- IPFS Merkle DAG that replicates based on append-only logs and causal linking. ☆56 · Updated 7 years ago
- a tale of two applications. one built as a monolith. the same built with microservices ☆15 · Updated 9 years ago
- low level implementation of the dat data version graph ☆42 · Updated 9 years ago
- AFINN 111 (list of English words rated for valence) in JSON ☆44 · Updated 2 years ago
- [DEPRECATED] Please use https://datahub.io/docs/features/data-cli ☆109 · Updated 7 years ago
- try dat using docker ☆30 · Updated 7 years ago
- DEPRECATED in favour of retext’s virtual object model ☆39 · Updated 9 years ago
- Githulk smash API. Githulk strong. ☆30 · Updated 4 years ago
- Yeoman generator for atom editor packages. ☆39 · Updated 9 months ago
- Build consistent and versioned styleguides by including and running consistent lint files across projects. ☆26 · Updated 7 years ago
- A CouchDB powered registry for linked data. ☆31 · Updated 10 years ago
- Portable Linked Profiles documentation ☆26 · Updated 9 years ago
- ☆24 · Updated 8 years ago
- LevelGraph plugin for storing N3/Turtle/RDF data ☆36 · Updated 5 years ago
- A simple term frequency library (see https://en.wikipedia.org/wiki/Tf%E2%80%93idf#Term_frequency_2 ) ☆11 · Updated 8 years ago
- warning! you should probably still use the regular dat cli ----> ☆27 · Updated 6 years ago
- Dat Project Projects [ DEPRECATED - More info on active projects and modules at https://dat-ecosystem.org/ ] ☆13 · Updated 3 years ago