ContentMine / thresherLinks
Headless scraperJSON scraping for Node.js
☆27Updated 8 years ago
Alternatives and similar repositories for thresher
Users that are interested in thresher are comparing it to the libraries listed below
Sorting:
- The scraperJSON standard for defining web scrapers as JSON objects☆33Updated 10 years ago
- A command line tutorial to learn dat☆69Updated 6 years ago
- Archive and make discoverable data and links with schema.org metadata.☆36Updated 10 years ago
- A CouchDB powered registry for linked data.☆31Updated 10 years ago
- A DSL for building JSON-LD resources☆15Updated 10 years ago
- LevelGraph.io Playground☆11Updated 3 years ago
- A compile-to-JSON data pipeline scripting language [ DEPRECATED - More info on active projects and modules at https://dat-ecosystem.org/ …☆43Updated 3 years ago
- Node.js module/CLI tool for semantic analysis of text using the OpenCalais web service.☆44Updated 9 years ago
- Render a hyperdrive in the browser.☆41Updated 8 years ago
- Annotator PouchDB Storage Plugin☆11Updated 9 years ago
- Multilingual DBpedia Spotlight for NodeJS☆13Updated 7 years ago
- Manage your dataset downloads.☆43Updated 8 years ago
- Toolbox for deep, resilient, markup-invariant linking into HTML documents without their cooperation☆26Updated 2 years ago
- schema.org in JS (work in progress)☆44Updated 2 years ago
- A multi-transport job queue for Seneca☆12Updated 3 years ago
- CLI tool for automating the use of docker containers in streaming data processing pipelines. Works on Windows, Mac and Linux.☆68Updated 10 years ago
- A simple SPARQL client for node.js☆50Updated 6 years ago
- Githulk smash API. Githulk strong.☆30Updated 5 years ago
- LevelGraph plugin for storing N3/Turtle/RDF data☆36Updated 5 years ago
- A Node.js (pure JavaScript) client library for accessing neo4j databases with batch support.☆122Updated 8 years ago
- A suite of modules for text analysis, including simple analysis, nGrams, and TFIDF analysis☆48Updated 4 years ago
- Less code, more flow. Let's dance!☆47Updated 3 months ago
- [DEPRECATED] Please use https://datahub.io/docs/features/data-cli☆109Updated 7 years ago
- Session notes, data, instructions and examples for a hands-on workshop on using a diverse set of tools and practices for journalistic dat…☆15Updated 8 years ago
- Wrapper around gremlin-node to provide out of the box support for Titan graph database☆23Updated 10 years ago
- ☆22Updated 13 years ago
- agent has moved to https://lab.allmende.io/valueflows/agent☆10Updated 4 years ago
- Tables In, Tables Out☆22Updated 9 years ago
- A simple term frequency library (see https://en.wikipedia.org/wiki/Tf%E2%80%93idf#Term_frequency_2 )☆11Updated 8 years ago
- A Hacker News reader built with Choo☆30Updated 8 years ago