ContentMine / thresher
Headless scraperJSON scraping for Node.js
☆27 Updated 8 years ago
Alternatives and similar repositories for thresher:
Users who are interested in thresher are comparing it to the libraries listed below.
- The scraperJSON standard for defining web scrapers as JSON objects ☆33 Updated 10 years ago
- LevelGraph.io Playground ☆11 Updated 3 years ago
- Archive and make discoverable data and links with schema.org metadata. ☆36 Updated 10 years ago
- A compile-to-JSON data pipeline scripting language [ DEPRECATED - More info on active projects and modules at https://dat-ecosystem.org/ … ☆43 Updated 3 years ago
- A CouchDB powered registry for linked data. ☆31 Updated 10 years ago
- agent has moved to https://lab.allmende.io/valueflows/agent ☆10 Updated 4 years ago
- A command line tutorial to learn dat ☆69 Updated 6 years ago
- LevelGraph plugin for storing N3/Turtle/RDF data ☆36 Updated 5 years ago
- a repo for discussions and other non-code organizing stuff [ DEPRECATED - More info on active projects and modules at https://dat-ecosyst… ☆65 Updated 3 years ago
- Render a hyperdrive in the browser. ☆40 Updated 8 years ago
- Annotator PouchDB Storage Plugin ☆11 Updated 8 years ago
- Session notes, data, instructions and examples for a hands-on workshop on using a diverse set of tools and practices for journalistic dat… ☆15 Updated 8 years ago
- Toolbox for deep, resilient, markup-invariant linking into HTML documents without their cooperation ☆26 Updated 2 years ago
- Dat Project Website [ DEPRECATED - More info on active projects and modules at https://dat-ecosystem.org/ ] ☆23 Updated 3 years ago
- Manage your dataset downloads. ☆43 Updated 7 years ago
- A DSL for building JSON-LD resources ☆15 Updated 9 years ago
- Convert OWL/RDFS and XML Schema to a canonical JSON Schema (draft 4) representation. ☆22 Updated 9 years ago
- schema.org in JS (work in progress) ☆44 Updated 2 years ago
- A node.js module that creates a term vector from a mixed text input. Supports stopword removal and customisable separators. ☆19 Updated 3 months ago
- sci.pe (science periodicals) extension of schema:ScholarlyArticle to describe the production process, content, distribution and preser… ☆4 Updated 2 years ago
- The Object Document Mapper for LevelGraph based on JSON-LD ☆113 Updated last year
- ☆175 Updated 7 years ago
- SVM Classifier to Detect Sentiment of Tweets ☆16 Updated 9 years ago
- A custom RDFa parser (based on green-turtle) to be registered with jsonld.js registerRDFParser method ☆13 Updated last year
- 'Git for Tabular Data' ☆46 Updated 8 years ago
- Implementation of the Linked Data Platform for Node. ☆19 Updated 11 years ago
- CLI tool for automating the use of docker containers in streaming data processing pipelines. Works on Windows, Mac and Linux. ☆68 Updated 10 years ago
- WebGL Globe ☆22 Updated 9 years ago
- Build consistent and versioned styleguides by including and running consistent lint files across projects. ☆26 Updated 7 years ago
- a resource for creating and configuring express http servers ☆17 Updated 4 years ago