html-extract / hext
Domain-specific language for extracting structured data from HTML documents
☆52Updated 3 months ago
Alternatives and similar repositories for hext:
Users that are interested in hext are comparing it to the libraries listed below
- Where things come from in Who's On First.☆21Updated 10 months ago
- Browser version of Hyphe (WIP)☆30Updated 4 months ago
- API endpoint and UI for blockbuilder search page☆20Updated 2 years ago
- 📑 Read a Google Drive Doc and convert to JSON (via ArchieML)☆22Updated 6 years ago
- 🖼 A minimalistic take on responsive iframes in the spirit of Pym.js.☆26Updated 2 years ago
- experiments in sorting☆26Updated 2 years ago
- Rig for deploying DocumentCloud viewers to S3.☆13Updated 3 years ago
- A collection of visualization projects built on Wikipedia data.☆40Updated 2 years ago
- download and process d3.js blocks for further indexing and visualization☆24Updated 5 years ago
- Tools for working with Optical Character Recognition output☆16Updated 10 years ago
- Machine assisted dossiers☆19Updated 7 years ago
- DBpedia, which frequently crawls and analyses over 120 Wikipedia language editions has near complete information about (1) which facts ar…☆10Updated 2 years ago
- Examples of bad data, especially from government.☆22Updated 6 months ago
- A lightweight JavaScript client library for the Wikimedia Pageviews API for Wikipedia and various of its sister projects for Node.js and …☆27Updated 4 years ago
- Twitter, quick. Fetch and store tweets on short notice.☆80Updated 8 years ago
- My personally curated list of bash/command-line commands and snippets that are very useful yet I keep on forgetting☆18Updated 2 years ago
- etl pipeline, graphical explorer and general toolbox for investigations with follow the money data☆15Updated last year
- javascript multivariate data visualization☆14Updated 8 years ago
- 📄 A simple wrapper around the Google Docs API and ArchieML for easily converting the contents of a Google Doc into a ArchieML-produced d…☆23Updated last year
- Visualize the evolution of a file tracked by git☆25Updated 6 years ago
- Browsertrix: Containerized High-Fidelity Browser-Based Automated Crawling + Behavior System☆88Updated 4 years ago
- Add editing UI and other power-user features to Datasette.☆12Updated last year
- A simple app to add OAuth-based authentication in front of an S3 bucket-based static website.☆11Updated 2 years ago
- livecoding observable-ish experiment, just an experiment☆22Updated 4 years ago
- Tunable full text search engine in JavaScript that: (1) works natively on web apps like Express.js; (2) easy to customize (via BM25) to s…☆33Updated 6 years ago
- A LevelDB backed URL unshortening microservice written in JavaScript☆31Updated 2 years ago
- Join data in the browser. Supports csv, tsv, psv, *json and dbf.☆11Updated 2 years ago
- Because what if you could just... write graphics sketches? On the web? Like, directly?☆18Updated last week
- See through the world!☆12Updated 9 years ago
- Datasette plugin for inserting and updating data☆20Updated 10 months ago