html-extract / hext
Domain-specific language for extracting structured data from HTML documents
☆52Updated 2 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for hext
- experiments in sorting☆25Updated last year
- makes supercuts from youtube searches (alpha)☆12Updated 6 years ago
- Add website scraping abilities to Datasette☆61Updated last year
- 🖼 A minimalistic take on responsive iframes in the spirit of Pym.js.☆26Updated last year
- Add editing UI and other power-user features to Datasette.☆12Updated last year
- Generating text completions based on the Mueller report☆28Updated 5 years ago
- Browsertrix: Containerized High-Fidelity Browser-Based Automated Crawling + Behavior System☆88Updated 3 years ago
- Two-day workshop for SFPC☆30Updated 8 years ago
- Join data in the browser. Supports csv, tsv, psv, *json and dbf.☆11Updated 2 years ago
- etl pipeline, graphical explorer and general toolbox for investigations with follow the money data☆14Updated 10 months ago
- Where things come from in Who's On First.☆21Updated 8 months ago
- The Docker meets Machine Learning Tutorial You've Been Wanting!☆37Updated last year
- 📑 Read a Google Drive Doc and convert to JSON (via ArchieML)☆22Updated 6 years ago
- Simple JSON API for small crowdsourcing apps☆13Updated 6 years ago
- A tool for telling stories with maps.☆25Updated last month
- A node.js interface to the Wordnik API, which lets you get dictionary definitions, random words, pronunciation, and more!☆18Updated 8 years ago
- API endpoint and UI for blockbuilder search page☆20Updated last year
- Datawrapper API v3 (in Node)☆14Updated 3 years ago
- A LevelDB backed URL unshortening microservice written in JavaScript☆31Updated last year
- Browser version of Hyphe (WIP)☆29Updated last month
- Adds a reconciliation API endpoint to Datasette, based on the Reconciliation Service API specification.☆23Updated 9 months ago
- A simple app to add OAuth-based authentication in front of an S3 bucket-based static website.☆11Updated last year
- My personally curated list of bash/command-line commands and snippets that are very useful yet I keep on forgetting☆18Updated 2 years ago
- a work-in-progress guide to web scraping as an artistic and critical practice☆79Updated last year
- Mapping tile server for Datasette, serving tiles from MBTiles packages☆8Updated 2 years ago
- Pull out versions of specific files from a gitscraping repo into individual files.☆13Updated 3 years ago
- basically all words, in a compressed form☆16Updated last year