html-extract / hextLinks
Domain-specific language for extracting structured data from HTML documents
☆54Updated 3 months ago
Alternatives and similar repositories for hext
Users that are interested in hext are comparing it to the libraries listed below
Sorting:
- Browsertrix: Containerized High-Fidelity Browser-Based Automated Crawling + Behavior System☆87Updated 4 years ago
- Browser version of Hyphe (WIP)☆32Updated 8 months ago
- My personally curated list of bash/command-line commands and snippets that are very useful yet I keep on forgetting☆19Updated 3 years ago
- 📑 Read a Google Drive Doc and convert to JSON (via ArchieML)☆23Updated 7 years ago
- Twitter, quick. Fetch and store tweets on short notice.☆79Updated 9 years ago
- Datasette plugin for visualizing data using Vega☆63Updated 3 weeks ago
- A suite of focused and simple tools and activities for journalists, data journalism classrooms and community advocacy groups☆63Updated last month
- Pull out versions of specific files from a gitscraping repo into individual files.☆15Updated 4 years ago
- Extract networks of entities from journalistic reporting☆49Updated 2 years ago
- JavaScript app for displaying annotated network graphs based on data from LittleSis☆105Updated 2 weeks ago
- Computer assisted video/audio transcription☆97Updated 5 years ago
- A data pipeline helper written in node to convert a folder of JS/ArchieML/JSON/YAML/CSV/TSV files into usable data.☆47Updated 2 years ago
- ALPHA ~ A web extension framework for collecting rich, customized browsing history datasets.☆21Updated 4 years ago
- Now included in rigour☆152Updated last month
- A lightweight, standardized library accessing files and datasets, especially tabular ones (CSV, Excel).☆75Updated 2 years ago
- A git scraper recording the CDC's Covid Data Tracker numbers on number of vaccinations per state.☆24Updated 2 years ago
- Export Airtable data to YAML, JSON or SQLite files on disk☆131Updated last year
- A lightweight JavaScript client library for the Wikimedia Pageviews API for Wikipedia and various of its sister projects for Node.js and …☆27Updated 5 years ago
- Markdown auto-formatting, beautification, and cleanup for Atom☆45Updated 2 years ago
- ☆86Updated 3 years ago
- a simple graph shell to explore ideas☆117Updated 5 months ago
- Binary Python bindings for poppler utils for content extraction☆42Updated 4 years ago
- A network clustering library for javascript☆35Updated 2 months ago
- GUI text-based speech and music editor for creating radio/audio stories☆80Updated 3 years ago
- A graphical editor for creating Idyll documents.☆86Updated 2 years ago
- Schemas to convert common fixed-width file formats into CSV using in2csv.☆125Updated 4 years ago
- Uses Google Apps Scripts with Google Docs to provide a document tree in JSON exposed on a GET URL for integration into anything.☆28Updated 7 years ago
- A simple utility for SQL-like joins with Json, GeoJson or dbf data in Node, the browser and on the command line. Also creates join report…☆52Updated 3 years ago
- generative algorithm☆48Updated 9 years ago
- Faster force-directed graph layouts by reusing force approximations☆128Updated 4 years ago