html-extract / hext
Domain-specific language for extracting structured data from HTML documents
☆53 Updated 3 months ago
Alternatives and similar repositories for hext
Users that are interested in hext are comparing it to the libraries listed below
- My personally curated list of bash/command-line commands and snippets that are very useful yet I keep on forgetting ☆18 Updated 3 years ago
- Twitter, quick. Fetch and store tweets on short notice. ☆79 Updated 8 years ago
- A suite of focused and simple tools and activities for journalists, data journalism classrooms and community advocacy groups ☆63 Updated last year
- 📑 Read a Google Drive Doc and convert to JSON (via ArchieML) ☆22 Updated 6 years ago
- Browsertrix: Containerized High-Fidelity Browser-Based Automated Crawling + Behavior System ☆87 Updated 4 years ago
- Now included in rigour ☆151 Updated 3 weeks ago
- a simple graph shell to explore ideas ☆115 Updated last month
- Browser version of Hyphe (WIP) ☆31 Updated 3 months ago
- Turn spreadsheet data into a structured, dynamic API. ☆110 Updated 3 months ago
- Pre-render Observable notebooks for automation ☆61 Updated 3 years ago
- a work-in-progress guide to web scraping as an artistic and critical practice ☆84 Updated 2 years ago
- experiments in sorting ☆27 Updated 2 years ago
- Machine learning model to recommend related content ☆19 Updated last year
- Easily download U.S. census maps ☆33 Updated 2 years ago
- generate rules from lists of words ☆16 Updated 4 years ago
- framework to orchestrate the download and analysis of media ☆100 Updated 2 years ago
- A library for accessing a spreadsheet as a native Python object suitable for templating. ☆226 Updated 7 years ago
- A visual timeline authoring tool that extracts temporal information from freeform text ☆65 Updated 2 years ago
- Computer assisted video/audio transcription ☆97 Updated 5 years ago
- A lightweight, standardized library for accessing files and datasets, especially tabular ones (CSV, Excel). ☆73 Updated 2 years ago
- An automated, programming-free web scraper for interactive sites ☆111 Updated 2 years ago
- A simple utility for SQL-like joins with JSON, GeoJSON or dbf data in Node, the browser and on the command line. Also creates join report… ☆52 Updated 2 years ago
- framework for scraping legislative/government data ☆88 Updated 11 months ago
- An open-source archive that gathers, saves, shares and analyzes news homepages ☆144 Updated 3 weeks ago
- 📄 A simple wrapper around the Google Docs API and ArchieML for easily converting the contents of a Google Doc into an ArchieML-produced d… ☆23 Updated 2 years ago
- NW.js-based OS X desktop application that, given a video/audio file, returns a transcription using the IBM Watson Speech to Text API ☆41 Updated 8 years ago
- A data pipeline helper written in Node to convert a folder of JS/ArchieML/JSON/YAML/CSV/TSV files into usable data. ☆47 Updated last year
- JavaScript app for displaying annotated network graphs based on data from LittleSis ☆102 Updated 3 weeks ago
- A helper library full of URL-related heuristics. ☆70 Updated 2 months ago
- A lightweight JavaScript client library for the Wikimedia Pageviews API for Wikipedia and various of its sister projects for Node.js and … ☆27 Updated 4 years ago