jamesturk / spatulaLinks
A modern Python library for writing maintainable web scrapers.
☆247Updated 4 months ago
Alternatives and similar repositories for spatula
Users that are interested in spatula are comparing it to the libraries listed below
Sorting:
- ⛏ a library for scraping unreliable pages☆211Updated last month
- A tiny library for Python text normalisation. Useful for ad-hoc text processing.☆155Updated last month
- Utility library to turn country names into ISO two-letter codes☆71Updated 2 months ago
- Python CLI tool and library for diffing CSV and JSON files☆325Updated last year
- ProPublica's collaborative tip-gathering framework. Import and manage CSV, Google Sheets and Screendoor data with ease.☆100Updated 2 years ago
- Find your broken links, so users don't.☆66Updated last month
- A Python module for accessing the Open States API☆30Updated 2 years ago
- Clean up all those Pythons crawling around your computer☆15Updated 2 years ago
- General programming utilities from Pew Research Center☆70Updated 3 years ago
- A Datasette plugin providing an MLOps platform to train, eval and predict machine learning models☆17Updated 2 weeks ago
- Tools for generating CSV and other flat versions of the structured data☆108Updated 5 months ago
- Opinionated template for Django projects on Python 3 and PostgreSQL☆24Updated 8 years ago
- Write Datasette canned queries as plain SQL files☆14Updated 3 years ago
- Provide partial dates and retain the date precision through processing☆14Updated 2 months ago
- Datasette plugin that shows a map for any data with latitude/longitude columns☆98Updated last year
- Datasette plugin to create interactive dashboards☆152Updated 2 weeks ago
- a python parser for the .fec file format☆46Updated 5 months ago
- Add website scraping abilities to Datasette☆64Updated 2 years ago
- The data journalism platform with built in training☆309Updated 10 months ago
- Render a map for any query with a geometry column☆28Updated last year
- Effortless conversion between data formats like JSON, XML and CSV☆120Updated 3 years ago
- Demonstration project for building out a data news rig.☆10Updated 3 years ago
- A Python implementation of Lunr.js 🌖☆200Updated 7 months ago
- A git scraper recording the CDC's Covid Data Tracker numbers on number of vaccinations per state.☆24Updated 2 years ago
- Python library for reading and writing tabular data via streams.☆238Updated 4 years ago
- Datasette of earning call transcripts from the Motley Fool☆15Updated 2 years ago
- A Python wrapper for the Geocodio geolocation service API☆102Updated 4 months ago
- Scrapers for disaster data - writes to https://github.com/simonw/disaster-data☆50Updated last year
- A clever brute-force correlator for kinda-messy data☆82Updated last year
- Pandas-based utility to calculate weighted means, medians, distributions, standard deviations, and more.☆112Updated 11 months ago