jamesturk / spatulaLinks
A modern Python library for writing maintainable web scrapers.
☆249Updated 2 months ago
Alternatives and similar repositories for spatula
Users that are interested in spatula are comparing it to the libraries listed below
Sorting:
- ⛏ a library for scraping unreliable pages☆212Updated 2 weeks ago
- A tiny library for Python text normalisation. Useful for ad-hoc text processing.☆157Updated 4 months ago
- Python CLI tool and library for diffing CSV and JSON files☆328Updated last year
- Utility library to turn country names into ISO two-letter codes☆71Updated 5 months ago
- ProPublica's collaborative tip-gathering framework. Import and manage CSV, Google Sheets and Screendoor data with ease.☆100Updated 3 years ago
- Find your broken links, so users don't.☆66Updated last month
- Clean up all those Pythons crawling around your computer☆15Updated 2 years ago
- A git scraper recording the CDC's Covid Data Tracker numbers on number of vaccinations per state.☆24Updated 2 years ago
- A clever brute-force correlator for kinda-messy data☆83Updated last year
- Opinionated template for Django projects on Python 3 and PostgreSQL☆24Updated 8 years ago
- The data journalism platform with built in training☆311Updated last year
- Provide partial dates and retain the date precision through processing☆14Updated 5 months ago
- General programming utilities from Pew Research Center☆70Updated 3 years ago
- A Python implementation of Lunr.js 🌖☆203Updated 10 months ago
- Guess gender from first name in Python 2 and 3☆139Updated 8 months ago
- Tools for generating CSV and other flat versions of the structured data☆109Updated last month
- Write Datasette canned queries as plain SQL files☆14Updated 3 years ago
- Effortless conversion between data formats like JSON, XML and CSV☆119Updated 3 years ago
- a python parser for the .fec file format☆46Updated 8 months ago
- Datasette plugin to create interactive dashboards☆171Updated this week
- Pandas-based utility to calculate weighted means, medians, distributions, standard deviations, and more.☆113Updated last year
- Datasette plugin that shows a map for any data with latitude/longitude columns☆100Updated 2 months ago
- A Datasette plugin providing an MLOps platform to train, eval and predict machine learning models☆17Updated last month
- A new python implementation of an old classic☆14Updated 3 years ago
- A Python wrapper for the Geocodio geolocation service API☆102Updated 7 months ago
- Add website scraping abilities to Datasette☆66Updated 2 years ago
- Group thousands of similar spreadsheet or database text entries in seconds☆158Updated 2 years ago
- A simple command line interface to the datamade/dedupe library.☆43Updated 3 years ago
- Simple command line tool for quickly analysing the structure of an arbitrary XML file☆35Updated 2 years ago
- Scripts to make specific datasets cleaner and more convenient☆42Updated 3 years ago