jamesturk / spatula
A modern Python library for writing maintainable web scrapers.
☆246Updated 8 months ago
Alternatives and similar repositories for spatula:
Users that are interested in spatula are comparing it to the libraries listed below
- ⛏ a library for scraping unreliable pages☆210Updated 7 months ago
- The data journalism platform with built in training☆305Updated 3 months ago
- ProPublica's collaborative tip-gathering framework. Import and manage CSV, Google Sheets and Screendoor data with ease.☆100Updated 2 years ago
- A git scraper recording the CDC's Covid Data Tracker numbers on number of vaccinations per state.☆24Updated last year
- A tiny library for Python text normalisation. Useful for ad-hoc text processing.☆148Updated 2 months ago
- Python library and CLI you can use to move relational data from one place to another - DBs/CSV/gsheets/dataframes/...☆37Updated 9 months ago
- Easily download U.S. census maps☆33Updated 2 years ago
- A Python module for accessing the Open States API☆29Updated last year
- a python parser for the .fec file format☆45Updated 2 years ago
- An open-source archive that gathers, saves, shares and analyzes news homepages☆140Updated 2 months ago
- A Python implementation of Lunr.js 🌖☆196Updated last week
- A maximum-strength name parser for record linkage.☆36Updated last month
- Utility library to turn country names into ISO two-letter codes☆66Updated last month
- searching large heterogenous data dumps with Universal Sentence Encoder☆62Updated 3 years ago
- A general purpose tool for text-based crosswalking☆104Updated 11 months ago
- Django app for building dashboards using raw SQL queries☆446Updated last year
- Provide partial dates and retain the date precision through processing☆13Updated 2 years ago
- Platform for journalists to search, analyse, categorise and share unstructured data☆54Updated last month
- Opinionated template for Django projects on Python 3 and PostgreSQL☆24Updated 7 years ago
- A clever brute-force correlator for kinda-messy data☆82Updated last year
- Python package for easy access to EveryPolitician data☆36Updated 8 years ago
- A Python wrapper for the Geocodio geolocation service API☆99Updated 4 months ago
- 📚 Doing all sorts of things, the DataMade way☆93Updated 2 weeks ago
- Opinionated cookiecutter template for creating a new Python library☆193Updated last month
- Parser and standardizer for politician, individual and organization names.☆129Updated 7 years ago
- America's most comprehensive dictionary of campaign finance jargon. A free resource created by and for data journalists.☆17Updated 2 weeks ago
- All of our code examples and tutorials☆66Updated 5 years ago
- Get Census Data from the API for arbitrary areas☆45Updated 6 months ago
- Find your broken links, so users don't.☆66Updated last month
- Today I Learned☆87Updated 2 weeks ago