jamesturk / spatula
A modern Python library for writing maintainable web scrapers.
☆249 Updated last week
Alternatives and similar repositories for spatula
Users who are interested in spatula are comparing it to the libraries listed below.
- ⛏ a library for scraping unreliable pages ☆211 Updated last week
- A tiny library for Python text normalisation. Useful for ad-hoc text processing. ☆152 Updated 5 months ago
- A Python module for accessing the Open States API ☆29 Updated last year
- A general purpose tool for text-based crosswalking ☆107 Updated last year
- Find your broken links, so users don't. ☆66 Updated last month
- ProPublica's collaborative tip-gathering framework. Import and manage CSV, Google Sheets and Screendoor data with ease. ☆100 Updated 2 years ago
- The data journalism platform with built in training ☆306 Updated 6 months ago
- Python library and CLI you can use to move relational data from one place to another - DBs/CSV/gsheets/dataframes/... ☆37 Updated last year
- a python parser for the .fec file format ☆45 Updated last month
- A helper library full of URL-related heuristics. ☆69 Updated 2 weeks ago
- General programming utilities from Pew Research Center ☆70 Updated 3 years ago
- Group thousands of similar spreadsheet or database text entries in seconds ☆156 Updated 2 years ago
- Get Census Data from the API for arbitrary areas ☆46 Updated 2 months ago
- Pandas-based utility to calculate weighted means, medians, distributions, standard deviations, and more. ☆111 Updated 7 months ago
- A maximum-strength name parser for record linkage. ☆37 Updated last week
- Datasette plugin that shows a map for any data with latitude/longitude columns ☆96 Updated 10 months ago
- A Python wrapper for the Geocodio geolocation service API ☆102 Updated last week
- A clever brute-force correlator for kinda-messy data ☆82 Updated last year
- framework for scraping legislative/government data ☆85 Updated 9 months ago
- A Python implementation of Lunr.js 🌖 ☆197 Updated 3 months ago
- Guess gender from first name in Python 2 and 3 ☆134 Updated last month
- Core library for the datakit CLI framework. ☆55 Updated 2 years ago
- Easily download U.S. census maps ☆33 Updated 2 years ago
- A new python implementation of an old classic ☆14 Updated 3 years ago
- Provide partial dates and retain the date precision through processing ☆13 Updated 2 years ago
- Clean up all those Pythons crawling around your computer ☆15 Updated 2 years ago
- An open-source archive that gathers, saves, shares and analyzes news homepages ☆139 Updated this week
- A git scraper recording the CDC's Covid Data Tracker numbers on number of vaccinations per state. ☆24 Updated last year
- A library for exchanging data between Python and JavaScript ☆143 Updated 8 months ago
- legacy backend for Open States ☆87 Updated 5 years ago