framework for scraping legislative/government data
☆90Nov 17, 2025Updated 4 months ago
Alternatives and similar repositories for pupa
Users that are interested in pupa are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Scrapers for US municipal governments.☆108Updated this week
- Canadian legislative scrapers☆35Updated this week
- OpenStates.org☆70Jan 30, 2026Updated 2 months ago
- Document management system. Based on bill tracking needs. Simple model for stages, priorities, authors, content (abstract, tags), releate…☆19Sep 16, 2014Updated 11 years ago
- source for Open States scrapers☆897Updated this week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- legacy backend for Open States☆87Jan 31, 2020Updated 6 years ago
- A glossary for the United States.☆42Apr 30, 2015Updated 10 years ago
- pre-2019 OpenStates.org☆12Dec 20, 2018Updated 7 years ago
- Curated information on all state legislators & governors.☆146Updated this week
- Archive of political ad data from the Federal Communications Commission☆20Oct 25, 2017Updated 8 years ago
- common data interchange format for document processing pipelines that apply natural language processing tools to large streams of text☆35Sep 30, 2016Updated 9 years ago
- A sample app that combines geolocated entities from Freebase with Maps API☆43Mar 20, 2014Updated 12 years ago
- Open Civic Data Division IDs definition & canonical repository☆180Updated this week
- Archived Project - Please reference 3rd party forks listed in README☆54Jul 2, 2019Updated 6 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A schema loader for the Texas Ethics Commission☆12Mar 1, 2026Updated last month
- Bigquery user-defined functions (UDFs) for pathfinding. Find shortest path through a network of Bigquery geography☆11Jan 18, 2021Updated 5 years ago
- A docker container, with ffmpeg that supports scale_cuda among other things☆14Jan 27, 2025Updated last year
- The data and analysis referenced in the Dec. 7, 2015 BuzzFeed News article, "Here's What We Know About Race And Killings By Police." htt…☆14Dec 8, 2015Updated 10 years ago
- How to use govinfo sitemaps to crawl for content and metadata☆57Mar 6, 2024Updated 2 years ago
- Discussion Summarization is the process of condensing a text document which is a collection of discussion threads, using CBS (Cluster Bas…☆12Apr 10, 2014Updated 12 years ago
- ⛏ a library for scraping unreliable pages☆212Apr 3, 2026Updated last week
- KL3M training data collection and preprocessing☆21Apr 14, 2025Updated last year
- ☆16Sep 13, 2016Updated 9 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆23Mar 7, 2015Updated 11 years ago
- A Python helper library to convert between ISO 639 two- and three-letter codes.☆11Nov 13, 2024Updated last year
- Quickly open Python modules in your text editor☆45Apr 6, 2026Updated last week
- an intro to Python, with a focus on hard-to-google "tribal knowledge"☆16Mar 2, 2017Updated 9 years ago
- A Los Angeles Times analysis of arrests of the homeless by the LAPD☆53Mar 19, 2021Updated 5 years ago
- Docker container to provide Apache Tika RESTful API☆41Feb 12, 2016Updated 10 years ago
- This project deals with hierarchical classification of web pages based on dmoz dataset.☆14Apr 10, 2014Updated 12 years ago
- Copy favorite and commonly used RDF schemas/ontologies to a safe place☆37May 24, 2019Updated 6 years ago
- ☆16Feb 12, 2017Updated 9 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Parser for U.S. federal regulations and other regulatory information☆43Mar 27, 2023Updated 3 years ago
- Scraps all the open chats, and their last n messages, and saves them in a csv file☆38Aug 2, 2020Updated 5 years ago
- Automate your FOIAs. The real, production version.☆51Sep 21, 2017Updated 8 years ago
- Use 3rd-party validators (e.g. from WTForms and colander) with marshmallow☆23May 10, 2021Updated 4 years ago
- Comparing different zip code datasets☆10Feb 18, 2015Updated 11 years ago
- pneumatic is a bulk-upload library for DocumentCloud.☆22Sep 6, 2020Updated 5 years ago
- a list of websites using documentation to be awesome☆22Apr 10, 2017Updated 9 years ago