Alpha-quality parser for Office of Government ethics form 278 public financial disclosure PDFs
☆28Feb 11, 2022Updated 4 years ago
Alternatives and similar repositories for pfd-parser
Users that are interested in pfd-parser are comparing it to the libraries listed below
Sorting:
- Machine assisted dossiers☆19Oct 12, 2017Updated 8 years ago
- Data and scripts relating to the publishing of the House expenditure reports, and hopefully the Senate's in future.☆25Dec 15, 2020Updated 5 years ago
- A small repo of notes and scripts for collecting data on U.S. deadly force police incidents☆10Aug 9, 2015Updated 10 years ago
- Tool to OCR PDFs using Google Cloud Vision☆42Dec 7, 2022Updated 3 years ago
- A sample Sinatra example for use in training at NICAR 2015☆12Mar 10, 2016Updated 10 years ago
- pneumatic is a bulk-upload library for DocumentCloud.☆22Sep 6, 2020Updated 5 years ago
- Scraper for financial disclosure reports from the US Senate☆25May 7, 2024Updated last year
- Scripts to download the U.S. Department of Justice's National Caseload Data and load it into Amazon Athena for querying☆15May 22, 2023Updated 2 years ago
- archive NYPD crime data PDFs☆14Dec 12, 2017Updated 8 years ago
- Files for my Introduction to R and RStudio Hands-On Session at NICAR 2018 on Saturday March 10 at 9 am☆10Mar 10, 2018Updated 8 years ago
- Analysis for "In Appalachia and the Mississippi Delta, Millions Face Long Drives to Stroke Care"☆14May 4, 2021Updated 4 years ago
- Data on 268 New York City traffic deaths in 2014.☆10Feb 19, 2015Updated 11 years ago
- AI agent for enhancing datasets with information from the internet☆21Nov 6, 2025Updated 4 months ago
- ☆20Apr 27, 2017Updated 8 years ago
- US election metadata, packaged as python!☆10Mar 16, 2022Updated 4 years ago
- CFPB's streaming batch geocoder☆36Sep 1, 2016Updated 9 years ago
- Tools for analyzing the Hillary Clinton emails☆13Apr 24, 2016Updated 9 years ago
- Course materials for SMPA3193, Building Systems for Reporting☆29Apr 25, 2017Updated 8 years ago
- Friendly Slack bot for looking up cases☆21Dec 19, 2017Updated 8 years ago
- ☆13Feb 8, 2024Updated 2 years ago
- A financial disclosure data extraction tool.☆21Aug 2, 2023Updated 2 years ago
- ☆12Mar 8, 2024Updated 2 years ago
- Low latency high throughput GDAX orderbook analysis engine and trading bot☆13Mar 24, 2018Updated 7 years ago
- A Ruby parser for electronic candidate, PAC and party campaign filings from the Federal Election Commission.☆15Feb 3, 2024Updated 2 years ago
- Notes for my talk "Exploring the Radio Spectrum for News"☆13Mar 6, 2020Updated 6 years ago
- ☆16Oct 8, 2025Updated 5 months ago
- Teaching guide for a one-hour hands-on session at an IRE/NICAR conference on scraping web data using Python.☆28Feb 27, 2026Updated 3 weeks ago
- ☆16Nov 17, 2025Updated 4 months ago
- Genderswaps your view of the web. (Chrome extension)☆78Feb 2, 2015Updated 11 years ago
- Create and manage Shlink short links from WordPress☆16Apr 4, 2025Updated 11 months ago
- A collection of lists of forms maintained by local, state and federal policing organizations. If you have a form name, you have a FOIA re…☆18Feb 17, 2026Updated last month
- A toolkit to debug and visualize local AWS step functions☆14Oct 3, 2023Updated 2 years ago
- CLI for parsing FEC files, for federal campaign finance pipelines☆23Mar 9, 2026Updated last week
- Code for extracting data from a large number of PDFs, particularly FCC political ad documents☆15Oct 26, 2017Updated 8 years ago
- ICE detention dashboard☆21Updated this week
- ☆29Jan 18, 2026Updated 2 months ago
- NICAR Python mini boot camp☆104Mar 1, 2026Updated 2 weeks ago
- Workbook to teach the concept of risk ratios for data journalism applications☆33Apr 15, 2022Updated 3 years ago
- Fork of dump1090-stream-parser. Takes SBS output from `dump1090` and puts it into a database.☆13Apr 16, 2019Updated 6 years ago