brandonrobertz / autoscrape-py
An automated, programming-free web scraper for interactive sites
☆107 · Updated last year
Related projects
Alternatives and complementary repositories for autoscrape-py
- How Quartz used AI to help reporters search the Mauritius Leaks ☆45 · Updated 5 years ago
- A Python scraper for the Facebook Ad Library, using the official Facebook Ad Library API. ☆117 · Updated 5 years ago
- Collector for Facebook's Political Ad API ☆31 · Updated last year
- Module on both the MA Data Journalism and MA Multiplatform and Mobile Journalism at Birmingham City University ☆28 · Updated 7 months ago
- Scrapers for U.S. county court sites. ☆61 · Updated last year
- Data model and processing tools for investigative entity data ☆218 · Updated last week
- ProPublica's collaborative tip-gathering framework. Import and manage CSV, Google Sheets and Screendoor data with ease. ☆99 · Updated last year
- Make it easier to compare and cross-reference the names of companies and people by applying strong normalisation. ☆145 · Updated 9 months ago
- All of our code examples and tutorials ☆65 · Updated 5 years ago
- API client for Aleph, supports bulk entity and document upload. ☆28 · Updated last month
- A database of courts, tests and other experiments ☆62 · Updated 3 months ago
- Extract networks of entities from journalistic reporting ☆47 · Updated last year
- a general list of resources and articles for people interested in getting into data journalism ☆16 · Updated last year
- Examples for getting started using https://case.law ☆64 · Updated 2 years ago
- The data journalism platform with built in training ☆306 · Updated last year
- A curated list of resources for (aspiring) data journalists ☆22 · Updated 4 years ago
- Fuzzy matches and merging of datasets in pandas using csvmatch ☆74 · Updated 4 years ago
- Run Overview on your own system ☆123 · Updated 3 years ago
- framework for scraping legislative/government data ☆85 · Updated 2 months ago
- A toolkit for mapping networks of political and economic influence through diverse types of entities and their relations. Accessible at h… ☆187 · Updated 3 years ago
- ⛏ a library for scraping unreliable pages ☆208 · Updated 3 months ago
- An ICIJ app to conduct data validation and cleaning. ☆19 · Updated 7 months ago
- Public client for consuming content from the Media Cloud Online News Archive & Directory. ☆71 · Updated 2 weeks ago
- Teaching guide for a one-hour hands-on session at an IRE/NICAR conference on using pandas to analyze data. ☆17 · Updated 4 months ago
- The main repository for a collaborative text on data journalism. ☆85 · Updated 6 years ago
- A step-by-step guide to publishing a standalone story from a dataset. ☆29 · Updated 4 months ago
- List of publicly available, free/open source and open access resources for learning and doing data journalism. ☆38 · Updated 8 months ago
- Command-line interface for downloading WARN Act notices of qualified plant closings and mass layoffs from state government websites ☆30 · Updated 2 weeks ago
- Materials to reproduce findings in our story, "Google’s Top Search Result? Surprise! It’s Google" ☆34 · Updated 4 years ago