brandonrobertz / autoscrape-pyLinks
An automated, programming-free web scraper for interactive sites
☆111Updated 2 years ago
Alternatives and similar repositories for autoscrape-py
Users that are interested in autoscrape-py are comparing it to the libraries listed below
Sorting:
- Scrapers for U.S. county court sites.☆70Updated 2 years ago
- A Python scraper for the Facebook Ad Library, using the official Facebook Ad Library API.☆118Updated 5 years ago
- ⚡️ Enriches data, adding columns based on lookups to online services☆22Updated last month
- How Quartz used AI to help reporters search the Mauritius Leaks☆47Updated 5 years ago
- 🔎 Finds fuzzy matches between CSV files☆190Updated 4 months ago
- ProPublica's collaborative tip-gathering framework. Import and manage CSV, Google Sheets and Screendoor data with ease.☆100Updated 2 years ago
- Notebooks and files for the Python for Journalists course on Datajournalism.com☆61Updated 5 years ago
- Run Overview on your own system☆125Updated 4 years ago
- All of our code examples and tutorials☆66Updated 6 years ago
- API client for Aleph, supports bulk entity and document upload.☆28Updated 9 months ago
- The data journalism platform with built in training☆308Updated 8 months ago
- Materials to reproduce findings in our story, "Google’s Top Search Result? Surprise! It’s Google"☆34Updated 5 years ago
- etl pipeline, graphical explorer and general toolbox for investigations with follow the money data☆23Updated 3 weeks ago
- searching large heterogenous data dumps with Universal Sentence Encoder☆63Updated 4 years ago
- Public client for consuming content from the Media Cloud Online News Archive & Directory.☆77Updated last week
- Data model and processing tools for investigative entity data☆241Updated this week
- Teaching guide for a one-hour hands-on session at an IRE/NICAR conference on using pandas to analyze data.☆23Updated 5 months ago
- ☆24Updated 9 years ago
- Extract networks of entities from journalistic reporting☆48Updated 2 years ago
- Collector for Facebook's Political Ad API☆31Updated 2 years ago
- 🎓 Practical beginner-level introductions to using different tools and technologies, with a focus on their application in the newsroom☆82Updated 2 years ago
- A helper library full of URL-related heuristics.☆70Updated last month
- ☆254Updated 2 years ago
- Loads raw FEC filings into a database☆23Updated 2 years ago
- ☆13Updated last year
- Lightweight web scraping toolkit for documents and structured data.☆313Updated last year
- a general list of resources and articles for people interested in getting into data journalism☆16Updated 2 years ago
- A tutorial on optical character recognition using tesseract, ImageMagick and other open source tools☆69Updated 6 months ago
- List of newsrooms around the world that are using software engineering, data science, osint, and various tech to elevate reporting.☆100Updated 4 years ago
- A base library for building web scrapers for statistical data, and a helper ontology for (primarily Swedish) statistical data.☆13Updated 5 months ago