A step-by-step guide to writing a web scraper with Python
☆218Jan 13, 2025Updated last year
Alternatives and similar repositories for first-web-scraper
Users that are interested in first-web-scraper are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- NICAR Python mini boot camp☆104Mar 1, 2026Updated 2 months ago
- A repo to support IRE's multi-day Python bootcamp for journalists☆56Dec 7, 2022Updated 3 years ago
- Why not get started early? Keeping track of some ideas to show some easy to use command line tools for beginners.☆46Sep 24, 2015Updated 10 years ago
- NICAR 2017: Dataviz with Python☆20Mar 3, 2017Updated 9 years ago
- Scrape posts from Deadspin☆10Aug 23, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Tools and lessons plans☆19Mar 14, 2017Updated 9 years ago
- NPR Visuals' fork of Quartz' Chartbuilder tool☆23Jul 24, 2018Updated 7 years ago
- A step-by-step guide to creating a simple web application that empowers you to enlist reporters in data entry and refinement.☆13Feb 10, 2024Updated 2 years ago
- Source for census.ire.org, including data processing scripts.☆140Jul 27, 2022Updated 3 years ago
- This semester we will work together to gather, analyze and visualize numbers you need to understand your audience and to tell interactive…☆17Oct 5, 2018Updated 7 years ago
- An introduction to free, automated web scraping with GitHub’s powerful new Actions framework.☆31Aug 19, 2024Updated last year
- How to use the Altair data visualization library to create an array of area charts.☆14Mar 19, 2021Updated 5 years ago
- ☆16May 8, 2017Updated 9 years ago
- HTML output for FBSnapshotTestCase☆10Oct 10, 2016Updated 9 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- ☆11Feb 21, 2023Updated 3 years ago
- Lots and lots of web scrapers☆183Aug 31, 2021Updated 4 years ago
- Greasemonkey script to index visited websites with the YaCy P2P search engine.☆25Apr 1, 2015Updated 11 years ago
- A helper to create web scrapers using scrapy selector in a Model based structure☆57Dec 26, 2022Updated 3 years ago
- Helper methods for generating text that conforms to "The New York Times Manual of Style and Usage"☆27May 13, 2014Updated 12 years ago
- All of our code examples and tutorials☆67Apr 1, 2019Updated 7 years ago
- Various Python scripts to scrape sites that store data about you.☆28Jan 6, 2014Updated 12 years ago
- Journalists need to be better at math and data. A good place to start? Beginning reporting class in journalism school, usually taught to …☆51Feb 18, 2020Updated 6 years ago
- Hunting for daily rarities -- boozicorns, really -- from the Pennsylvania Liquor Control Board's databases.☆11Feb 18, 2017Updated 9 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- A simple script to look for and process all the federal data.json data inventories.☆46Mar 10, 2015Updated 11 years ago
- Watching the SCOTUS☆178Oct 7, 2015Updated 10 years ago
- A Python library that standardizes the names of U.S. states☆25Mar 24, 2015Updated 11 years ago
- Source code for 'Website Scraping with Python' by Gabor Laszlo Hajba☆35Oct 4, 2018Updated 7 years ago
- A course in journalism and data visualization, last taught in 2014.☆57Nov 1, 2018Updated 7 years ago
- Analyze the margin of error in U.S. census data☆19Feb 23, 2023Updated 3 years ago
- ScraperWiki Python library for scraping and saving data; in maintenance mode☆158May 11, 2026Updated last week
- A lightweight Python script that fetches data from a Google spreadsheet, transforms to JSON, then optionally commits a data file to a Git…☆10Apr 1, 2026Updated last month
- ☆24Mar 9, 2016Updated 10 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A mirror of https://git.tecosaur.net/tec/pdftotext.el☆12Jan 4, 2024Updated 2 years ago
- Checklists☆16Sep 22, 2016Updated 9 years ago
- Inspection data and PDFs from the USDA's Animal and Plant Health Inspection Service.☆17Updated this week
- High Level 6502/NES Assembler☆12Mar 30, 2013Updated 13 years ago
- Crowd computing in javascript. Distribute tasks to execute in connected browsers☆19Dec 2, 2014Updated 11 years ago
- NICAR 2019 workshop on using Python and PDFplumber to extract text from PDFs☆12Mar 9, 2019Updated 7 years ago
- Command-line tool for exploring the PAC donor-recipient relationship☆55Dec 18, 2014Updated 11 years ago