cantabular/scraperwiki-python

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/cantabular/scraperwiki-python)

cantabular / scraperwiki-python

ScraperWiki Python library for scraping and saving data; in maintenance mode

☆158

Alternatives and similar repositories for scraperwiki-python

Users that are interested in scraperwiki-python are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

scraperwiki / code-scraper-in-browser-tool
View on GitHub
Just like on ScraperWiki Classic; now a part of QuickCode.
☆38Aug 12, 2016Updated 9 years ago
TalkAboutLocal / local-news-engine
View on GitHub
☆14Mar 9, 2017Updated 9 years ago
onyxfish / fakerwiki
View on GitHub
FakerWiki is a library for local testing of Python ScraperWiki scripts.
☆15Sep 8, 2011Updated 14 years ago
cantabular / xypath
View on GitHub
Navigating around a grid of cells like XPath for spreadsheets; supports Python 3.5+
☆49Feb 1, 2023Updated 3 years ago
jplusplus / broken-promises-client
View on GitHub
What should a journalist investigate today, according to what was promised in the past?
☆17Dec 9, 2013Updated 12 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
scraperwiki / wikipedia-infobox-tool
View on GitHub
Extracts data from the infoboxes of Wikipedia articles.
☆10Aug 29, 2013Updated 12 years ago
dhmontgomery / r-data-for-beginners
View on GitHub
A tutorial of the basics of data analysis and visualization in the R programming language, for complete beginners.
☆15Sep 28, 2018Updated 7 years ago
trickvi / datapackage
View on GitHub
Manage and load dataprotocols.org Data Packages
☆27Sep 17, 2015Updated 10 years ago
cantabular / custard
View on GitHub
A platform for tools that do stuff with data
☆56Feb 14, 2019Updated 7 years ago
dkmfbk / pikes
View on GitHub
Pikes is a Knowledge Extraction Suite
☆23Nov 14, 2023Updated 2 years ago
ciudadanointeligente / bill-it
View on GitHub
Document management system. Based on bill tracking needs. Simple model for stages, priorities, authors, content (abstract, tags), releate…
☆20Sep 16, 2014Updated 11 years ago
smach / NICAR2018IntroToR
View on GitHub
Files for my Introduction to R and RStudio Hands-On Session at NICAR 2018 on Saturday March 10 at 9 am
☆10Mar 10, 2018Updated 8 years ago
datopian / datapipes
View on GitHub
Data Pipes for CSV
☆115Jan 24, 2023Updated 3 years ago
nexacenter / public-contracts
View on GitHub
☆10Apr 20, 2016Updated 10 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
mysociety / popit
View on GitHub
DEPRECATED - Development on PopIt has stopped and it is no longer being maintained
☆76Aug 24, 2017Updated 8 years ago
okfn / handbook
View on GitHub
Guides and introductions for participating in Labs and some of its projects.
☆172Sep 27, 2016Updated 9 years ago
kaflesudip / grabfeed
View on GitHub
Python package to detect and return RSS / Atom feeds for a given website. The tool supports major blogging platform including Wordpress, …
☆21Oct 21, 2021Updated 4 years ago
peldszus / arg-microtexts-multilayer
View on GitHub
Argumentative microtexts annotated with RST, SDRT and argumentation structure
☆12Jun 19, 2016Updated 10 years ago
t-davidson / webscraping-tutorials
View on GitHub
Tutorials for web scraping and crawling
☆11Mar 29, 2020Updated 6 years ago
pudo-attic / scrapekit
View on GitHub
Python library with common functionality for writing web scrapers
☆102Jul 6, 2015Updated 11 years ago
associatedpress / geomancer
View on GitHub
Open source tool to help journalists easily mash up data based on shared geography.
☆59Jun 5, 2015Updated 11 years ago
ldegroot / freedive
View on GitHub
Friendly data search via Google Docs API
☆26Jun 26, 2013Updated 13 years ago
sunlightlabs / cluster-explorer
View on GitHub
Tool for exploring clusters of similar documents in a text corpus.
☆16Apr 15, 2014Updated 12 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
mozilla / spade
View on GitHub
Automated scraping markup+CSS from a list of relevant URLs, using a variety of user-agent strings. Provides reporting on usage of CSS pro…
☆22Aug 29, 2013Updated 12 years ago
anthonydb / python-get-started
View on GitHub
Snippets to jump start learning Python
☆53Sep 12, 2019Updated 6 years ago
paulbradshaw / MED7369-Specialist-Investigative-Journalism
View on GitHub
Module on both the MA Data Journalism and MA Multiplatform and Mobile Journalism at Birmingham City University
☆30Jun 16, 2026Updated last month
vietansegan / sits
View on GitHub
Speaker Identity for Topic Segmentation (SITS)
☆13Dec 14, 2014Updated 11 years ago
openknowledge-archive / activityapi
View on GitHub
[Deprecated] An API which aggregates online activity of the Open Knowledge
☆18Dec 13, 2021Updated 4 years ago
kjam / python-web-scraping-tutorial
View on GitHub
A Python-based web and data scraping tutorial
☆215Oct 17, 2020Updated 5 years ago
datadotworld / foia-app
View on GitHub
R Shiny App created to predict the success rate of Freedom of Information Act requests.
☆16Dec 11, 2017Updated 8 years ago
geoparser / geolocator-3.0
View on GitHub
☆12Oct 25, 2015Updated 10 years ago
RobertHasson / ouunit
View on GitHub
LaTeX style files for creating documents in the Open University unit style
☆11Mar 30, 2026Updated 3 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
jensfinnas / robot-writer
View on GitHub
☆19Nov 24, 2015Updated 10 years ago
nvkelso / map-label-style-manual
View on GitHub
Abbreviations, nicknames, foreign terms, translations, transliterations, diacritical marks, suggested placements, and more
☆24Jul 18, 2012Updated 14 years ago
rOpenGov / ropengov.github.io
View on GitHub
rOpenGov
☆17Apr 16, 2021Updated 5 years ago
opennorth / represent-boundaries
View on GitHub
A web API to geographic boundaries loaded from shapefiles, packaged as a Django app
☆27Jun 26, 2024Updated 2 years ago
garysieling / chrome-scraper
View on GitHub
Chrome Based Scraper
☆22Feb 7, 2013Updated 13 years ago
EvictionLab / eviction-lab-etl
View on GitHub
Data processing for Eviction Lab map and rankings tools
☆12Feb 21, 2025Updated last year
jamesturk / scrapelib
View on GitHub
⛏ a library for scraping unreliable pages
☆212Updated this week