cantabular / scraperwiki-pythonLinks
ScraperWiki Python library for scraping and saving data
☆158Updated 2 years ago
Alternatives and similar repositories for scraperwiki-python
Users that are interested in scraperwiki-python are comparing it to the libraries listed below
Sorting:
- Scrapes sites. Gets news. Eventually events.☆85Updated 9 years ago
- Scrapes public information off of LinkedIn☆112Updated 9 years ago
- framework for scraping legislative/government data☆88Updated last year
- Python library with common functionality for writing web scrapers☆102Updated 10 years ago
- Python library and command line tool for converting data from one format to another☆99Updated 5 years ago
- Easy extraction of keywords and engines from search engine results pages (SERPs).☆92Updated 3 weeks ago
- legacy backend for Open States☆87Updated 5 years ago
- Python library to extract text from PDF, and default to OCR when text extraction fails.☆62Updated 8 years ago
- Send text when a new Craigslist posting matches a given keyword or phrase☆96Updated 10 years ago
- Easily crowdsource the analysis of your documents☆102Updated 8 years ago
- Crawl and scrape Yelp's restaurant data for every zip code in the United States (or a specified zipcode). Yelp Crawler.☆56Updated 8 years ago
- ⛏ a library for scraping unreliable pages☆212Updated last month
- A toolkit for mapping networks of political and economic influence through diverse types of entities and their relations. Accessible at h…☆192Updated 4 years ago
- Python code to scrape and collect data from the RSS feeds Facebook uses to augment its Trending Section☆57Updated 7 years ago
- ☆36Updated 2 years ago
- Tribe extracts a network from an email mbox and writes it to a graphml file for visualization and analysis.☆79Updated 2 years ago
- Scrapers for US municipal governments.☆104Updated last year
- Viewers for statistics and dashboarding of Domain Search Engine data☆125Updated 9 years ago
- Python package to detect and return RSS / Atom feeds for a given website. The tool supports major blogging platform including Wordpress, …☆21Updated 4 years ago
- Python workers that collect tweets from the twitter streaming api and track deletions☆128Updated 2 years ago
- Simple example scripts for Twitter data collection with Tweepy in Python☆170Updated 5 years ago
- Chunks of Python I've found useful.☆63Updated 5 years ago
- Python scripts for processing XML documents and converting to SQL, CSV, and JSON [UNMAINTAINED]☆248Updated 6 months ago
- Tools for tracking stories on news homepages☆48Updated 6 years ago
- Importer for US Spending data☆34Updated 11 years ago
- Next-gen web application for public finance data warehouses, formerly OpenSpending☆57Updated 3 years ago
- A Python script that parses post titles, self-texts, and comments on reddit and makes word clouds out of the word frequencies.☆294Updated 2 years ago
- A python tool for collecting tweets in mongoDB using the search API☆80Updated 2 years ago
- A data processing pipeline that schedules and runs content harvesters, normalizes their data, and outputs that normalized data to a varie…☆41Updated 9 years ago
- Converts JSON files to CSV (pulling data from nested structures). Useful for Mongo data☆264Updated 4 years ago