cantabular / scraperwiki-python
ScraperWiki Python library for scraping and saving data
☆158 · Updated 2 years ago
Alternatives and similar repositories for scraperwiki-python
Users who are interested in scraperwiki-python are comparing it to the libraries listed below.
- Scrapes sites. Gets news. Eventually events. ☆87 · Updated 9 years ago
- Easily crowdsource the analysis of your documents ☆102 · Updated 7 years ago
- Scrapes public information off of LinkedIn ☆111 · Updated 9 years ago
- A toolkit for mapping networks of political and economic influence through diverse types of entities and their relations. Accessible at h… ☆189 · Updated 4 years ago
- Python code to scrape and collect data from the RSS feeds Facebook uses to augment its Trending Section ☆57 · Updated 6 years ago
- Python library to extract text from PDF, and default to OCR when text extraction fails. ☆62 · Updated 7 years ago
- Easy extraction of keywords and engines from search engine results pages (SERPs). ☆90 · Updated 3 years ago
- Python API for Glassdoor.com ☆81 · Updated 9 years ago
- Send text when a new Craigslist posting matches a given keyword or phrase ☆96 · Updated 10 years ago
- legacy backend for Open States ☆87 · Updated 5 years ago
- Tribe extracts a network from an email mbox and writes it to a graphml file for visualization and analysis. ☆79 · Updated 2 years ago
- framework for scraping legislative/government data ☆88 · Updated last year
- Simple example scripts for Twitter data collection with Tweepy in Python ☆170 · Updated 4 years ago
- Python library and command line tool for converting data from one format to another ☆99 · Updated 5 years ago
- A Python script that parses post titles, self-texts, and comments on reddit and makes word clouds out of the word frequencies. ☆291 · Updated 2 years ago
- A Python tool for collecting tweets in MongoDB using the search API ☆80 · Updated 2 years ago
- ☆36 · Updated last year
- Code + Jupyter notebook for analyzing and visualizing Reddit Data quickly and easily ☆111 · Updated 9 years ago
- Python module for storing Twitter data in a Postgres database and operating on it via SQLAlchemy ☆34 · Updated 10 years ago
- Python workers that collect tweets from the twitter streaming api and track deletions ☆128 · Updated 2 years ago
- ⛏ a library for scraping unreliable pages ☆213 · Updated 3 weeks ago
- Open Knowledge Labs website (and general issue tracker). ☆80 · Updated 7 months ago
- Python scripts for processing XML documents and converting to SQL, CSV, and JSON [UNMAINTAINED] ☆248 · Updated 4 months ago
- A set of utilities to track and mine Twitter streaming API data ☆46 · Updated 11 years ago
- Converts JSON files to CSV (pulling data from nested structures). Useful for Mongo data ☆263 · Updated 4 years ago
- Monitor datasets, get alerts when something happens ☆210 · Updated 6 years ago
- [UNMAINTAINED] Deploy, run and monitor your Scrapy spiders. ☆11 · Updated 10 years ago
- Canadian legislative scrapers ☆35 · Updated last month
- Python package to detect and return RSS / Atom feeds for a given website. The tool supports major blogging platform including Wordpress, … ☆21 · Updated 3 years ago
- A (comprehensive) collection of open source tools used by the data community. ☆52 · Updated 9 years ago