charman / trawler
Twitter crawler
☆11Updated 10 years ago
Alternatives and similar repositories for trawler:
Users who are interested in trawler are comparing it to the libraries listed below.
- Stream processing in Python of twitter searches using public APIs.☆9Updated 9 years ago
- Scrapy middleware for the autologin☆37Updated 6 years ago
- ... just because nltk is too heavy☆35Updated 14 years ago
- Exporters is an extensible export pipeline library that supports filter, transform and several sources and destinations☆40Updated 11 months ago
- Scrapy extension which writes crawled items to Kafka☆30Updated 6 years ago
- Twitter Futures for Python☆58Updated 10 years ago
- Traptor -- A distributed Twitter feed☆26Updated 2 years ago
- A bot framework for the Facebook Messenger platform, built on asyncio and aiohttp☆30Updated 7 years ago
- Processes data from images which are tagged with the specified Instagram tag.☆13Updated 11 years ago
- Parse domains using the TLD list maintained by publicsuffix.org☆61Updated 4 years ago
- common data interchange format for document processing pipelines that apply natural language processing tools to large streams of text☆35Updated 8 years ago
- framework for making streamcorpus data☆11Updated 8 years ago
- ☆18Updated 8 years ago
- Show summary of a large number of URLs in a Jupyter Notebook☆17Updated 3 years ago
- Security audit tool for Django sites☆14Updated 6 months ago
- Tweet Lake is a command-line interface to the Twitter Streaming API and a big data project that extracts interesting stats from a tweet corpus.☆20Updated 2 years ago
- DeveloperRank is a project that analyzes developers' rank on GitHub, inspired by PageRank.☆12Updated 9 years ago
- Python's missing statistical Swiss Army knife☆15Updated 9 years ago
- The missing datasets manager. Like Homebrew but for datasets. CLI tool to search and discover datasets!☆39Updated 7 years ago
- Small set of utilities to simplify writing Scrapy spiders.☆49Updated 9 years ago
- Python client for Elasticsearch Watcher (deprecated)☆23Updated 6 years ago
- High Level Kafka Scanner☆19Updated 7 years ago
- [UNMAINTAINED] Deploy, run and monitor your Scrapy spiders.☆11Updated 10 years ago
- Small library to fetch files over HTTP and resume their downloads☆13Updated 4 years ago
- Python code examples for working with the Slack API. 2.x and 3.x compatible code.☆13Updated 8 years ago
- Extract the difference between two HTML pages☆32Updated 6 years ago
- Find which links on a web page are pagination links☆29Updated 8 years ago
- https://mimesniff.spec.whatwg.org/ implementation for Python☆13Updated last year
- Python Dictionaries and Lists on Steroids☆23Updated 10 years ago
- Restrict crawl and scraping scope using matchers.☆25Updated 8 years ago