ericwhyne / http-ricochet
A simple proxy web service in 19 lines of Python code.
☆23Updated 10 years ago
Alternatives and similar repositories for http-ricochet:
Users that are interested in http-ricochet are comparing it to the libraries listed below
- [UNMAINTAINED] Firefox addon for Scrapely☆5Updated 9 years ago
- Quickly analyze and explore email with advanced analytics and visualization.☆56Updated 3 years ago
- Faceted search engine for domain-specific exploration of the Web☆45Updated 8 years ago
- MITIE: library and tools for information extraction☆29Updated 10 years ago
- Seed acquisition tool to bootstrap focused crawlers☆23Updated 7 years ago
- Exporters is an extensible export pipeline library that supports filter, transform and several sources and destinations☆40Updated 10 months ago
- [UNMAINTAINED] Deploy, run and monitor your Scrapy spiders.☆11Updated 10 years ago
- A component that tries to avoid downloading duplicate content☆27Updated 6 years ago
- Topic modeling web application☆40Updated 9 years ago
- Facet Search interface for MEMEX.☆13Updated 10 years ago
- Using Scrapy to get company profiles from http://crunchbase.com☆31Updated 11 years ago
- Browser add-on and web server to support collection and analysis of web browsing data.☆13Updated 9 years ago
- A Topic Modeling toolbox☆92Updated 8 years ago
- ☆21Updated 10 years ago
- General Architecture for Text Engineering☆49Updated 9 years ago
- An Exploration into Graph Databases☆28Updated 9 years ago
- Slides to learn a little natural language processing (NLP) with Python. Written in reST with S5/Docutils.☆28Updated 12 years ago
- Hadoop MapReduce over Hive based implementation of attributed network pattern matching.☆40Updated 10 years ago
- Hadoop jobs for WikiReverse project. Parses Common Crawl data for links to Wikipedia articles.☆38Updated 6 years ago
- A space for code and projects around analysing news content☆23Updated 7 years ago
- Visualization and summarization of a collection of documents.☆20Updated 2 years ago
- Scrapes sites. Gets news. Eventually events.☆85Updated 9 years ago
- What lies in your email data?☆43Updated 10 years ago
- Online social media research and computational journalism project by the Journalism and Media Studies Centre at the University of Hong Ko…☆45Updated 12 years ago
- ScraperWiki Python library for scraping and saving data☆159Updated 2 years ago
- ☆25Updated 9 years ago
- Site Hound (previously THH) is a Domain Discovery Tool☆23Updated 3 years ago
- Discover repositories you should be following on Github.☆31Updated 12 years ago
- A bot that offers sympathy to people who have suffered paper cuts.☆17Updated 12 years ago
- Automatic, zero-config web scraping -- written in Java, has no dependency on Java EE or app servers, and the web scraper has a restful/JS…☆155Updated 7 years ago