ericwhyne / http-ricochet
A simple proxy web service in 19 lines of Python code.
☆23 · Updated 10 years ago
Related projects
Alternatives and complementary repositories for http-ricochet
- MITIE: library and tools for information extraction ☆29 · Updated 9 years ago
- [UNMAINTAINED] Firefox addon for Scrapely ☆5 · Updated 8 years ago
- Quickly analyze and explore email with advanced analytics and visualization. ☆55 · Updated 3 years ago
- Slides to learn a little natural language processing (NLP) with Python. Written in reST with S5/Docutils. ☆28 · Updated 12 years ago
- Topic modeling web application ☆39 · Updated 9 years ago
- ☆42 · Updated 8 years ago
- ☆21 · Updated 9 years ago
- Faceted search engine for domain-specific exploration of the Web ☆45 · Updated 7 years ago
- [UNMAINTAINED] Deploy, run and monitor your Scrapy spiders. ☆11 · Updated 9 years ago
- Semanticizest: dump parser and client ☆20 · Updated 8 years ago
- ☆41 · Updated 4 years ago
- Traptor -- A distributed Twitter feed ☆26 · Updated 2 years ago
- A Topic Modeling toolbox ☆93 · Updated 8 years ago
- Hadoop MapReduce over Hive based implementation of attributed network pattern matching. ☆40 · Updated 10 years ago
- Stanford CoreNLP NER addon for Apache Tika's NamedEntityParser ☆13 · Updated 2 years ago
- General Architecture for Text Engineering ☆45 · Updated 8 years ago
- Hadoop jobs for WikiReverse project. Parses Common Crawl data for links to Wikipedia articles. ☆38 · Updated 6 years ago
- A POC at replicating Facebook Graph Search with Cypher and Neo4j ☆102 · Updated 11 years ago
- Pipeline for distributed Natural Language Processing, made in Python ☆65 · Updated 7 years ago
- Facet Search interface for MEMEX. ☆13 · Updated 9 years ago
- Reduction is a Python script that automatically summarizes a text by extracting the sentences deemed most important. ☆54 · Updated 9 years ago
- Extracts synonyms for various terms, exploiting the redirects between terms in Wikipedia ☆12 · Updated 6 years ago
- ☆25 · Updated 8 years ago
- A space for code and projects around analysing news content ☆23 · Updated 6 years ago
- A component that tries to avoid downloading duplicate content ☆27 · Updated 6 years ago
- Refinery - A locally deployable open-source web platform for analysis of large document collections ☆102 · Updated 8 years ago
- ScraperWiki Python library for scraping and saving data ☆159 · Updated last year