ceteri / slinky
Slinky, a high-performance web crawler / text analytics in Python, Redis, Hadoop, R, Gephi
☆41Updated 14 years ago
Alternatives and similar repositories for slinky:
Users that are interested in slinky are comparing it to the libraries listed below
- A platform for storing large semantic networks on MongoDB☆22Updated 13 years ago
- Convert URL's to a normalized unicode format☆67Updated 6 years ago
- Traptor -- A distributed Twitter feed☆26Updated 2 years ago
- framework for making streamcorpus data☆11Updated 7 years ago
- Stream processing in Python of twitter searches using public APIs.☆9Updated 9 years ago
- ... just because nltk is too heavy☆36Updated 14 years ago
- ☆13Updated 10 years ago
- Python Client for WebHDFS REST API☆43Updated 9 years ago
- Python natural language processing work☆29Updated 15 years ago
- a Simple API for RDF☆29Updated 15 years ago
- A visualizer for multi-dimensional semantic data☆38Updated 13 years ago
- iCQA - Intelligent Community Question Answering Framework☆31Updated 8 years ago
- Lightweight, multilingual natural language processing☆63Updated 11 years ago
- KEA 5.0 (keyphrase extraction software), modified to be an XML-RPC service☆42Updated 13 years ago
- A plugin for Errbot that allows chat users to create "factoids" which the bot can recall on demand.☆10Updated 8 years ago
- A Django based search engine powered by CouchDB, celery and whoosh.☆49Updated 9 years ago
- Python's missing statistical Swiss Army knife☆15Updated 9 years ago
- A copy of the source for Grinstead and Snell's lovely probability book☆14Updated 9 years ago
- A Python version (almost a port) of ProPublica's TableFu☆233Updated 11 years ago
- Turn your IPython console into a cross-database SQL client☆31Updated 8 years ago
- An example of how to use Redis as a task queue within a Flask web application.☆26Updated 8 years ago
- templatemaker is a Python library that can extract data from files with a similar format, like HTML pages.☆63Updated 4 years ago
- High Level Kafka Scanner☆19Updated 7 years ago
- Simple spill-to-disk dictionary☆17Updated 8 years ago
- OpenBlock is a web application and RESTful service that allows users to browse and search their local area for "hyper-local news☆61Updated 3 years ago
- Maybe next gen of Pyzo IDE based on Flexx☆17Updated 7 years ago
- Big GeoSpatial Data Points Visualization Tool☆19Updated 8 years ago
- JSONL and YAML query tool☆12Updated last year
- This project contains the code to translate between Apache Spark and SFrame.☆21Updated 8 years ago
- The first Open Source document analysis platform☆65Updated 3 years ago