ceteri / slinky
Slinky, a high-performance web crawler / text analytics in Python, Redis, Hadoop, R, Gephi
☆41 Updated 14 years ago
Alternatives and similar repositories for slinky:
Users who are interested in slinky are comparing it to the libraries listed below:
- Convert URLs to a normalized unicode format ☆67 Updated 7 years ago
- ... just because nltk is too heavy ☆35 Updated 14 years ago
- Python's missing statistical Swiss Army knife ☆15 Updated 9 years ago
- ☆13 Updated 10 years ago
- A visualizer for multi-dimensional semantic data ☆38 Updated 13 years ago
- Traptor -- A distributed Twitter feed ☆26 Updated 2 years ago
- Markdown -> IPython conversion tool ☆15 Updated 10 years ago
- A platform for storing large semantic networks on MongoDB ☆22 Updated 13 years ago
- framework for making streamcorpus data ☆11 Updated 8 years ago
- A very simple way to interact with python AMQPlib. ☆44 Updated 15 years ago
- High Level Kafka Scanner ☆19 Updated 7 years ago
- Where 2.0 Workshop Code: Spatial Analysis of Tweets using Hadoop, Pig, Python & Mechanical Turk. Slides here: http://www.slideshare.net/… ☆134 Updated 15 years ago
- This project contains the code to translate between Apache Spark and SFrame. ☆20 Updated 8 years ago
- templatemaker is a Python library that can extract data from files with a similar format, like HTML pages. ☆63 Updated 4 years ago
- Python Client for WebHDFS REST API ☆43 Updated 9 years ago
- Mirror of Apache MRUnit ☆38 Updated 6 years ago
- Big GeoSpatial Data Points Visualization Tool ☆19 Updated 8 years ago
- Deep learning certificate part 1 ☆10 Updated 2 years ago
- KEA 5.0 (keyphrase extraction software), modified to be an XML-RPC service ☆42 Updated 13 years ago
- collection of modules to build distributed and reliable concurrent systems in Python. ☆205 Updated 11 years ago
- Stream processing in Python of twitter searches using public APIs. ☆9 Updated 9 years ago
- A collection of efficient utilities for a data scientist. ☆41 Updated 9 years ago
- Recommendations Serving Engine using python ☆28 Updated 9 years ago
- IPython Notebook Cookbook for Deployment via Chef ☆41 Updated 8 years ago
- A Python version (almost a port) of ProPublica's TableFu ☆231 Updated 11 years ago
- A copy of the source for Grinstead and Snell's lovely probability book ☆14 Updated 9 years ago
- simple python datastructure wrappings for redis ☆104 Updated 3 years ago
- RGP -- Redis Graph via Python ☆30 Updated 9 years ago
- Simple spill-to-disk dictionary ☆17 Updated 8 years ago
- An analysis of adverse drug event data using Hadoop, R, and Gephi ☆44 Updated 9 years ago