ceteri / slinky
Slinky, a high-performance web crawler / text analytics in Python, Redis, Hadoop, R, Gephi
☆41Updated 14 years ago
Alternatives and similar repositories for slinky:
Users that are interested in slinky are comparing it to the libraries listed below
- A platform for storing large semantic networks on MongoDB☆22Updated 13 years ago
- High Level Kafka Scanner☆19Updated 7 years ago
- A visualizer for multi-dimensional semantic data☆38Updated 13 years ago
- Convert URL's to a normalized unicode format☆67Updated 7 years ago
- Recommendations Serving Engine using python☆28Updated 9 years ago
- common data interchange format for document processing pipelines that apply natural language processing tools to large streams of text☆35Updated 8 years ago
- framework for making streamcorpus data☆11Updated 8 years ago
- ... just because nltk is too heavy☆35Updated 14 years ago
- Stream processing in Python of twitter searches using public APIs.☆9Updated 9 years ago
- ☆13Updated 10 years ago
- Traptor -- A distributed Twitter feed☆26Updated 2 years ago
- Vizlinc☆14Updated 9 years ago
- This project contains the code to translate between Apache Spark and SFrame.☆20Updated 8 years ago
- A distributed in-memory fabric based on shared-memory blocks and datashape. Any language can operate on the data.☆13Updated 9 years ago
- ***Warning*** Old Apache Flink Graph API: This repository is not in use anymore.☆15Updated 9 years ago
- Lightweight, multilingual natural language processing☆63Updated 11 years ago
- a Simple API for RDF☆29Updated 15 years ago
- Markdown -> IPython conversion tool☆15Updated 10 years ago
- Automated NLP sentiment predictions- batteries included, or use your own data☆18Updated 7 years ago
- Python's missing statistical Swiss Army knife☆15Updated 9 years ago
- Python natural language processing work☆29Updated 15 years ago
- Collects multimedia content shared through social networks.☆19Updated 10 years ago
- Exploration Library in Java☆12Updated last year
- OpenBlock is a web application and RESTful service that allows users to browse and search their local area for "hyper-local news☆61Updated 3 years ago
- A Python library implementing a generic promises facility☆9Updated 5 months ago
- Mirror of Apache MRUnit☆38Updated 6 years ago
- collection of modules to build distributed and reliable concurrent systems in Python.☆205Updated 11 years ago
- code and slides for my PyGotham 2016 talk, "Higher-level Natural Language Processing with textacy"☆15Updated 8 years ago
- Twitter-Kafka Data Pipeline☆16Updated 4 months ago
- Big GeoSpatial Data Points Visualization Tool☆19Updated 8 years ago