dansandland / cassandra-scrapy-pipeline
☆15Updated 9 years ago
Alternatives and similar repositories for cassandra-scrapy-pipeline:
Users that are interested in cassandra-scrapy-pipeline are comparing it to the libraries listed below
- High Level Kafka Scanner☆19Updated 7 years ago
- A javascript shell for elasticsearch☆105Updated 9 years ago
- A platform for real-time streaming search☆103Updated 9 years ago
- a scaleable and efficient crawelr with docker cluster , crawl million pages in 2 hours with a single machine☆96Updated last year
- Scrapy extension which writes crawled items to Kafka☆30Updated 6 years ago
- Code reference from my Qbox blog posts.☆87Updated 9 years ago
- Open source analytics platform powered by Apache Cassandra, Spark, and Kafka☆34Updated 9 years ago
- Additional opennlp mapping type for elasticsearch in order to perform named entity recognition☆136Updated 9 years ago
- Load a CSV (or TSV) file into an Elasticsearch instance☆61Updated 2 years ago
- Few things we've met during our etl project based on spark☆24Updated 7 years ago
- Starter project for building MemSQL Streamliner Pipelines☆32Updated 8 years ago
- Example code for building your own MemSQL Streamliner Pipelines☆23Updated 8 years ago
- Natural Language Processing with Spark's MLlib☆62Updated 7 years ago
- Luigi Plugin for Hubot☆36Updated 8 years ago
- A cookiecutter template for Apache Spark applications written in Scala☆10Updated 6 years ago
- dllib is a distributed deep learning library running on Apache Spark☆32Updated 7 years ago
- ☆54Updated 7 years ago
- A DC/OS time series demo☆62Updated 9 years ago
- Analyze the structure and dynamics of an open source project's developer community, using graph algorithms, etc.☆58Updated 4 years ago
- PySpark for Elastic Search☆55Updated 8 years ago
- Resize image on the fly using flask, zappa, pillow, opencv-python☆18Updated 7 years ago
- ☆19Updated 11 years ago
- Machine Learning over Twitter's stream. Using Apache Spark, Web Server and Lightning Graph server.☆27Updated 8 years ago
- Utilities and examples to asssist in working with PySpark and Cassandra.☆36Updated 10 years ago
- Find which links on a web page are pagination links☆29Updated 8 years ago
- This project contains the code to translate between Apache Spark and SFrame.☆20Updated 8 years ago
- ☆39Updated 8 years ago
- Apache Spark AWS Lambda Executor (SAMBA)☆44Updated 6 years ago
- A custom SimilarityProvider example for Elasticsearch☆36Updated 9 years ago
- Let's perform Twitter sentiment analysis using Python, Docker, Elasticsearch, and Kibana!☆137Updated 4 years ago