ActivisionGameScience / python-kafka-benchmark
☆14Updated 8 years ago
Alternatives and similar repositories for python-kafka-benchmark:
Users that are interested in python-kafka-benchmark are comparing it to the libraries listed below
- Scriptable scheduler for periodical Hadoop workflows☆22Updated 7 years ago
- This is an introduction of Apache Spark DataFrames.☆41Updated 9 years ago
- An Apache Spark-shell backend for IPython☆105Updated 3 years ago
- ☆9Updated 9 years ago
- ☆110Updated 7 years ago
- Open source analytics platform powered by Apache Cassandra, Spark, and Kafka☆34Updated 9 years ago
- Starter project for building MemSQL Streamliner Pipelines☆32Updated 7 years ago
- Examples for Fast Data Processing with Spark☆59Updated 11 years ago
- Luigi Plugin for Hubot☆35Updated 8 years ago
- Training materials for Strata, AMP Camp, etc☆150Updated 9 years ago
- Deprecated, please use https://github.com/jcrist/skein or https://github.com/dask/dask-yarn instead☆52Updated 6 years ago
- Notes on Lambda Architecture☆12Updated 7 years ago
- PySpark for Elastic Search☆55Updated 7 years ago
- Lossy Counting and Sticky Sampling implementation for efficient frequency counts on data streams.☆63Updated 8 years ago
- Real time and offline time series analysis with Spark, Spark Streaming and Storm☆21Updated 4 years ago
- A short guide for transitioning from Python to Scala☆65Updated 9 years ago
- Docker image for apache zeppelin☆38Updated 7 years ago
- Tail a log file and send log lines automatically to a kafka topic☆57Updated 12 years ago
- ☆146Updated 8 years ago
- dllib is a distributed deep learning library running on Apache Spark☆32Updated 7 years ago
- A spark package for loading Spark ML models to Redis-ML☆63Updated 5 years ago
- Code for Packt Publishing's Scala Data Analysis Cookbook.☆49Updated 9 years ago
- Few things we've met during our etl project based on spark☆24Updated 6 years ago
- Code reference from my Qbox blog posts.☆87Updated 9 years ago
- Probabilistic Data Structures in Python (originally presented at PyData 2013)☆55Updated 3 years ago
- Tool for exploring data on an Apache Kafka cluster☆42Updated 4 years ago
- Coding exercises for Apache Spark☆104Updated 9 years ago
- Spark Application : Spark Summit 2018 : Streaming Trend Discovery☆11Updated 6 years ago
- Cascading on Apache Flink®☆54Updated last year
- An extension of the kafka-python package that adds features like multiprocess consumers.☆39Updated last year