An extension of the kafka-python package that adds features like multiprocess consumers.
☆39Aug 24, 2023Updated 2 years ago
Alternatives and similar repositories for yelp_kafka
Users that are interested in yelp_kafka are comparing it to the libraries listed below
Sorting:
- Provides a Pythonic interface for reading and writing Avro schemas☆27Aug 17, 2022Updated 3 years ago
- A schema store service that tracks and manages all the schemas used in the Data Pipeline☆88Mar 2, 2021Updated 5 years ago
- Data Pipeline Clientlib provides an interface to tail and publish to data pipeline topics.☆110Aug 17, 2022Updated 3 years ago
- MySQLStreamer is a database change data capture and publish system.☆411Aug 17, 2022Updated 3 years ago
- Python client for Apache Kafka☆17Sep 29, 2025Updated 5 months ago
- Variational Factorization Machines☆17Dec 20, 2016Updated 9 years ago
- A WIP Udemy downloader written in Go☆11Mar 20, 2022Updated 4 years ago
- Csv2Hive is an useful CSV schema finder for the Big Data. It discovers automatically schemas in big CSV files, generates the 'CREATE TABL…☆27Oct 13, 2017Updated 8 years ago
- Partial dumper for MySQL☆16May 30, 2016Updated 9 years ago
- Yara Plugin for Binary Ninja☆13Feb 13, 2018Updated 8 years ago
- ☆14Nov 3, 2016Updated 9 years ago
- Easily write tests and fuzz many different programs.☆12Dec 13, 2022Updated 3 years ago
- R files containing the code used to predict rugby world cup matches☆10Sep 18, 2015Updated 10 years ago
- A tutorial on Apache Spark Unit Testing☆37Jan 27, 2016Updated 10 years ago
- Monitor docker Swarm services and sends a pushover notification if anyone is down☆22Nov 27, 2019Updated 6 years ago
- Ingest tweets with Kafka. Use Spark to track popular hashtags and trendsetters for each hashtag☆29Apr 11, 2016Updated 9 years ago
- A scalable framework for binary analysis in a containered environment.☆13May 20, 2019Updated 6 years ago
- integrating bro into yara☆33Dec 9, 2014Updated 11 years ago
- Working example of consuming Avro data from Kafka with Spark Streaming☆12Feb 21, 2016Updated 10 years ago
- An analysis on Aadhaar dataset using Mapreduce and Spark☆14Feb 28, 2018Updated 8 years ago
- ☆23Apr 4, 2018Updated 7 years ago
- ☆11Dec 6, 2017Updated 8 years ago
- Code for Springer Book: High Performance Distributed Computing: Case Studies with Hadoop, Scalding and Spark☆15Oct 6, 2017Updated 8 years ago
- Python DBAPI Driver and Sqlalchemy Dialect for Apache Kylin, the "Extreme OLAP Engine for Big Data"☆32Oct 9, 2016Updated 9 years ago
- A Text Comprehension Engine in Python☆15Aug 23, 2015Updated 10 years ago
- A collection of examples and best practices for AngularJS projects☆15Apr 8, 2014Updated 11 years ago
- my blog☆10Apr 10, 2020Updated 5 years ago
- ☆11Aug 3, 2021Updated 4 years ago
- Contain Interview Questions Solutions☆12May 18, 2018Updated 7 years ago
- An example project that combines Spark Streaming, Kafka, and Parquet to transform JSON objects streamed over Kafka into Parquet files in …☆19Jun 22, 2021Updated 4 years ago
- End-to-end Machine Learning Pipeline demo using Delta Lake, MLflow and AzureML in Azure Databricks☆18Nov 9, 2019Updated 6 years ago
- A plugin for Apache Airflow that allows you to manage the users that can login☆14Nov 18, 2019Updated 6 years ago
- A Diamond collector for capturing memcached slab statistics.☆14Mar 2, 2017Updated 9 years ago
- PoC exploit code for CVE-2015-5477 BIND9 TKEY remote DoS vulnerability☆14Aug 1, 2015Updated 10 years ago
- All your mongos in one place☆33Apr 8, 2024Updated last year
- Automatically exported from code.google.com/p/david-mysql-tools☆11Mar 1, 2016Updated 10 years ago
- Performs OCR on image files and scans them for matches to YARA rules☆42Oct 30, 2018Updated 7 years ago
- Python client for Elasticsearch Watcher (deprecated)☆23Jun 4, 2018Updated 7 years ago
- ☆18Sep 22, 2017Updated 8 years ago