cpbaranwal / Avro-SparkStreaming-Kafka
Code for processing AVRO data in Spark Streaming + Kafka (DirectKafka approach with custom offset management in zookeeper)
☆29Sep 9, 2016Updated 9 years ago
Alternatives and similar repositories for Avro-SparkStreaming-Kafka
Users that are interested in Avro-SparkStreaming-Kafka are comparing it to the libraries listed below
- Storing Kafka offsets in MySQL from Spark Streaming to guarantee zero data loss☆44Aug 2, 2017Updated 8 years ago
- Example project to show how to use Kafka from Spark Streaming with the Confluent schema registry☆11Aug 17, 2016Updated 9 years ago
- DirectKafka examples for Spark Streaming : 1. with checkpointing 2. Custom offset management☆60Sep 9, 2016Updated 9 years ago
- Manually managing Kafka offsets in ZooKeeper for Spark Streaming integrated with Kafka☆21Jul 6, 2018Updated 7 years ago
- PoC application for Microservice Architecture Pattern☆10Jul 14, 2017Updated 8 years ago
- Importing Kafka data into HBase using Spark Streaming☆25Apr 14, 2016Updated 9 years ago
- Real Time Analytics and Data Pipelines based on Spark Streaming☆531Oct 24, 2019Updated 6 years ago
- Ambari stack for easily installing and managing Mongo DB on HDP cluster☆24May 23, 2016Updated 9 years ago
- Spark structured streaming with Kafka data source and writing to Cassandra☆63Dec 5, 2019Updated 6 years ago
- High performance HBase / Spark SQL engine☆28Jul 7, 2022Updated 3 years ago
- The sixth Hangzhou Spark & Flink Meetup☆30May 14, 2018Updated 7 years ago
- Tutorials on distributed task scheduling frameworks, including Quartz, Elastic-Job, and TBSchedule.☆32Mar 4, 2019Updated 6 years ago
- Ambari Service for OpenTSDB☆34Dec 14, 2016Updated 9 years ago
- Companion Code for Using Flume Book☆32May 27, 2015Updated 10 years ago
- SparkOnHBase☆278Mar 30, 2021Updated 4 years ago
- Kafka delivery semantics in the case of failure depend on how and when offsets are stored. Spark output operations are at-least-once. So …☆37Apr 19, 2017Updated 8 years ago
- Schema Registry integration for Apache Spark☆40Nov 16, 2022Updated 3 years ago
- A small project for manually managing offsets in ZooKeeper when integrating Spark Streaming with Kafka☆133Dec 17, 2025Updated last month
- Golang Web Toolkit☆16Nov 5, 2011Updated 14 years ago
- A wrapper for applications to help with running Istio Sidecars☆11Feb 4, 2026Updated last week
- A simple elasticsearch frontend for serving astrophysical simulation catalog data☆10Aug 29, 2025Updated 5 months ago
- Queries the Spark REST API for applications, jobs, stages, executors, RDDs, streaming, environment, and other information to provide monitoring and alerting services☆11Nov 22, 2018Updated 7 years ago
- MongoDB extension for k6 - High-performance load testing with MongoDB support☆14Feb 3, 2026Updated last week
- code repository for Deep learning for NLP using Python (v), Published by Packt☆11Jan 15, 2021Updated 5 years ago
- Open-source distributed workflow scheduling tool that also supports streaming tasks.☆39Nov 11, 2017Updated 8 years ago
- elasticsearch reader and writer plugin for datax☆39Aug 31, 2017Updated 8 years ago
- Encapsulated APIs for combining Spark with other components for ease of use, e.g. ES, HBase, Kudu, Kafka, MQ☆36Dec 18, 2019Updated 6 years ago
- A distributed task scheduling framework built on TBSchedule that can resolve dependencies between tasks and execute them (running Shell and bat scripts)☆12Aug 5, 2016Updated 9 years ago
- This crate provides a procedural macro to create request guards used for authorization.☆11Nov 24, 2025Updated 2 months ago
- Olympia is a storage-only open catalog format for big data analytics, ML & AI.☆16May 5, 2025Updated 9 months ago
- Self-hosted email subscriptions list using serverless AWS stack☆11Aug 31, 2020Updated 5 years ago
- Dockerfile for Nginx + Gunicorn + Flask☆12Dec 24, 2017Updated 8 years ago
- An Azure Application Insights exporter for axum via tracing.☆11Feb 7, 2025Updated last year
- Exploration of spark streaming based on the BigData.be project 2☆15Sep 2, 2013Updated 12 years ago
- A demo repository for "streaming etl" with Apache Flink☆44Jun 8, 2016Updated 9 years ago
- Track user actions quickly in Redis with Node.js☆10Dec 11, 2019Updated 6 years ago
- Practical utilities for spark applications☆11Jan 16, 2024Updated 2 years ago
- A Docsy theme example for "mostly docs"☆12Feb 12, 2023Updated 3 years ago
- Ambari service for RedHat FreeIPA☆11Sep 30, 2016Updated 9 years ago