Code for processing AVRO data in Spark Streaming + Kafka (DirectKafka approach with custom offset management in zookeeper)
☆29Sep 9, 2016Updated 9 years ago
Alternatives and similar repositories for Avro-SparkStreaming-Kafka
Users that are interested in Avro-SparkStreaming-Kafka are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- SparkStreaming中利用MySQL保存Kafka偏移量保证0数据丢失☆43Aug 2, 2017Updated 8 years ago
- Spark 3.0.0 Structured Streaming Kafka Avro Demo☆15Apr 21, 2023Updated 3 years ago
- Example project to show how to use Kafka from Spark Streaming with the Confluent schema registry☆11Aug 17, 2016Updated 9 years ago
- MCP = Multiple source Convert Platform☆11Aug 2, 2022Updated 3 years ago
- DirectKafka examples for Spark Streaming : 1. with checkpointing 2. Custom offset management☆60Sep 9, 2016Updated 9 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆14Nov 3, 2016Updated 9 years ago
- 封装sparkstreaming动态调节batch time(有数据就执行计算); 支持运行过程中增删topic; 封装sparkstreaming 1.6 - kafka 010 用以支持 SSL。☆181Apr 15, 2021Updated 5 years ago
- 记录Spark、Flink研究经验☆25Aug 11, 2019Updated 6 years ago
- 使用spark streaming 导入kafka数据到hbase☆25Apr 14, 2016Updated 10 years ago
- A light Kafka to HDFS/S3 ETL library based on Apache Spark☆40Jun 29, 2017Updated 8 years ago
- 基于flink1.12,使用java,flink sql的demo,包含Mylsql, flinkcdc内置的Mysqlcdc☆12May 27, 2021Updated 4 years ago
- Project for reading data from kafka and writing to kafka and HBase with kerberos☆24Dec 8, 2016Updated 9 years ago
- A Spark SQL HBase connector☆29May 4, 2015Updated 11 years ago
- Hadoop MapReduce tool to convert Avro data files to Parquet format.☆32May 22, 2013Updated 12 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- SparkOnHBase☆278Mar 30, 2021Updated 5 years ago
- A document tool build on Vue.☆10May 13, 2016Updated 9 years ago
- Exploration of spark streaming based on the BigData.be project 2☆15Sep 2, 2013Updated 12 years ago
- Python SDK for BearyChat☆17Jul 31, 2019Updated 6 years ago
- API Gateway for *.ik.am☆15Jun 10, 2021Updated 4 years ago
- Real Time Analytics and Data Pipelines based on Spark Streaming☆531Oct 24, 2019Updated 6 years ago
- Spark structured streaming with Kafka data source and writing to Cassandra☆63Dec 5, 2019Updated 6 years ago
- Spark Streaming与OpenCV传感器数据实时获取☆13Jun 20, 2016Updated 9 years ago
- spring-boot利用scala写spark程序骨架☆28Oct 22, 2019Updated 6 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Kafka delivery semantics in the case of failure depend on how and when offsets are stored. Spark output operations are at-least-once. So …☆37Apr 19, 2017Updated 9 years ago
- A benchmark of globally-optimal anonymization methods for biomedical data☆16Dec 11, 2014Updated 11 years ago
- High Performance Kafka Connector for Spark Streaming.Supports Multi Topic Fetch, Kafka Security. Reliable offset management in Zookeeper.…☆636Apr 24, 2026Updated last week
- An Apache Phoenix Hibernate dialect☆21May 23, 2018Updated 7 years ago
- SparkListener that converts SparkListenerEvents to JSON and forwards them to an external service via RPC.☆17Apr 6, 2021Updated 5 years ago
- Ambari Service for OpenTSDB☆34Dec 14, 2016Updated 9 years ago
- Encapsulated spark 与其他组件的结合api,方便使用,例如 es,hbase,kudu,kafka,mq等☆36Dec 18, 2019Updated 6 years ago
- A simple elasticsearch frontend for serving astrophysical simulation catalog data☆10Mar 14, 2026Updated last month
- ☆50Feb 11, 2020Updated 6 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Based off the design of SparkOnHBase. This Repo will support Spark, Spark Streaming, and Spark SQL integration with Kudu.☆50May 19, 2016Updated 9 years ago
- Spring Data implementation for ElasticSearch☆63Feb 22, 2022Updated 4 years ago
- 一个手动管理spark streaming集成kafka时的偏移量到zookeeper中的小项目☆133Dec 17, 2025Updated 4 months ago
- A docker image to export (live) mysqlbinlog data to a pretty printed json format☆18Mar 23, 2022Updated 4 years ago
- Spark Structured Streaming / Kafka / Cassandra / Elastic☆186Feb 7, 2023Updated 3 years ago
- java 人脸识别; Face recognition☆14Aug 20, 2018Updated 7 years ago
- An OpenCalais API Interface for Python.☆21Mar 13, 2012Updated 14 years ago