Code for processing AVRO data in Spark Streaming + Kafka (DirectKafka approach with custom offset management in zookeeper)
☆29Sep 9, 2016Updated 9 years ago
Alternatives and similar repositories for Avro-SparkStreaming-Kafka
Users that are interested in Avro-SparkStreaming-Kafka are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- SparkStreaming中利用MySQL保存Kafka偏移量保证0数据丢失☆43Aug 2, 2017Updated 8 years ago
- Spark 3.0.0 Structured Streaming Kafka Avro Demo☆15Apr 21, 2023Updated 3 years ago
- Kafka stream for Spark with storage of the offsets in ZooKeeper☆60Apr 18, 2017Updated 9 years ago
- MCP = Multiple source Convert Platform☆11Aug 2, 2022Updated 3 years ago
- DirectKafka examples for Spark Streaming : 1. with checkpointing 2. Custom offset management☆60Sep 9, 2016Updated 9 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- 封装sparkstreaming动态调节batch time(有数据就执行计算); 支持运行过程中增删topic; 封装sparkstreaming 1.6 - kafka 010 用以支持 SSL。☆181Apr 15, 2021Updated 5 years ago
- 记录Spark、Flink研究经验☆25Aug 11, 2019Updated 6 years ago
- 使用spark streaming 导入kafka数据到hbase☆25Apr 14, 2016Updated 10 years ago
- A light Kafka to HDFS/S3 ETL library based on Apache Spark☆40Jun 29, 2017Updated 8 years ago
- 基于flink1.12,使用java,flink sql的demo,包含Mylsql, flinkcdc内置的Mysqlcdc☆12May 27, 2021Updated 5 years ago
- Project for reading data from kafka and writing to kafka and HBase with kerberos☆24Dec 8, 2016Updated 9 years ago
- A Spark SQL HBase connector☆29May 4, 2015Updated 11 years ago
- Hadoop MapReduce tool to convert Avro data files to Parquet format.☆32May 22, 2013Updated 13 years ago
- SparkOnHBase☆278Mar 30, 2021Updated 5 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- A document tool build on Vue.☆10May 13, 2016Updated 10 years ago
- Golang log package extension. Automates logging routine: split files, archive, etc.☆11Nov 27, 2017Updated 8 years ago
- Real Time Analytics and Data Pipelines based on Spark Streaming☆531Oct 24, 2019Updated 6 years ago
- Spark structured streaming with Kafka data source and writing to Cassandra☆62Dec 5, 2019Updated 6 years ago
- Kafka delivery semantics in the case of failure depend on how and when offsets are stored. Spark output operations are at-least-once. So …☆37Apr 19, 2017Updated 9 years ago
- A benchmark of globally-optimal anonymization methods for biomedical data☆16Dec 11, 2014Updated 11 years ago
- High Performance Kafka Connector for Spark Streaming.Supports Multi Topic Fetch, Kafka Security. Reliable offset management in Zookeeper.…☆635Apr 24, 2026Updated last month
- SparkSQL数据分析案例☆23Dec 3, 2016Updated 9 years ago
- An Apache Phoenix Hibernate dialect☆21May 23, 2018Updated 8 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- SparkListener that converts SparkListenerEvents to JSON and forwards them to an external service via RPC.☆17Apr 6, 2021Updated 5 years ago
- Ambari Service for OpenTSDB☆34Dec 14, 2016Updated 9 years ago
- Encapsulated spark 与其他组件的结合api,方便使用,例如 es,hbase,kudu,kafka,mq等☆36Dec 18, 2019Updated 6 years ago
- A simple elasticsearch frontend for serving astrophysical simulation catalog data☆11Mar 14, 2026Updated 3 months ago
- ☆51May 21, 2026Updated 3 weeks ago
- Based off the design of SparkOnHBase. This Repo will support Spark, Spark Streaming, and Spark SQL integration with Kudu.☆50May 19, 2016Updated 10 years ago
- A docker image to export (live) mysqlbinlog data to a pretty printed json format☆18Mar 23, 2022Updated 4 years ago
- Schema Registry integration for Apache Spark☆40Nov 16, 2022Updated 3 years ago
- This will help you to generate AVRO schema from JSON schema.☆35Nov 16, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Webapp for fetching vehicle positions from a SIRI Vehicle Monitoring (VM)-feed and displaying a map via Leaflet w/ plugins.☆18Feb 2, 2017Updated 9 years ago
- DataNode Volumes Rebalancing tool for Apache Hadoop HDFS (HDFS-1312)☆23Dec 12, 2017Updated 8 years ago
- A demo repository for "streaming etl" with Apache Flink☆44Jun 8, 2016Updated 10 years ago
- import jdbc meta to atlas☆10Sep 8, 2022Updated 3 years ago
- Helpful script to resolve hbase issues☆12Aug 6, 2022Updated 3 years ago
- spark实例代码☆78Nov 11, 2017Updated 8 years ago
- ACL Management for Apache Spark SQL with Apache Ranger☆17Jun 18, 2020Updated 5 years ago