Code for processing AVRO data in Spark Streaming + Kafka (DirectKafka approach with custom offset management in zookeeper)
☆29Sep 9, 2016Updated 9 years ago
Alternatives and similar repositories for Avro-SparkStreaming-Kafka
Users that are interested in Avro-SparkStreaming-Kafka are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- SparkStreaming中利用MySQL保存Kafka偏移量保证0数据丢失☆44Aug 2, 2017Updated 8 years ago
- Spark 3.0.0 Structured Streaming Kafka Avro Demo☆15Apr 21, 2023Updated 2 years ago
- Kafka stream for Spark with storage of the offsets in ZooKeeper☆60Apr 18, 2017Updated 8 years ago
- Example project to show how to use Kafka from Spark Streaming with the Confluent schema registry☆11Aug 17, 2016Updated 9 years ago
- MCP = Multiple source Convert Platform☆11Aug 2, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- 手动管理spark streaming集成kafka的数据偏移量到zookeeper中☆21Jul 6, 2018Updated 7 years ago
- DirectKafka examples for Spark Streaming : 1. with checkpointing 2. Custom offset management☆60Sep 9, 2016Updated 9 years ago
- ☆14Nov 3, 2016Updated 9 years ago
- 封装sparkstreaming动态调节batch time(有数据就执行计算); 支持运行过程中增删topic; 封装sparkstreaming 1.6 - kafka 010 用以支持 SSL。☆181Apr 15, 2021Updated 5 years ago
- 使用spark streaming 导入kafka数据到hbase☆25Apr 14, 2016Updated 10 years ago
- A light Kafka to HDFS/S3 ETL library based on Apache Spark☆40Jun 29, 2017Updated 8 years ago
- 基于flink1.12,使用java,flink sql的demo,包含Mylsql, flinkcdc内置的Mysqlcdc☆12May 27, 2021Updated 4 years ago
- Project for reading data from kafka and writing to kafka and HBase with kerberos☆24Dec 8, 2016Updated 9 years ago
- A Spark SQL HBase connector☆29May 4, 2015Updated 10 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Hadoop MapReduce tool to convert Avro data files to Parquet format.☆33May 22, 2013Updated 12 years ago
- SparkOnHBase☆278Mar 30, 2021Updated 5 years ago
- Golang log package extension. Automates logging routine: split files, archive, etc.☆11Nov 27, 2017Updated 8 years ago
- Python SDK for BearyChat☆17Jul 31, 2019Updated 6 years ago
- API Gateway for *.ik.am☆15Jun 10, 2021Updated 4 years ago
- Real Time Analytics and Data Pipelines based on Spark Streaming☆531Oct 24, 2019Updated 6 years ago
- Spark structured streaming with Kafka data source and writing to Cassandra☆63Dec 5, 2019Updated 6 years ago
- Total Anomaly Detection System for software logs and traces☆10Dec 7, 2015Updated 10 years ago
- a hyper-optimized single-node(local) version of spark sql engine, which's fundamental data structure is scala Iterator rather than RDD.☆13Jun 13, 2023Updated 2 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Kafka delivery semantics in the case of failure depend on how and when offsets are stored. Spark output operations are at-least-once. So …☆37Apr 19, 2017Updated 8 years ago
- A benchmark of globally-optimal anonymization methods for biomedical data☆16Dec 11, 2014Updated 11 years ago
- High Performance Kafka Connector for Spark Streaming.Supports Multi Topic Fetch, Kafka Security. Reliable offset management in Zookeeper.…☆636Feb 26, 2022Updated 4 years ago
- An Apache Phoenix Hibernate dialect☆21May 23, 2018Updated 7 years ago
- SparkListener that converts SparkListenerEvents to JSON and forwards them to an external service via RPC.☆17Apr 6, 2021Updated 5 years ago
- Ambari Service for OpenTSDB☆34Dec 14, 2016Updated 9 years ago
- Encapsulated spark 与其他组件的结合api,方便使用,例如 es,hbase,kudu,kafka,mq等☆35Dec 18, 2019Updated 6 years ago
- A simple elasticsearch frontend for serving astrophysical simulation catalog data☆10Mar 14, 2026Updated last month
- ☆50Feb 11, 2020Updated 6 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Spacy model trained based on Norwegian corpus converted from OBT to Universal dep.☆13Jan 31, 2018Updated 8 years ago
- Based off the design of SparkOnHBase. This Repo will support Spark, Spark Streaming, and Spark SQL integration with Kudu.☆50May 19, 2016Updated 9 years ago
- NiFi provenance reporting tasks☆14Sep 21, 2023Updated 2 years ago
- Spring Data implementation for ElasticSearch☆63Feb 22, 2022Updated 4 years ago
- 一个手动管理spark streaming集成kafka时的偏移量到zookeeper中的小项目☆133Dec 17, 2025Updated 3 months ago
- Schema Registry integration for Apache Spark☆40Nov 16, 2022Updated 3 years ago
- java 人脸识别; Face recognition☆14Aug 20, 2018Updated 7 years ago