Code for processing AVRO data in Spark Streaming + Kafka (DirectKafka approach with custom offset management in zookeeper)
☆29Sep 9, 2016Updated 9 years ago
Alternatives and similar repositories for Avro-SparkStreaming-Kafka
Users that are interested in Avro-SparkStreaming-Kafka are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- SparkStreaming中利用MySQL保存Kafka偏移量保证0数据丢失☆44Aug 2, 2017Updated 8 years ago
- Spark 3.0.0 Structured Streaming Kafka Avro Demo☆15Apr 21, 2023Updated 2 years ago
- Example project to show how to use Kafka from Spark Streaming with the Confluent schema registry☆11Aug 17, 2016Updated 9 years ago
- 手动管理spark streaming集成kafka的数据偏移量到zookeeper中☆21Jul 6, 2018Updated 7 years ago
- DirectKafka examples for Spark Streaming : 1. with checkpointing 2. Custom offset management☆60Sep 9, 2016Updated 9 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆14Nov 3, 2016Updated 9 years ago
- 封装sparkstreaming动态调节batch time(有数据就执行计算); 支持运行过程中增删topic; 封装sparkstreaming 1.6 - kafka 010 用以支持 SSL。☆179Apr 15, 2021Updated 4 years ago
- 记录Spark、Flink研究经验☆26Aug 11, 2019Updated 6 years ago
- 使用spark streaming 导入kafka数据到hbase☆25Apr 14, 2016Updated 9 years ago
- A light Kafka to HDFS/S3 ETL library based on Apache Spark☆40Jun 29, 2017Updated 8 years ago
- Project for reading data from kafka and writing to kafka and HBase with kerberos☆24Dec 8, 2016Updated 9 years ago
- A Spark SQL HBase connector☆29May 4, 2015Updated 10 years ago
- SparkOnHBase☆278Mar 30, 2021Updated 4 years ago
- A document tool build on Vue.☆10May 13, 2016Updated 9 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Golang log package extension. Automates logging routine: split files, archive, etc.☆11Nov 27, 2017Updated 8 years ago
- 请求spark rest API获取applications,jobs,stages,executors,rdds,streaming,environment等信息提供监控和报警服务☆11Nov 22, 2018Updated 7 years ago
- API Gateway for *.ik.am☆15Jun 10, 2021Updated 4 years ago
- Real Time Analytics and Data Pipelines based on Spark Streaming☆530Oct 24, 2019Updated 6 years ago
- Spark structured streaming with Kafka data source and writing to Cassandra☆63Dec 5, 2019Updated 6 years ago
- Kafka delivery semantics in the case of failure depend on how and when offsets are stored. Spark output operations are at-least-once. So …☆37Apr 19, 2017Updated 8 years ago
- A benchmark of globally-optimal anonymization methods for biomedical data☆16Dec 11, 2014Updated 11 years ago
- High Performance Kafka Connector for Spark Streaming.Supports Multi Topic Fetch, Kafka Security. Reliable offset management in Zookeeper.…☆635Feb 26, 2022Updated 4 years ago
- SparkSQL数据分析案例☆23Dec 3, 2016Updated 9 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- SparkListener that converts SparkListenerEvents to JSON and forwards them to an external service via RPC.☆17Apr 6, 2021Updated 4 years ago
- Ambari Service for OpenTSDB☆34Dec 14, 2016Updated 9 years ago
- Encapsulated spark 与其他组件的结合api,方便使用,例如 es,hbase,kudu,kafka,mq等☆35Dec 18, 2019Updated 6 years ago
- ☆50Feb 11, 2020Updated 6 years ago
- Monitoring Traffinal monitoring signal using Apache Flink☆11Oct 25, 2016Updated 9 years ago
- NiFi provenance reporting tasks☆14Sep 21, 2023Updated 2 years ago
- 一个手动管理spark streaming集成kafka时的偏移量到zookeeper中的小项目☆133Dec 17, 2025Updated 3 months ago
- Schema Registry integration for Apache Spark☆40Nov 16, 2022Updated 3 years ago
- This will help you to generate AVRO schema from JSON schema.☆34Nov 16, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- suricata IDS的规则,测试在用的,部分自写的规则视情况放出。☆18Apr 16, 2019Updated 6 years ago
- An OpenCalais API Interface for Python.☆21Mar 13, 2012Updated 14 years ago
- Webapp for fetching vehicle positions from a SIRI Vehicle Monitoring (VM)-feed and displaying a map via Leaflet w/ plugins.☆18Feb 2, 2017Updated 9 years ago
- DataNode Volumes Rebalancing tool for Apache Hadoop HDFS (HDFS-1312)☆23Dec 12, 2017Updated 8 years ago
- 在mysql-binlog-connector-java基础上参考 keking-binlog-distributor,提供了监听mysql数据库二进制日志并进行分发的功能☆15Jun 21, 2022Updated 3 years ago
- import jdbc meta to atlas☆11Sep 8, 2022Updated 3 years ago
- A demo repository for "streaming etl" with Apache Flink☆44Jun 8, 2016Updated 9 years ago