SparkStreaming中利用MySQL保存Kafka偏移量保证0数据丢失
☆44Aug 2, 2017Updated 8 years ago
Alternatives and similar repositories for spark_streaming_kafka_offset
Users that are interested in spark_streaming_kafka_offset are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Kafka delivery semantics in the case of failure depend on how and when offsets are stored. Spark output operations are at-least-once. So …☆37Apr 19, 2017Updated 8 years ago
- Code for processing AVRO data in Spark Streaming + Kafka (DirectKafka approach with custom offset management in zookeeper)☆29Sep 9, 2016Updated 9 years ago
- ☆12May 11, 2016Updated 9 years ago
- 请求spark rest API获取applications,jobs,stages,executors,rdds,streaming,environment等信息提供监控和报警服务☆11Nov 22, 2018Updated 7 years ago
- Use Scala API to read/write data from different databases,HBase,MySQL,etc.☆24Feb 28, 2018Updated 8 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- 封装sparkstreaming动态调节batch time(有数据就执行计算); 支持运行过程中增删topic; 封装sparkstreaming 1.6 - kafka 010 用以支持 SSL。☆181Apr 15, 2021Updated 5 years ago
- 手动管理spark streaming集成kafka的数据偏移量到zookeeper中☆21Jul 6, 2018Updated 7 years ago
- 1.Spark离线批处理,用户实时点击统计;2.SparkSQL日志内容分析;3.受众电影分析 =>(Kafka + SparkStreaming + Redis)和(Kafka + SparkStreaming + Mysql)☆29Jun 21, 2022Updated 3 years ago
- 实时分析nginx日志,计算接口访问次数,uv,时延,异常IP等指标☆29Apr 14, 2017Updated 9 years ago
- spark将hdfs数据高性能灌入kafka,然后spark streaming/structured streaming高速消费,关注性能,欢迎提供性能/代码优化建议☆32Mar 24, 2019Updated 7 years ago
- DirectKafka examples for Spark Streaming : 1. with checkpointing 2. Custom offset management☆60Sep 9, 2016Updated 9 years ago
- SparkStreaming项目,显示flume->Kafka->Spark->hbase(实时数据处理方案),Scala实现☆36Feb 19, 2018Updated 8 years ago
- Spark Streaming实时流处理项目实战☆18Jul 12, 2025Updated 9 months ago
- Scalable CDC Pattern Implemented using PySpark☆18Oct 8, 2025Updated 6 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- 基于袋鼠云提供的开源flinkStreamSQL项目,对其实时sql进行可视化功能开发;通过tcpip通信,前端页面选择需要连接的数据库信息,并写sql语句,点击提交后,后端自动执行集群启动和JobGraph提交,并返回结果给前端页面。实现了使用者即使不了解Kafka、fl…☆11Jun 23, 2019Updated 6 years ago
- spark流数据处理,可以从flume-ng,kafka接收数据☆11Sep 16, 2015Updated 10 years ago
- 在公司接了一个任务,完成一个项目数据同步的模块。要求是不能操作项目的数据库。怕操作不当,数据丢失。所以想到的方案是使用log4jdbc记录数据源的SQL语句到日志文件。然后按行读取日志文件中的数据,记录读取的Point,以便下次继续读取。读取的数据进入bigqueue队列,…☆12Aug 10, 2017Updated 8 years ago
- This project compose of two parts: 1) write, spark job to write to hbase using bulk load to; 2)read, rest api reading from hbase base on …☆20Oct 25, 2017Updated 8 years ago
- hive sql parser☆11Aug 27, 2014Updated 11 years ago
- 使用spark对hive、hbase、ES的读写, 实现一次配置可对不同数据库进行导入导出,并对ES、hbase进行封装☆32May 6, 2017Updated 8 years ago
- Spark机器学习书代码☆25Dec 22, 2017Updated 8 years ago
- 离线调度, hive, 任务依赖, 任务调度, 大数据开发平台☆14May 10, 2018Updated 7 years ago
- Spark structured-streaming 消费kafka数据写入hbase☆33Jan 22, 2019Updated 7 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Kafka stream for Spark with storage of the offsets in ZooKeeper☆60Apr 18, 2017Updated 8 years ago
- Apache flink☆18Feb 8, 2023Updated 3 years ago
- Streaming 相关项目☆15Mar 27, 2017Updated 9 years ago
- Spark Streaming HBase Example☆94Apr 4, 2016Updated 10 years ago
- 基于TBSchedule开发的一个分布式任务调度框架,可以解析任务间的依赖,并执行任务(执行Shell、bat脚本)☆12Aug 5, 2016Updated 9 years ago
- conbine flume,spark-streaming and redis for real-time computing☆22Oct 20, 2014Updated 11 years ago
- Flink Hadoop Compatibility + Elasticsearch for Apache Hadoop = Flink Connector Elasticsearch Source Table。结合flink+hadoop+es 实现的es table s…☆20Jun 28, 2021Updated 4 years ago
- Machine Learning with Spark - Second Edition, by Packt☆115Jan 14, 2021Updated 5 years ago
- spring+spark streaming+kafka 10版本集成和异常问题处理☆17Jul 21, 2017Updated 8 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- kudu学习的一些资料,以及和spark/impala的集成使用☆33Sep 11, 2017Updated 8 years ago
- spring-boot利用scala写spark程序骨架☆28Oct 22, 2019Updated 6 years ago
- My branch of Apache Flume with a generic JDBC sink (not yet licensed to Apache)☆11Feb 12, 2022Updated 4 years ago
- High Performance Kafka Connector for Spark Streaming.Supports Multi Topic Fetch, Kafka Security. Reliable offset management in Zookeeper.…☆636Feb 26, 2022Updated 4 years ago
- Define and schedule workflow, support Flink Jar/SQL, ClickHouse/Hive/Mysql SQL, Shell, etc.☆20Updated this week
- RESP (REdis Serialization Protocol) encoder and decoder.☆19Dec 6, 2015Updated 10 years ago
- Plot live-stats as graph from ApacheSpark application using Lightning-viz☆18Jul 3, 2017Updated 8 years ago