Kafka stream for Spark with storage of the offsets in ZooKeeper
☆60Apr 18, 2017Updated 8 years ago
Alternatives and similar repositories for spark-kafka-source
Users that are interested in spark-kafka-source are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for processing AVRO data in Spark Streaming + Kafka (DirectKafka approach with custom offset management in zookeeper)☆29Sep 9, 2016Updated 9 years ago
- ☆243Jun 14, 2018Updated 7 years ago
- 使用shell脚本部署Apache Doris (incubating) FE & BE☆11Jul 8, 2019Updated 6 years ago
- Example of use of Spark Streaming with Kafka☆90Jul 11, 2014Updated 11 years ago
- ☆48Feb 4, 2018Updated 8 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- spark实例代码☆78Nov 11, 2017Updated 8 years ago
- Ingress data from kafka topic into clickhouse table (JSON format)☆24Apr 12, 2018Updated 8 years ago
- DirectKafka examples for Spark Streaming : 1. with checkpointing 2. Custom offset management☆60Sep 9, 2016Updated 9 years ago
- High Performance Kafka Connector for Spark Streaming.Supports Multi Topic Fetch, Kafka Security. Reliable offset management in Zookeeper.…☆636Feb 26, 2022Updated 4 years ago
- Write your Spark data to Kafka seamlessly☆174Jul 10, 2024Updated last year
- 基于TBSchedule开发的一个分布式任务调度框架,可以解析任务间的依赖,并执行任务(执行Shell、bat脚本)☆12Aug 5, 2016Updated 9 years ago
- Clickhouse typesafe RowBinary insert tooling☆13Jul 6, 2019Updated 6 years ago
- 基于 spark 混合查询平台,支持不同源数据库的联合查询,mysql hive presto ...☆14Aug 3, 2017Updated 8 years ago
- AWS SSM in Action, the next generation of SSH☆23Mar 14, 2018Updated 8 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Example usage of spark cassandra connector☆25Nov 21, 2014Updated 11 years ago
- Ansible playbooks to help to deploy Apache Hadoop,Spark,Storm,Zookeeper,Elasticsearch,Azkaban,Flume,Hbase,Kafka,Kibana,Logstash☆10Mar 21, 2017Updated 9 years ago
- A Kafka metric sink for Apache Spark☆11Apr 13, 2017Updated 9 years ago
- A Real-Time Analytical Processing (RTAP) example using Spark/Shark☆51Feb 21, 2014Updated 12 years ago
- Example using Grafana with Druid☆11Mar 27, 2015Updated 11 years ago
- Real Time Streaming using Apache Spark Streaming [Video], published by Packt☆10Oct 31, 2022Updated 3 years ago
- Notes about Spark Streaming in Apache Spark☆61Apr 7, 2017Updated 9 years ago
- Unix tee, but for Kinesis streams☆12Oct 19, 2021Updated 4 years ago
- HBase RDD example project☆19Jan 22, 2021Updated 5 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- 记录Spark、Flink研究经验☆26Aug 11, 2019Updated 6 years ago
- A testing DSL for kafka-streams☆14Aug 9, 2017Updated 8 years ago
- real time log event processing using spark, kafka & cassandra☆13Dec 4, 2014Updated 11 years ago
- An introduction of Scala learning and some frequently asked questions(FAQ);有关Scala的学习笔记,记录Scala的常用语法及标准库的部分设计原理☆26May 12, 2016Updated 9 years ago
- Play-ParSeq is a Play module which seamlessly integrates ParSeq with Play Framework☆17May 20, 2023Updated 2 years ago
- Pinot 是一个实时分布式的 OLAP 数据存储和分析系统。LinkedIn 使用它实现低延迟可伸缩的实时分析。Pinot 从离线数据源(包括 Hadoop 和各类文件)和在线数据源(如 Kafka)中攫取数据进行分析。Pinot 被设计是可以进行水平扩展的☆16Nov 8, 2015Updated 10 years ago
- NuCypher for Kafka. Start building from this module (it fetches the appropriate branch from Kafka repository)☆17Oct 13, 2017Updated 8 years ago
- This project is a unified ETL platform that support various data processing technologies, including Spark, Hive, Hadoop, Python, Linux Sh…☆17Oct 16, 2015Updated 10 years ago
- A simple Spark LDA example. to demonstrate a full fletched clustering algorithm, with data cleaning using the processess like lemmatizati…☆23Oct 8, 2016Updated 9 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A Node.js REST interface for an Apache Spark REPL which can execute JavaScript.☆17Mar 21, 2016Updated 10 years ago
- This application comes as Spark2.1-as-Service-Provider using an embedded, Reactive-Streams-based, fully asynchronous HTTP server☆50Jul 16, 2023Updated 2 years ago
- An sbt plugin for source code statistics☆83May 23, 2019Updated 6 years ago
- something to help you spark☆19Jul 5, 2017Updated 8 years ago
- An example of integration between angular, ionic, and require, inspired by directory-angular-ionic of Christophe Coenraets☆25Jun 6, 2016Updated 9 years ago
- A type driven approach to string interpolation, aiming at consistent, secure, and only-human-readable logs and console outputs !☆14May 5, 2024Updated last year
- This repo demonstrates how to capture any incoming request and write it as JSON to nginx log using Nginx and Lua. For more details refer …☆12May 22, 2017Updated 8 years ago