justinrmiller / spark-kafka-parquet-example
An example project that combines Spark Streaming, Kafka, and Parquet to transform JSON objects streamed over Kafka into Parquet files in S3.
☆19Jun 22, 2021Updated 4 years ago
Alternatives and similar repositories for spark-kafka-parquet-example
Users who are interested in spark-kafka-parquet-example are comparing it to the libraries listed below.
- ☆14Nov 3, 2016Updated 9 years ago
- Some popular algorithms (DBSCAN, KNN, FM, etc.) on Spark☆32May 29, 2018Updated 7 years ago
- graphx example☆24Jan 23, 2016Updated 10 years ago
- Rust implementation of VibeVoice text-to-speech with voice cloning and multi-speaker synthesis.☆58Jan 30, 2026Updated 2 weeks ago
- A fully functional Interpreter for Lox in Scala 3 (WIP).☆31Feb 8, 2026Updated last week
- Spark and Python study notes☆11Sep 25, 2018Updated 7 years ago
- Spark structured streaming with Kafka data source and writing to Cassandra☆63Dec 5, 2019Updated 6 years ago
- A set of tools that make working with the Scala ecosystem even better.☆12Feb 10, 2026Updated last week
- Import data from clickhouse to hadoop with pure SQL☆36Mar 19, 2019Updated 6 years ago
- My Study guide used to pass the CRT020 Spark Certification exam☆34Jan 6, 2020Updated 6 years ago
- Java library to fulfil the requirement of numpy in java☆22Oct 23, 2024Updated last year
- AQIPython is a Python module that calculates the Air Quality Index (AQI) for various air pollutants based on different standards.☆10Mar 5, 2024Updated last year
- ☆15Apr 23, 2025Updated 9 months ago
- Breast cancer data mining with Python sklearn☆11Apr 13, 2019Updated 6 years ago
- Flink Examples☆38Apr 27, 2016Updated 9 years ago
- Kafka delivery semantics in the case of failure depend on how and when offsets are stored. Spark output operations are at-least-once. So …☆37Apr 19, 2017Updated 8 years ago
- GnuCash Java API☆13Updated this week
- A python wrapper for the QuantAQ RESTful API☆11Dec 24, 2025Updated last month
- An AI spreadsheet-processing tool built on FastAPI + LangChain + OpenAI API + Vue, for intelligently processing and analyzing tabular data.☆17Jul 14, 2025Updated 7 months ago
- A lightweight, native macOS editor for Typst.☆43Dec 9, 2025Updated 2 months ago
- Scraper for aqicn.org☆11Sep 4, 2018Updated 7 years ago
- ☆11Dec 10, 2015Updated 10 years ago
- Spark On Angel, arming Spark with a powerful Parameter Server, which enables Spark to train very big models☆83Jan 2, 2023Updated 3 years ago
- ☆10Feb 12, 2020Updated 6 years ago
- Example of how to work with graphics in Kotlin☆12Sep 29, 2022Updated 3 years ago
- High-performance Rust port of Python's popular Rich library☆21Updated this week
- Converts JSON or SQL into Flink or Spark streaming/batch jobs☆12Jun 21, 2022Updated 3 years ago
- Scala command-line wrapper around ffmpeg, ffprobe, ImageMagick, and other tools relating to media.☆36Dec 21, 2024Updated last year
- Interplanetary Database: A Database built on top of IPFS and made immutable using Ethereum blockchain.☆10Sep 19, 2022Updated 3 years ago
- PowerShell Module for interacting with Redis caches☆14Jul 7, 2020Updated 5 years ago
- Queries the Spark REST API for applications, jobs, stages, executors, RDDs, streaming, environment, and other information to provide monitoring and alerting services☆11Nov 22, 2018Updated 7 years ago
- Automatically updating projects.json files from raku ecosystems☆11Mar 3, 2023Updated 2 years ago
- SBT plugin for running mocha JavaScript unit tests on node☆17Jan 13, 2026Updated last month
- docker scripts to build and run a minimal version of TDengine☆10Jul 17, 2019Updated 6 years ago
- Atguigu (尚硅谷) data warehouse documentation☆11Sep 7, 2019Updated 6 years ago
- springboot demo combined with scala and java☆11Dec 7, 2017Updated 8 years ago
- A small markdown TUI note keeper☆15Apr 7, 2025Updated 10 months ago
- comparison study of tab transformer and ft transformer for credit card fraud detection☆11Jan 6, 2023Updated 3 years ago
- Show source locations of core methods and subs☆11Jan 17, 2023Updated 3 years ago