An example project that combines Spark Streaming, Kafka, and Parquet to transform JSON objects streamed over Kafka into Parquet files in S3.
☆19Jun 22, 2021Updated 4 years ago
Alternatives and similar repositories for spark-kafka-parquet-example
Users who are interested in spark-kafka-parquet-example are comparing it to the repositories listed below.
- ☆14Nov 3, 2016Updated 9 years ago
- Some popular algorithms (DBSCAN, kNN, FM, etc.) on Spark☆32May 29, 2018Updated 7 years ago
- GraphX example☆24Jan 23, 2016Updated 10 years ago
- A fully functional Interpreter for Lox in Scala 3 (WIP).☆31Mar 2, 2026Updated last week
- Spark structured streaming with Kafka data source and writing to Cassandra☆63Dec 5, 2019Updated 6 years ago
- Spark and Python study notes☆11Sep 25, 2018Updated 7 years ago
- ☆14Jan 3, 2023Updated 3 years ago
- A set of tools that make working with the Scala ecosystem even better.☆12Updated this week
- Import data from clickhouse to hadoop with pure SQL☆36Mar 19, 2019Updated 6 years ago
- My Study guide used to pass the CRT020 Spark Certification exam☆34Jan 6, 2020Updated 6 years ago
- AQIPython is a Python module that calculates the Air Quality Index (AQI) for various air pollutants based on different standards.☆10Mar 5, 2024Updated 2 years ago
- A Google Chrome extension with the sole purpose of adding copy and paste functionality to Overleaf on your Chrome browser.☆17May 13, 2024Updated last year
- ☆15Apr 23, 2025Updated 10 months ago
- Java library that provides NumPy-like functionality in Java☆22Oct 23, 2024Updated last year
- Breast cancer data mining with Python and sklearn☆11Apr 13, 2019Updated 6 years ago
- Flink Examples☆38Apr 27, 2016Updated 9 years ago
- Kafka delivery semantics in the case of failure depend on how and when offsets are stored. Spark output operations are at-least-once. So …☆37Apr 19, 2017Updated 8 years ago
- PowerShell Module for interacting with Redis caches☆14Jul 7, 2020Updated 5 years ago
- Queries the Spark REST API for applications, jobs, stages, executors, RDDs, streaming, environment, and other information to provide monitoring and alerting services☆11Nov 22, 2018Updated 7 years ago
- A python wrapper for the QuantAQ RESTful API☆11Dec 24, 2025Updated 2 months ago
- Example of how to work with charts in Kotlin☆12Sep 29, 2022Updated 3 years ago
- GnuCash Java API☆13Mar 1, 2026Updated last week
- An AI table-processing tool built on FastAPI + LangChain + OpenAI API + Vue, for intelligently processing and analyzing tabular data.☆18Jul 14, 2025Updated 7 months ago
- ☆11Dec 10, 2015Updated 10 years ago
- Scraper for aqicn.org☆11Sep 4, 2018Updated 7 years ago
- Automatically updating projects.json files from raku ecosystems☆11Mar 3, 2023Updated 3 years ago
- ☆10Feb 12, 2020Updated 6 years ago
- ☆12Jan 12, 2024Updated 2 years ago
- High-performance Rust port of Python's popular Rich library☆21Feb 22, 2026Updated 2 weeks ago
- Converts JSON or SQL into Flink or Spark streaming/batch jobs☆12Jun 21, 2022Updated 3 years ago
- Scala command-line wrapper around ffmpeg, ffprobe, ImageMagick, and other tools relating to media.☆36Dec 21, 2024Updated last year
- Spark On Angel, arming Spark with a powerful Parameter Server, which enables Spark to train very big models☆83Jan 2, 2023Updated 3 years ago
- Cloyster HPC is a turnkey HPC cluster solution with a user-friendly installer☆10Oct 2, 2025Updated 5 months ago
- Organize test codes for all languages☆10Mar 23, 2025Updated 11 months ago
- Terraform module to deploy Azure Landing Zone Management resources.☆11Jan 30, 2025Updated last year
- docker scripts to build and run a minimal version of TDengine☆10Jul 17, 2019Updated 6 years ago
- ☆10Jul 31, 2018Updated 7 years ago
- ☆11Mar 5, 2015Updated 11 years ago
- A python library to prepare data for AERMOD model inputs (Hong Kong).☆11Dec 2, 2021Updated 4 years ago