An example project that combines Spark Streaming, Kafka, and Parquet to transform JSON objects streamed over Kafka into Parquet files in S3.
☆19Jun 22, 2021Updated 4 years ago
Alternatives and similar repositories for spark-kafka-parquet-example
Users that are interested in spark-kafka-parquet-example are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆14Nov 3, 2016Updated 9 years ago
- Some popular algorithms(dbscan,knn,fm etc.) on spark☆32May 29, 2018Updated 7 years ago
- A WIP Udemy downloader written in Go☆11Mar 20, 2022Updated 4 years ago
- Play-ParSeq is a Play module which seamlessly integrates ParSeq with Play Framework☆17May 20, 2023Updated 2 years ago
- Demo for service oriented application hosted on Hadoop YARN cluster for HA and scheduling☆23Apr 2, 2018Updated 7 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Kafka delivery semantics in the case of failure depend on how and when offsets are stored. Spark output operations are at-least-once. So …☆37Apr 19, 2017Updated 8 years ago
- Problems can be found over - https://www.hackerrank.com/domains/shell/bash/☆13Jan 20, 2015Updated 11 years ago
- Working example of consuming Avro data from Kafka with Spark Streaming☆12Feb 21, 2016Updated 10 years ago
- These are a select few projects related to Big Data Analytics and Management. The projects listed are a combination of both small and big…☆11Oct 11, 2019Updated 6 years ago
- Code for Springer Book: High Performance Distributed Computing: Case Studies with Hadoop, Scalding and Spark☆15Oct 6, 2017Updated 8 years ago
- Contain Interview Questions Solutions☆12May 18, 2018Updated 7 years ago
- graphx example☆24Jan 23, 2016Updated 10 years ago
- My Study guide used to pass the CRT020 Spark Certification exam☆34Jan 6, 2020Updated 6 years ago
- Example of running the flume log4j appender using CDH4 Flume☆15Jan 17, 2013Updated 13 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Apache Flink 学习的Demo☆10Jun 21, 2017Updated 8 years ago
- Reusable code for Hive☆16Aug 19, 2014Updated 11 years ago
- Hive Web Interface☆29Apr 29, 2014Updated 11 years ago
- ☆35Dec 2, 2016Updated 9 years ago
- Sample processing code using Spark 2.1+ and Scala☆51Jun 28, 2020Updated 5 years ago
- 一些机器学习的实践☆11Jun 29, 2022Updated 3 years ago
- Apache Solr 官方参考手册☆15Sep 16, 2015Updated 10 years ago
- Spark structured streaming with Kafka data source and writing to Cassandra☆63Dec 5, 2019Updated 6 years ago
- Simple concept of Actor Model in Objective-C based on the idea of Valletta Ventures Actors library.☆57Jun 23, 2018Updated 7 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Spark On Angel, arming Spark with a powerful Parameter Server, which enable Spark to train very big models☆84Jan 2, 2023Updated 3 years ago
- 这是一个由LangGraph协议主导的因果分析Muti-Agent,结合MCP,RAG等多种工具进行辅助进行因果分析,提供给用户一份完善的因果分析的分析报告和因果图☆34Mar 21, 2026Updated last week
- Flink Examples☆38Apr 27, 2016Updated 9 years ago
- 请求spark rest API获取applications,jobs,stages,executors,rdds,streaming,environment等信息提供监控和报警服务☆11Nov 22, 2018Updated 7 years ago
- IoT Trucking App with Flink (with Table API & SQL)☆14Jul 4, 2018Updated 7 years ago
- ☆10Feb 12, 2020Updated 6 years ago
- Import data from clickhouse to hadoop with pure SQL☆36Mar 19, 2019Updated 7 years ago
- Additional useful algorithms that can be used with spark.☆24Dec 24, 2014Updated 11 years ago
- A set of tools that make working with the Scala ecosystem even better.☆12Mar 16, 2026Updated last week
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- linux版的微信小程序开发工具. 源码与官方一致☆16May 9, 2017Updated 8 years ago
- Shadowsocks-libev for OpenWrt☆12Aug 17, 2015Updated 10 years ago
- 基于FastAPI + LangChain + OpenAI API + Vue的AI表格处理工具,用于智能化处理和分析表格数据。☆19Jul 14, 2025Updated 8 months ago
- 使用spring-boot-spark的一个样例☆11Aug 3, 2018Updated 7 years ago
- Python library that finds the size / type of an image given its URI by fetching as little as needed☆28Jun 1, 2017Updated 8 years ago
- CSSAppy sample project☆21Aug 17, 2011Updated 14 years ago
- Take a peek at HN/知乎日报/V2EX/SBBS within Emacs☆13Jun 7, 2015Updated 10 years ago