☆248Oct 7, 2019Updated 6 years ago
Alternatives and similar repositories for Apache-Kafka-poc-and-notes
Users that are interested in Apache-Kafka-poc-and-notes are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆130Apr 8, 2017Updated 8 years ago
- Educational notes,Hands on problems w/ solutions for hadoop ecosystem☆87Jan 22, 2019Updated 7 years ago
- Flume-to-Spark-Streaming Log Parser☆23Jun 3, 2016Updated 9 years ago
- Examples of Spark 2.0☆212Aug 11, 2021Updated 4 years ago
- The Internals of Apache Spark☆1,544Jul 5, 2025Updated 8 months ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Spark Gotchas. A subjective compilation of the Apache Spark tips and tricks☆360Jun 6, 2017Updated 8 years ago
- ☆314Nov 26, 2018Updated 7 years ago
- Implementation of unsupervised feature selection algorithm proposed by [Huang, et al. 2015]☆10Dec 25, 2015Updated 10 years ago
- My MSc on Data Science final project. This is a library for Data Pre-processing Algorithms for Streaming in Flink (DPASF)☆18Jul 1, 2019Updated 6 years ago
- Example projects for using Spark and Cassandra With DSE Analytics☆58Oct 10, 2025Updated 5 months ago
- This project enables you to use spring inside of a spark application.☆11May 6, 2015Updated 10 years ago
- Sample processing code using Spark 2.1+ and Scala☆51Jun 28, 2020Updated 5 years ago
- Structured Streaming Machine Learning example with Spark 2.0☆94Apr 24, 2017Updated 8 years ago
- ☆38Feb 28, 2018Updated 8 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Apache Spark (Scala, PySpark, SparkR) Code, Tricks, and References☆69Jan 21, 2019Updated 7 years ago
- ## Auto-archived due to inactivity. ## Simple JVM Profiler Using StatsD and Other Metrics Backends☆15Oct 3, 2023Updated 2 years ago
- Apache Spark applications☆70Dec 17, 2017Updated 8 years ago
- Nested Data (JSON/AVRO/XML) Parsing and Flattening in Spark☆16Jan 22, 2024Updated 2 years ago
- A library you can include in your Spark job to validate the counters and perform operations on success. Goal is scala/java/python support…☆109Feb 1, 2018Updated 8 years ago
- Essential Spark extensions and helper methods ✨😲☆766Sep 14, 2025Updated 6 months ago
- ☆14Jun 24, 2016Updated 9 years ago
- ACID Data Source for Apache Spark based on Hive ACID☆96Jul 7, 2021Updated 4 years ago
- A boilerplate for writing PySpark Jobs☆395Jan 21, 2024Updated 2 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- ☆1,435Mar 21, 2026Updated last week
- Repository used for Spark Trainings☆54Apr 21, 2023Updated 2 years ago
- Build end-to-end Machine Learning pipeline to predict accessibility of playgrounds in NYC☆15Jul 9, 2020Updated 5 years ago
- Self-contained examples of Apache Spark streaming integrated with Apache Kafka.☆198Apr 15, 2018Updated 7 years ago
- Examples for High Performance Spark☆527Mar 21, 2026Updated last week
- Learning PySpark video series☆11Mar 5, 2018Updated 8 years ago
- Base classes to use when writing tests with Spark☆1,549Mar 23, 2026Updated last week
- ☆37May 27, 2025Updated 10 months ago
- Apache Spark™ and Scala Workshops☆265Jul 29, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- This tutorial provides a quick introduction to using Spark☆57Mar 31, 2016Updated 9 years ago
- This is an activator project for showcasing best practices, writing unit test and providing a seed for starting with Slick.☆13May 28, 2017Updated 8 years ago
- ☆11Aug 22, 2023Updated 2 years ago
- CDM conversion of MIMIC dataset.☆17Jun 19, 2016Updated 9 years ago
- Developing Spark External Data Sources using the V2 API☆48Apr 29, 2018Updated 7 years ago
- A framework for creating composable and pluggable data processing pipelines using Apache Spark, and running them on a cluster.☆47Aug 1, 2016Updated 9 years ago
- Demos of discovering weaknesses in various systems☆16Nov 26, 2018Updated 7 years ago