vaquarkhan / Apache-Kafka-poc-and-notesView external linksLinks
☆248Oct 7, 2019Updated 6 years ago
Alternatives and similar repositories for Apache-Kafka-poc-and-notes
Users that are interested in Apache-Kafka-poc-and-notes are comparing it to the libraries listed below
Sorting:
- ☆129Apr 8, 2017Updated 8 years ago
- Spark Gotchas. A subjective compilation of the Apache Spark tips and tricks☆360Jun 6, 2017Updated 8 years ago
- Examples of Spark 2.0☆212Aug 11, 2021Updated 4 years ago
- ☆314Nov 26, 2018Updated 7 years ago
- The Internals of Apache Spark☆1,538Jul 5, 2025Updated 7 months ago
- Educational notes,Hands on problems w/ solutions for hadoop ecosystem☆87Jan 22, 2019Updated 7 years ago
- This project enables you to use spring inside of a spark application.☆11May 6, 2015Updated 10 years ago
- ☆38Feb 28, 2018Updated 7 years ago
- Structured Streaming Machine Learning example with Spark 2.0☆94Apr 24, 2017Updated 8 years ago
- CDM conversion of MIMIC dataset.☆17Jun 19, 2016Updated 9 years ago
- Some AWS EMR examples☆16Jan 18, 2018Updated 8 years ago
- ## Auto-archived due to inactivity. ## Simple JVM Profiler Using StatsD and Other Metrics Backends☆15Oct 3, 2023Updated 2 years ago
- A library you can include in your Spark job to validate the counters and perform operations on success. Goal is scala/java/python support…☆108Feb 1, 2018Updated 8 years ago
- Sample processing code using Spark 2.1+ and Scala☆51Jun 28, 2020Updated 5 years ago
- Apache Spark™ and Scala Workshops☆264Jul 29, 2024Updated last year
- Examples for High Performance Spark☆526Jan 21, 2026Updated 3 weeks ago
- Essential Spark extensions and helper methods ✨😲☆765Sep 14, 2025Updated 5 months ago
- Self-contained examples of Apache Spark streaming integrated with Apache Kafka.☆199Apr 15, 2018Updated 7 years ago
- Developing Spark External Data Sources using the V2 API☆48Apr 29, 2018Updated 7 years ago
- Bulletproof Apache Spark jobs with fast root cause analysis of failures.☆73Mar 14, 2021Updated 4 years ago
- Qubole Streaminglens tool for tuning Spark Structured Streaming Pipelines☆17Jan 21, 2020Updated 6 years ago
- ACID Data Source for Apache Spark based on Hive ACID☆96Jul 7, 2021Updated 4 years ago
- Base classes to use when writing tests with Spark☆1,550Dec 22, 2025Updated last month
- This tutorial provides a quick introduction to using Spark☆57Mar 31, 2016Updated 9 years ago
- Paper: A Zero-rename committer for object stores☆20Nov 7, 2025Updated 3 months ago
- Learning PySpark video series☆11Mar 5, 2018Updated 7 years ago
- Repository used for Spark Trainings☆54Apr 21, 2023Updated 2 years ago
- Dumping ground for random stuff☆55Jun 14, 2025Updated 8 months ago
- A boilerplate for writing PySpark Jobs☆395Jan 21, 2024Updated 2 years ago
- Apache Spark (Scala, PySpark, SparkR) Code, Tricks, and References☆69Jan 21, 2019Updated 7 years ago
- Example projects for using Spark and Cassandra With DSE Analytics☆58Oct 10, 2025Updated 4 months ago
- Because its never late to start taking notes and 'public' it...☆62Jun 3, 2025Updated 8 months ago
- A toolset to streamline running spark python on EMR☆20Nov 16, 2016Updated 9 years ago
- A tutorial on Apache Spark Unit Testing☆37Jan 27, 2016Updated 10 years ago
- ☆37May 27, 2025Updated 8 months ago
- Alchemist: an Apache Spark<->MPI interface☆26May 24, 2018Updated 7 years ago
- This repo contains the code demonstrated in the Analytics Vidhya article about PyWebIO usage and the ML model prediction code.☆11Apr 22, 2021Updated 4 years ago
- ☆11Aug 22, 2023Updated 2 years ago
- Contain Interview Questions Solutions☆12May 18, 2018Updated 7 years ago