☆248Oct 7, 2019Updated 6 years ago
Alternatives and similar repositories for Apache-Kafka-poc-and-notes
Users that are interested in Apache-Kafka-poc-and-notes are comparing it to the libraries listed below
- ☆130Apr 8, 2017Updated 8 years ago
- Spark Gotchas. A subjective compilation of the Apache Spark tips and tricks☆360Jun 6, 2017Updated 8 years ago
- Examples of Spark 2.0☆212Aug 11, 2021Updated 4 years ago
- ☆314Nov 26, 2018Updated 7 years ago
- The Internals of Apache Spark☆1,541Jul 5, 2025Updated 8 months ago
- Educational notes, hands-on problems w/ solutions for the Hadoop ecosystem☆87Jan 22, 2019Updated 7 years ago
- This project enables you to use Spring inside of a Spark application.☆11May 6, 2015Updated 10 years ago
- ☆38Feb 28, 2018Updated 8 years ago
- Structured Streaming Machine Learning example with Spark 2.0☆94Apr 24, 2017Updated 8 years ago
- Flume-to-Spark-Streaming Log Parser☆23Jun 3, 2016Updated 9 years ago
- ## Auto-archived due to inactivity. ## Simple JVM Profiler Using StatsD and Other Metrics Backends☆15Oct 3, 2023Updated 2 years ago
- Some AWS EMR examples☆16Jan 18, 2018Updated 8 years ago
- CDM conversion of MIMIC dataset.☆17Jun 19, 2016Updated 9 years ago
- A library you can include in your Spark job to validate the counters and perform operations on success. Goal is scala/java/python support…☆108Feb 1, 2018Updated 8 years ago
- Apache Spark™ and Scala Workshops☆265Jul 29, 2024Updated last year
- Examples for High Performance Spark☆527Updated this week
- Essential Spark extensions and helper methods ✨😲☆766Sep 14, 2025Updated 5 months ago
- Self-contained examples of Apache Spark streaming integrated with Apache Kafka.☆198Apr 15, 2018Updated 7 years ago
- Developing Spark External Data Sources using the V2 API☆48Apr 29, 2018Updated 7 years ago
- Qubole Streaminglens tool for tuning Spark Structured Streaming Pipelines☆17Jan 21, 2020Updated 6 years ago
- Bulletproof Apache Spark jobs with fast root cause analysis of failures.☆73Mar 14, 2021Updated 4 years ago
- ACID Data Source for Apache Spark based on Hive ACID☆96Jul 7, 2021Updated 4 years ago
- Base classes to use when writing tests with Spark☆1,550Dec 22, 2025Updated 2 months ago
- This tutorial provides a quick introduction to using Spark☆57Mar 31, 2016Updated 9 years ago
- Paper: A Zero-rename committer for object stores☆20Nov 7, 2025Updated 4 months ago
- Learning PySpark video series☆11Mar 5, 2018Updated 8 years ago
- Repository used for Spark Trainings☆54Apr 21, 2023Updated 2 years ago
- Dumping ground for random stuff☆55Jun 14, 2025Updated 8 months ago
- This repository houses the Query It! experience.☆11Apr 29, 2020Updated 5 years ago
- Apache Spark (Scala, PySpark, SparkR) Code, Tricks, and References☆69Jan 21, 2019Updated 7 years ago
- Example projects for using Spark and Cassandra With DSE Analytics☆58Oct 10, 2025Updated 4 months ago
- Because it's never too late to start taking notes and 'public' it...☆63Jun 3, 2025Updated 9 months ago
- ☆37May 27, 2025Updated 9 months ago
- A tutorial on Apache Spark Unit Testing☆37Jan 27, 2016Updated 10 years ago
- Contains interview question solutions☆12May 18, 2018Updated 7 years ago
- Jasmine "lnishan" Chen's Curriculum Vitae (CV) in Markdown☆10May 23, 2018Updated 7 years ago
- This repo contains the code demonstrated in the Analytics Vidhya article about PyWebIO usage and the ML model prediction code.☆11Apr 22, 2021Updated 4 years ago
- ☆11Aug 22, 2023Updated 2 years ago
- Big Data Processing Framework - Unified Data API or SQL on Any Storage☆251Jul 10, 2025Updated 7 months ago