vaquarkhan/Apache-Kafka-poc-and-notes

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/vaquarkhan/Apache-Kafka-poc-and-notes)

vaquarkhan / Apache-Kafka-poc-and-notes

☆246

Alternatives and similar repositories for Apache-Kafka-poc-and-notes

Users that are interested in Apache-Kafka-poc-and-notes are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

rohgar / scala-spark-4
View on GitHub
☆130Apr 8, 2017Updated 9 years ago
Pushkr / Apache-Spark-Hands-On
View on GitHub
Educational notes,Hands on problems w/ solutions for hadoop ecosystem
☆87Jan 22, 2019Updated 7 years ago
looker / spark_log_data
View on GitHub
Flume-to-Spark-Streaming Log Parser
☆23Jun 3, 2016Updated 10 years ago
phatak-dev / spark2.0-examples
View on GitHub
Examples of Spark 2.0
☆213Aug 11, 2021Updated 4 years ago
japila-books / apache-spark-internals
View on GitHub
The Internals of Apache Spark
☆1,547Jul 18, 2026Updated last week
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
awesome-spark / spark-gotchas
View on GitHub
Spark Gotchas. A subjective compilation of the Apache Spark tips and tricks
☆359Jun 6, 2017Updated 9 years ago
spoddutur / spark-notes
View on GitHub
☆313Nov 26, 2018Updated 7 years ago
qubole / streaminglens
View on GitHub
Qubole Streaminglens tool for tuning Spark Structured Streaming Pipelines
☆17Jan 21, 2020Updated 6 years ago
elbaulp / DPASF
View on GitHub
My MSc on Data Science final project. This is a library for Data Pre-processing Algorithms for Streaming in Flink (DPASF)
☆18Jul 1, 2019Updated 7 years ago
bartosz25 / spark-scala-playground
View on GitHub
Sample processing code using Spark 2.1+ and Scala
☆51Jun 28, 2020Updated 6 years ago
holdenk / spark-structured-streaming-ml
View on GitHub
Structured Streaming Machine Learning example with Spark 2.0
☆95Apr 24, 2017Updated 9 years ago
DataStax-Examples / SparkBuildExamples
View on GitHub
Example projects for using Spark and Cassandra With DSE Analytics
☆59Oct 10, 2025Updated 9 months ago
lgscofield / spring-spark
View on GitHub
This project enables you to use spring inside of a spark application.
☆11May 6, 2015Updated 11 years ago
devmindset / sparkscalainterview
View on GitHub
Contain Interview Questions Solutions
☆12May 18, 2018Updated 8 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
DataDog / spark-jvm-profiler
View on GitHub
## Auto-archived due to inactivity. ## Simple JVM Profiler Using StatsD and Other Metrics Backends
☆15Oct 3, 2023Updated 2 years ago
zaratsian / Spark
View on GitHub
Apache Spark (Scala, PySpark, SparkR) Code, Tricks, and References
☆69Jan 21, 2019Updated 7 years ago
holdenk / spark-validator
View on GitHub
A library you can include in your Spark job to validate the counters and perform operations on success. Goal is scala/java/python support…
☆111Feb 1, 2018Updated 8 years ago
stackleader / karaf-grpc
View on GitHub
☆14Jun 24, 2016Updated 10 years ago
ekampf / PySpark-Boilerplate
View on GitHub
A boilerplate for writing PySpark Jobs
☆393Jan 21, 2024Updated 2 years ago
Instrument / query-it
View on GitHub
This repository houses the Query It! experience.
☆11Apr 29, 2020Updated 6 years ago
aws-samples / amazon-sagemaker-predict-accessibility
View on GitHub
Build end-to-end Machine Learning pipeline to predict accessibility of playgrounds in NYC
☆15Jul 9, 2020Updated 6 years ago
dimajix / spark-training
View on GitHub
Repository used for Spark Trainings
☆54Apr 21, 2023Updated 3 years ago
spirom / spark-streaming-with-kafka
View on GitHub
Self-contained examples of Apache Spark streaming integrated with Apache Kafka.
☆196Apr 15, 2018Updated 8 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
high-performance-spark / high-performance-spark-examples
View on GitHub
Examples for High Performance Spark
☆533May 3, 2026Updated 2 months ago
brfulu / us-accidents-data-engineering
View on GitHub
Udacity Data Engineer Nanodegree - Capstone project
☆11Dec 19, 2019Updated 6 years ago
holdenk / spark-testing-base
View on GitHub
Base classes to use when writing tests with Spark
☆1,555Apr 20, 2026Updated 3 months ago
drabastomek / learningPySpark_video
View on GitHub
Learning PySpark video series
☆11Mar 5, 2018Updated 8 years ago
TomLous / databricks-spark-training
View on GitHub
☆38May 27, 2025Updated last year
jaceklaskowski / spark-workshop
View on GitHub
Apache Spark™ and Scala Workshops
☆264Jul 29, 2024Updated 2 years ago
rklick-solutions / spark-tutorial
View on GitHub
This tutorial provides a quick introduction to using Spark
☆58Mar 31, 2016Updated 10 years ago
qubole / s3-sqs-connector
View on GitHub
A library for reading data from Amzon S3 with optimised listing using Amazon SQS using Spark SQL Streaming ( or Structured streaming).
☆19Apr 20, 2024Updated 2 years ago
phatak-dev / spark-3.0-examples
View on GitHub
Examples of Spark 3.0
☆44Nov 11, 2020Updated 5 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
shamsbayzid / mimic-cdm
View on GitHub
CDM conversion of MIMIC dataset.
☆17Jun 19, 2016Updated 10 years ago
firmai / financial-machine-learning-regulation
View on GitHub
A look at regulatory challenges and recommendation in the age of AI. Investigating topics like monopoly formation, machine learning audit…
☆14Jun 7, 2019Updated 7 years ago
databricks / spark-csv
View on GitHub
CSV Data Source for Apache Spark 1.x
☆1,057Dec 13, 2018Updated 7 years ago
bosea / spark-unit-testing
View on GitHub
A tutorial on Apache Spark Unit Testing
☆38Jan 27, 2016Updated 10 years ago
chaosiq / demos
View on GitHub
Demos of discovering weaknesses in various systems
☆16Nov 26, 2018Updated 7 years ago
tharsha18 / gluelabs
View on GitHub
☆14Aug 10, 2021Updated 4 years ago
sigopt / sigopt-spark
View on GitHub
☆11Aug 22, 2023Updated 2 years ago