apssouza22 / big-data-pipeline-lambda-archLinks
A full big data pipeline (Lambda Architecture) with Spark, Kafka, HDFS and Cassandra.
☆179Updated 2 years ago
Alternatives and similar repositories for big-data-pipeline-lambda-arch
Users that are interested in big-data-pipeline-lambda-arch are comparing it to the libraries listed below
Sorting:
- Educational notes,Hands on problems w/ solutions for hadoop ecosystem☆87Updated 6 years ago
- How to build an awesome data engineering team☆100Updated 5 years ago
- Supporting repository for the blog post at https://medium.com/@stephane.maarek/how-to-use-apache-kafka-to-transform-a-batch-pipeline-into…☆244Updated last year
- Databricks - Apache Spark™ - 2X Certified Developer☆266Updated 4 years ago
- Apache Spark 3 - Structured Streaming Course Material☆45Updated 4 years ago
- A simple spark standalone cluster for your testing environment purposses☆570Updated last year
- This is the central repository for all materials related to Kafka Streams : Real-time Stream Processing! Book by Prashant Pandey.☆166Updated 4 years ago
- This repository contains code for Spark Streaming☆22Updated 4 years ago
- Examples To Help You Learn Apache Spark☆77Updated 6 years ago
- Learn Apache Spark in Scala, Python (PySpark) and R (SparkR) by building your own cluster with a JupyterLab interface on Docker.☆489Updated 2 years ago
- Apache Spark Course Material☆90Updated 2 years ago
- Apche Spark Structured Streaming with Kafka using Python(PySpark)☆40Updated 6 years ago
- ☆75Updated 4 years ago
- Apache Spark 3 - Structured Streaming Course Material☆121Updated last year
- Real-world Spark pipelines examples☆83Updated 7 years ago
- Code for docker images☆39Updated 2 years ago
- Simple stream processing pipeline☆103Updated 11 months ago
- A simplified, lightweight ETL Framework based on Apache Spark☆586Updated last year
- A workspace to experiment with Apache Spark, Livy, and Airflow in a Docker environment.☆38Updated 4 years ago
- PySpark-ETL☆23Updated 5 years ago
- ☆309Updated 6 years ago
- Maven quick start for building Kafka Connect connectors.☆146Updated 4 years ago
- Spark on Kubernetes using Helm☆34Updated 4 years ago
- Twitter Sentiment Analysis using Spark and Kafka☆115Updated 6 years ago
- Docker with Airflow and Spark standalone cluster☆256Updated last year
- Learn the Confluent Schema Registry & REST Proxy☆191Updated last year
- Fully reproducible, Dockerized, step-by-step, demo on how to stream tables from Postgres to Kafka/KSQL back to Postgres. Detailed blog p…☆152Updated 3 years ago
- Tutorial for setting up a Spark cluster running inside of Docker containers located on different machines☆130Updated 2 years ago
- ☆247Updated 5 years ago
- Spark Examples☆125Updated 3 years ago