liliasfaxi / Atelier-SparkLinks
Cours et TP sur Apache Spark
☆11Updated 3 years ago
Alternatives and similar repositories for Atelier-Spark
Users that are interested in Atelier-Spark are comparing it to the libraries listed below
Sorting:
- ☆17Updated 10 months ago
- ☆26Updated 9 months ago
- Realtime Data Engineering Project☆31Updated 5 months ago
- Dockerizing an Apache Spark Standalone Cluster☆43Updated 2 years ago
- This project shows how to capture changes from postgres database and stream them into kafka☆36Updated last year
- Collection of NiFi-related stuff☆24Updated 2 years ago
- Create a streaming data, transfer it to Kafka, modify it with PySpark, take it to ElasticSearch and MinIO☆62Updated last year
- The Ultimate Hands-On Hadoop - Tame your Big Data!: https://www.udemy.com/the-ultimate-hands-on-hadoop-tame-your-big-data/☆8Updated 6 years ago
- ☆22Updated 4 years ago
- An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Ka…☆257Updated 4 months ago
- An awesome Analytics Engineering repository to learn and apply for real world problems.☆38Updated last year
- EverythingApacheNiFi☆112Updated last year
- Cassandra + Spark = ❤️ Machine Learning with Apache Spark & Cassandra☆20Updated 3 years ago
- Complete PySpark Guide for the beginners... I prepared this notebook for my students.☆18Updated 5 years ago
- MLflow related work☆39Updated last year
- Python data repo, jupyter notebook, python scripts and data.☆518Updated 6 months ago
- The goal of this project is to build a docker cluster that gives access to Hadoop, HDFS, Hive, PySpark, Sqoop, Airflow, Kafka, Flume, Pos…☆64Updated 2 years ago
- ☆87Updated 2 years ago
- I am using confluent Kafka cluster to produce and consume scraped data. In this project, I've created a real-time data pipeline that uti…☆29Updated 2 years ago
- Code base for airflow training series Getting easy with Apache Airflow☆40Updated last year
- Source code of the Apache Airflow Tutorial for Beginners on YouTube Channel Coder2j (https://www.youtube.com/c/coder2j)☆305Updated last year
- ☆24Updated 3 years ago
- Django-based course management platform for Zoomcamps☆67Updated last week
- Spark all the ETL Pipelines☆32Updated last year
- Exercises performed as part of the ML Zoomcamp course☆30Updated 3 years ago
- Produce Kafka messages, consume them and upload into Cassandra, MongoDB.☆42Updated last year
- Build & Learn Data Engineering,Machine Learning over Kubernetes. No Shortcut approach.☆57Updated 2 years ago
- A real-time streaming ETL pipeline for streaming and performing sentiment analysis on Twitter data using Apache Kafka, Apache Spark and D…☆30Updated 4 years ago
- End to end data engineering project with kafka, airflow, spark, postgres and docker.☆96Updated 3 months ago
- ☆32Updated 3 years ago