liliasfaxi / Atelier-Spark
Courses and practical labs on Apache Spark
☆11Updated 3 years ago
Alternatives and similar repositories for Atelier-Spark:
Users that are interested in Atelier-Spark are comparing it to the libraries listed below
- ☆45Updated 4 years ago
- Run Hadoop Cluster within Docker Containers☆28Updated 9 months ago
- Creation of a data lakehouse and an ELT pipeline to enable the efficient analysis and use of data☆45Updated last year
- Main TDP repository☆59Updated 2 months ago
- Spark all the ETL Pipelines☆32Updated last year
- Complete PySpark Guide for beginners... I prepared this notebook for my students.☆18Updated 5 years ago
- The Ultimate Hands-On Hadoop - Tame your Big Data!: https://www.udemy.com/the-ultimate-hands-on-hadoop-tame-your-big-data/☆8Updated 6 years ago
- Uses Airflow, Postgres, Kafka, Spark, Cassandra, and GitHub Actions to establish an end-to-end data pipeline☆27Updated last year
- ☆82Updated last year
- Enrolled in DataTalks Zoomcamp https://github.com/DataTalksClub/mlops-zoomcamp☆21Updated 2 years ago
- ☆22Updated 4 years ago
- Realtime Data Engineering Project☆27Updated 2 months ago
- How to install Hadoop on Google Colab and how to run MapReduce in Hadoop☆33Updated 4 years ago
- Template to spin up delta lake locally using docker☆23Updated last year
- Project documentation templates derived from CRISP-DM, for use in Data Engineering projects.☆53Updated 3 years ago
- Build & Learn Data Engineering, Machine Learning over Kubernetes. No Shortcut approach.☆57Updated 2 years ago
- ☆26Updated 7 months ago
- An awesome Analytics Engineering repository to learn and apply for real world problems.☆37Updated last year
- ☆33Updated last year
- This repository contains the code for a realtime election voting system. The system is built using Python, Kafka, Spark Streaming, Postgr…☆35Updated last year
- Azure Data Engineer Associate Certification Guide, published by Packt☆78Updated last year
- Repository related to Spark SQL and Pyspark using Python3☆37Updated 2 years ago
- A collection of helm (https://helm.sh) charts for datascience. Usable with Onyxia (https://github.com/inseefrlab/onyxia-api).☆18Updated 6 months ago
- My 7th-place solution (out of 245) in the Umoja Hack 2022 challenge☆18Updated 3 years ago
- ☆15Updated 3 years ago
- ☆64Updated 2 weeks ago
- Tunisian Sentiment Analysis Corpus.☆26Updated 4 years ago
- Data Engineering on GCP☆34Updated 2 years ago
- This project shows how to capture changes from a Postgres database and stream them into Kafka☆36Updated 10 months ago
- Exercise starters and solutions for cd0581-building-a-reproducible-model-workflow by Giacomo Vianello☆20Updated last year