liliasfaxi / Atelier-Spark
Cours et TP sur Apache Spark
☆11Updated 3 years ago
Alternatives and similar repositories for Atelier-Spark:
Users that are interested in Atelier-Spark are comparing it to the libraries listed below
- Realtime Data Engineering Project☆27Updated last month
- ☆42Updated 3 years ago
- ☆15Updated 3 years ago
- This repo contains Big Data Project, its about "Real Time Twitter Sentiment Analysis via Kafka, Spark Streaming, MongoDB and Django Dashb…☆16Updated 9 months ago
- Create a streaming data, transfer it to Kafka, modify it with PySpark, take it to ElasticSearch and MinIO☆59Updated last year
- An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Ka…☆230Updated last week
- PDF DataSource for Apache Spark☆41Updated 3 weeks ago
- Run Hadoop Custer within Docker Containers☆26Updated 7 months ago
- MLflow related work☆38Updated last year
- Main TDP repository☆57Updated last month
- This project shows how to capture changes from postgres database and stream them into kafka☆35Updated 9 months ago
- Build & Learn Data Engineering,Machine Learning over Kubernetes. No Shortcut approach.☆58Updated 2 years ago
- used Airflow, Postgres, Kafka, Spark, and Cassandra, and GitHub Actions to establish an end-to-end data pipeline☆27Updated last year
- ☆11Updated last year
- Docker Apache Airflow☆13Updated last year
- This project demonstrates how to use Apache Airflow to submit jobs to Apache spark cluster in different programming laguages using Python…☆38Updated 11 months ago
- A Kafka Connect Single Message Transform (SMT) that enables you to append the record key to the value as a named field☆18Updated this week
- This repository contains the code for a realtime election voting system. The system is built using Python, Kafka, Spark Streaming, Postgr…☆34Updated last year
- ☆11Updated 4 years ago
- Spark all the ETL Pipelines☆32Updated last year
- Public Docker Images for popular services☆21Updated last month
- My Setup Development Environment as Data Engineer☆23Updated 3 weeks ago
- ☆92Updated 3 years ago
- This repo is for the Linkedin Learning course: End-to-End Data Engineering Project☆19Updated last year
- NLP course for ITI AI-Pro track☆58Updated 3 years ago
- Creation of a data lakehouse and an ELT pipeline to enable the efficient analysis and use of data☆43Updated last year
- Tutorial repo for an end-to-end Data Science project☆153Updated last year
- ☆87Updated 2 years ago
- Produce Kafka messages, consume them and upload into Cassandra, MongoDB.☆39Updated last year
- Tutorial for setting up a Spark cluster running inside of Docker containers located on different machines☆128Updated 2 years ago