liliasfaxi / Atelier-Spark
Courses and labs on Apache Spark
☆12Updated 3 years ago
Alternatives and similar repositories for Atelier-Spark
Users that are interested in Atelier-Spark are comparing it to the libraries listed below
- This contains how to install Hadoop on Google Colab and how to run MapReduce in Hadoop☆33Updated 5 years ago
- Build & Learn Data Engineering, Machine Learning over Kubernetes. No Shortcut approach.☆57Updated 3 years ago
- ☆18Updated 4 years ago
- This is a comprehensive solution for real-time football analytics, leveraging Apache Spark execution on yarn for both streaming and batch…☆11Updated 3 months ago
- A Python package to submit and manage Apache Spark applications on Kubernetes.☆46Updated 5 months ago
- Different ways to process data into Cassandra in realtime with technologies such as Kafka, Spark, Akka, Flink☆31Updated 3 years ago
- Tutorial for setting up a Spark cluster running inside of Docker containers located on different machines☆135Updated 3 years ago
- Scraping my school's alumni Data from LinkedIn using a bot 🤖☆25Updated 4 years ago
- EverythingApacheNiFi☆116Updated 2 years ago
- The goal of this project is to build a docker cluster that gives access to Hadoop, HDFS, Hive, PySpark, Sqoop, Airflow, Kafka, Flume, Pos…☆76Updated 2 years ago
- Road to Azure Data Engineer Part-II: DP-201 - Designing an Azure Data Solution☆19Updated 5 years ago
- ☆14Updated 2 years ago
- Prescriptive guidance for building, deploying, and monitoring machine learning models with Azure Databricks using containers in line with…☆24Updated 2 weeks ago
- This project shows how to capture changes from a Postgres database and stream them into Kafka☆39Updated last year
- AWS Lambda function for S3 delete and copy data from source S3 to another target S3☆16Updated 6 years ago
- Azure Deployments using Terraform☆30Updated 3 years ago
- Dockerized monitoring stack for Apache Airflow☆35Updated last year
- Repository for all ITVersity Vagrant Boxes.☆32Updated 5 years ago
- A Series of Notebooks on how to start with Kafka and Python☆151Updated 11 months ago
- Road to Azure Data Engineer Part-I: DP-200 - Implementing an Azure Data Solution☆67Updated 5 years ago
- Create streaming data, transfer it to Kafka, modify it with PySpark, take it to ElasticSearch and MinIO☆65Updated 2 years ago
- ML and Deep Learning everyday challenge☆11Updated 4 years ago
- Learn how to deploy and manage a data tier based on Apache Cassandra™ cluster in Kubernetes using K8ssandra.☆22Updated 3 years ago
- Sample cloud-native application with 10 microservices showcasing Kubernetes, Istio, gRPC and OpenCensus.☆37Updated 2 years ago
- ☆88Updated 3 years ago
- ☆75Updated 2 years ago
- This repository contains code for Spark Streaming☆26Updated 4 years ago
- CICD pipeline that deploys a dbt image on a GKE cluster☆11Updated 4 years ago
- Apache NiFi cluster running in Kubernetes☆61Updated last week
- As an attendee, you will find everything you need for the Cassandra Developer Workshop online☆89Updated 2 years ago