liliasfaxi / Atelier-SparkLinks
Cours et TP sur Apache Spark
☆11Updated 3 years ago
Alternatives and similar repositories for Atelier-Spark
Users that are interested in Atelier-Spark are comparing it to the libraries listed below
Sorting:
- TunBERT is the first release of a pre-trained BERT model for the Tunisian dialect using a Tunisian Common-Crawl-based dataset. TunBERT wa…☆114Updated 2 years ago
- ☆17Updated 9 months ago
- Run Hadoop Custer within Docker Containers☆29Updated last month
- ☆11Updated 5 years ago
- A simple guide to MLOps through ZenML and its various integrations.☆187Updated last year
- My notes for AWS Data Engineer Associate☆43Updated 5 months ago
- Tunisian Sentiment Analysis Corpus.☆27Updated 4 years ago
- ☆24Updated 3 years ago
- Practical MLOps O'Reilly Book - Personal Extended Version☆15Updated 2 years ago
- A collection of helm (https://helm.sh) charts for datascience. Usable with Onyxia (https://github.com/inseefrlab/onyxia-api).☆18Updated 8 months ago
- This repository contains code for Spark Streaming☆22Updated 4 years ago
- ☆30Updated 2 weeks ago
- Complete PySpark Guide for the beginners... I prepared this notebook for my students.☆17Updated 5 years ago
- Build & Learn Data Engineering,Machine Learning over Kubernetes. No Shortcut approach.☆57Updated 2 years ago
- Django-based course management platform for Zoomcamps☆67Updated 3 weeks ago
- ☆87Updated 2 years ago
- ☆15Updated 3 years ago
- Cloudera_Material: Study Material to help people preparing for Cloudera CCA Spark and Hadoop Developer Exam (CCA175). Feel free to collab…☆37Updated 5 years ago
- ☆40Updated 3 years ago
- The goal of this project is to build a docker cluster that gives access to Hadoop, HDFS, Hive, PySpark, Sqoop, Airflow, Kafka, Flume, Pos…☆64Updated 2 years ago
- ☆26Updated 4 years ago
- Copy Hive tables definitions to Compute Cluster, while still using Storage on original cluster☆11Updated 2 weeks ago
- Course materials for Reinforcement Learning course in Arabic☆22Updated 6 years ago
- Create a streaming data, transfer it to Kafka, modify it with PySpark, take it to ElasticSearch and MinIO☆60Updated last year
- Main TDP repository☆57Updated 2 weeks ago
- Realtime Data Engineering Project☆31Updated 4 months ago
- ☆28Updated last year
- This project shows how to capture changes from postgres database and stream them into kafka☆36Updated last year
- ☆45Updated 4 years ago
- ☆23Updated 2 years ago