INSATunisia / TP-BigDataLinks
☆17Updated 9 months ago
Alternatives and similar repositories for TP-BigData
Users that are interested in TP-BigData are comparing it to the libraries listed below
Sorting:
- An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Ka…☆252Updated 3 months ago
- ☆7Updated 2 years ago
- Cours et TP sur Apache Spark☆11Updated 3 years ago
- The goal of this project is to build a docker cluster that gives access to Hadoop, HDFS, Hive, PySpark, Sqoop, Airflow, Kafka, Flume, Pos…☆64Updated 2 years ago
- This repository contains the code for a realtime election voting system. The system is built using Python, Kafka, Spark Streaming, Postgr…☆39Updated last year
- IBM Data Engineering Courses from Coursera☆72Updated 2 years ago
- This repository contains an Apache Flink application for real-time sales analytics built using Docker Compose to orchestrate the necessar…☆44Updated last year
- This project provides an end-to-end data processing and visualization of visa numbers in Japan using PySpark and Plotly. The spark cluste…☆11Updated last year
- Welcome to my data engineering projects repository! Here you will find a collection of data engineering projects that I have worked on.☆18Updated 2 years ago
- This project provides a comprehensive data pipeline solution to extract, transform, and load (ETL) Reddit data into a Redshift data wareh…☆139Updated last year
- All of my individual learning materials, documents, and notes from the process of getting the Coursera IBM Data Engineer Professional Cer…☆91Updated 2 years ago
- This Repo contains Jupyter Notebooks to recap on RDD, DataFrame, Spark Streaming and ML operations using Pyspark☆11Updated 7 months ago
- Python data repo, jupyter notebook, python scripts and data.☆511Updated 5 months ago
- Begin your Data Engineering journey☆50Updated 3 years ago
- Data engineering mentorship program☆272Updated 10 months ago
- ☆102Updated 2 years ago
- ☆150Updated 3 years ago
- My notes for AWS Data Engineer Associate☆43Updated 5 months ago
- Stream processing pipeline from Finnhub websocket using Spark, Kafka, Kubernetes and more☆347Updated last year
- Data Engineering Pilipinas is a community for data engineers, data analysts, data scientists, developers, AI / ML engineers, and users of…☆202Updated last week
- This project demonstrates how to use Apache Airflow to submit jobs to Apache spark cluster in different programming laguages using Python…☆43Updated last year
- Create a streaming data, transfer it to Kafka, modify it with PySpark, take it to ElasticSearch and MinIO☆60Updated last year
- Code test for data engineering candidates☆47Updated last year
- ☆23Updated 2 years ago
- This repo is for the Linkedin Learning course: End-to-End Data Engineering Project☆22Updated last year
- In this project, we setup and end to end data engineering using Apache Spark, Azure Databricks, Data Build Tool (DBT) using Azure as our …☆31Updated last year
- Realtime Data Engineering Project☆31Updated 4 months ago
- Code for blog at https://www.startdataengineering.com/post/python-for-de/☆77Updated last year
- This project unlocks the power of advanced analytics and reporting by transforming an OLTP architecture into an efficient OLAP setup. Lev…☆27Updated last year
- ☆32Updated 7 months ago