monksy / awesome-data-engineering
A curated list of data engineering tools for software developers
☆13 Updated 7 years ago
Alternatives and similar repositories for awesome-data-engineering
Users who are interested in awesome-data-engineering are comparing it to the repositories listed below.
- Cloudera_Material: Study Material to help people preparing for Cloudera CCA Spark and Hadoop Developer Exam (CCA175). Feel free to collab… ☆40 Updated 5 years ago
- A collection of kafka-resources ☆211 Updated last week
- If you are planning or preparing for Apache Kafka Certification then this is the right place for you. There are many Apache Kafka Certific… ☆41 Updated 5 years ago
- ☆20 Updated 6 years ago
- Road to Azure Data Engineer Part-II: DP-201 - Designing an Azure Data Solution ☆19 Updated 5 years ago
- Apache Spark Interview Questions and Answers ☆21 Updated 5 years ago
- Apache Spark Course Material ☆96 Updated 2 years ago
- Road to Azure Data Engineer Part-I: DP-200 - Implementing an Azure Data Solution ☆67 Updated 5 years ago
- Build & Learn Data Engineering, Machine Learning over Kubernetes. No Shortcut approach. ☆57 Updated 3 years ago
- This is the central repository for all materials related to Kafka Streams: Real-time Stream Processing! Book by Prashant Pandey. ☆172 Updated 5 years ago
- Python and Airflow - Data Pipeline Orchestration ☆16 Updated 2 years ago
- Stream Processing Workshop ☆23 Updated 2 weeks ago
- Example Code for Kafka Tutorials @ Learning Journal ☆178 Updated 8 years ago
- ETL pipeline using PySpark (Spark - Python) ☆116 Updated 5 years ago
- Code and Notebooks for Spark Tutorials for Learning Journal @ Youtube ☆56 Updated 5 years ago
- This project will help beginners learn Kafka with ease. ☆49 Updated 2 years ago
- Data engineering interviews Q&A for data community by data community ☆66 Updated 5 years ago
- Different ways to process data into Cassandra in real time with technologies such as Kafka, Spark, Akka, Flink ☆31 Updated 3 years ago
- ☆88 Updated 3 years ago
- Educational notes, hands-on problems w/ solutions for Hadoop ecosystem ☆87 Updated 7 years ago
- Complete high-quality practice tests of 50 questions each will help you master your Confluent Certified Developer for Apache Kafka (CCDAK… ☆92 Updated 2 years ago
- This repo keeps all the relevant information for Confluent Certified Developer for Apache Kafka (CCDAK) ☆124 Updated 2 years ago
- Complete high-quality practice tests of 50 questions each will help you master your Confluent Certified Developer for Apache Kafka (CCDAK… ☆39 Updated 5 years ago
- Everything about Apache Kafka ☆207 Updated 2 years ago
- data engineering 100 days 🤖 🧲 🦾 | #DE ☆40 Updated 2 years ago
- Dockerizing an Apache Spark Standalone Cluster ☆42 Updated 3 years ago
- A hybrid Big Data pipeline architecture that combines a real-time streaming layer with a batch layer to process large datasets (Lambda Arc… ☆189 Updated 5 months ago
- This repo was created mostly for PySpark- and Hive-related interview questions. ☆63 Updated last month
- dbtVault + Greenplum demo ☆10 Updated last year
- Kafka-Notes ☆15 Updated 4 years ago