tiagotxm / yt-spark-no-kubernetes
☆10Updated this week
Alternatives and similar repositories for yt-spark-no-kubernetes:
Users that are interested in yt-spark-no-kubernetes are comparing it to the libraries listed below
- Spark development environment for kubernetes, spark-submit and jupyter notebook☆19Updated 3 years ago
- ☆61Updated 11 months ago
- Demo DAGs that show how to run dbt Core in Airflow using Cosmos☆53Updated 4 months ago
- Data Engineering com Apache Spark☆43Updated 3 years ago
- ☆18Updated 3 years ago
- A repository of sample code to accompany our blog post on Airflow and dbt.☆169Updated last year
- Docker with Airflow and Spark standalone cluster☆249Updated last year
- 📡 Real-time data pipeline with Kafka, Flink, Iceberg, Trino, MinIO, and Superset. Ideal for learning data systems.☆36Updated last month
- This repo provides the Kubernetes Helm chart for deploying Pyspark Notebook.☆17Updated 2 years ago
- The Lakehouse Engine is a configuration driven Spark framework, written in Python, serving as a scalable and distributed engine for sever…☆234Updated 2 weeks ago
- ☆37Updated 2 years ago
- Delta Lake helper methods in PySpark☆315Updated 5 months ago
- ☆22Updated 3 years ago
- An exercise running Kafka, Kafka Connect, PostgreSQL, Superset and AWS S3☆21Updated 3 years ago
- This is an ETL application on AWS with general open sales and customer data that you can find here: https://github.com/camposvinicius/dat…☆17Updated 3 years ago
- Astro SDK allows rapid and clean development of {Extract, Load, Transform} workflows using Python and SQL, powered by Apache Airflow.☆362Updated this week
- ☆15Updated 10 months ago
- ☆111Updated 6 months ago
- Resources for video demonstrations and blog posts related to DataOps on AWS☆172Updated 3 years ago
- The resources of the preparation course for Databricks Data Engineer Professional certification exam☆102Updated this week
- Astronomer Starship can send your Airflow workloads to new places!☆28Updated this week
- ☆258Updated 3 months ago
- dbt-spark contains all of the code enabling dbt to work with Apache Spark and Databricks☆419Updated last week
- Notebooks e dicas sobre Databricks☆20Updated 3 months ago
- Repo for saving cheat sheets☆46Updated 8 months ago
- ☆119Updated last week
- Run your dbt Core projects as Apache Airflow DAGs and Task Groups with a few lines of code☆850Updated this week
- A template repository to create a data project with IAC, CI/CD, Data migrations, & testing☆256Updated 7 months ago
- Supplementary Materials for the The Complete dbt (Data Build Tool) Bootcamp Udemy course☆519Updated 2 weeks ago
- ☆43Updated 3 months ago