shauryashaurya / learn-data-munging
Notes on Data Engineering with Pandas, PySpark, Dask, Ray, Arrow DataFusion, Polars etc.
☆48Updated last week
Alternatives and similar repositories for learn-data-munging:
Users that are interested in learn-data-munging are comparing it to the libraries listed below
- Code and materials for Effective Polars book☆79Updated last year
- ☆29Updated 9 months ago
- Code for my "Efficient Data Processing in SQL" book.☆56Updated 8 months ago
- A FastMCP tool to search and retrieve Polars API documentation.☆27Updated this week
- Cost Efficient Data Pipelines with DuckDB☆51Updated 8 months ago
- Public notebooks and datasets to accompany the Data Analysis with Polars course on Udemy☆42Updated last year
- Code to help generate SQL for stakeholders. Code at https://www.startdataengineering.com/post/data-democratize-llm/☆11Updated 11 months ago
- Intro to Polars Tutorial☆23Updated 2 years ago
- A tutorial that helps Big Data Engineers ramp up faster by getting familiar with PySpark dataframes and functions. It also covers topics …☆20Updated 3 years ago
- Pandas Training © MetaSnake 2022, CC BY-NC☆18Updated 3 years ago
- ☆27Updated 2 years ago
- Full stack data engineering tools and infrastructure set-up☆51Updated 4 years ago
- PipeRider dbt workshop for DataTalksClub DE Zoomcamp☆17Updated last year
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou…☆113Updated last year
- Essential PySpark for Scalable Data Analytics, published by Packt☆44Updated 2 years ago
- A simple and easy to use Data Quality (DQ) tool built with Python.☆50Updated last year
- DataTalks Workshop Materials☆18Updated last year
- ☆17Updated 8 months ago
- Demo on how to use Prefect with Docker☆25Updated 2 years ago
- rust-for-data☆45Updated last year
- Some example projects for Data Engineers to build, end-to-end.☆28Updated last year
- A portable Datamart and Business Intelligence suite built with Docker, sqlmesh + dbtcore, DuckDB and Superset☆49Updated 5 months ago
- Syllabus for Artificial Intelligence for Product Innovation Master of Engineering: https://ai.meng.duke.edu/degree☆32Updated last year
- A custom end-to-end analytics platform for customer churn☆11Updated 3 months ago
- Repo for CDC with debezium blog post☆28Updated 7 months ago
- Sample project that use Dagster, dbt, DuckDB and Dash to visualize car and motorcycle Spanish market☆58Updated 2 years ago
- csv and flat-file sniffer built in Rust.☆42Updated last year
- Example project for building scalable data pipelines with Kedro and Ibis.☆13Updated last year
- ☆26Updated 3 years ago
- Dagster University courses☆76Updated last week