shauryashaurya / learn-data-mungingView external linksLinks
Notes on Data Engineering with Pandas, PySpark, Dask, Ray, Arrow DataFusion, Polars etc.
☆52Feb 3, 2026Updated last week
Alternatives and similar repositories for learn-data-munging
Users that are interested in learn-data-munging are comparing it to the libraries listed below
Sorting:
- A repository of blogs/videos that presents how Apache Iceberg is being used in Production by various orgs☆18Jul 31, 2023Updated 2 years ago
- ☆20Jul 10, 2024Updated last year
- ☆11Feb 17, 2022Updated 3 years ago
- R for Data Science (2e) in Simplified Chinese☆20Dec 23, 2025Updated last month
- ☆13Apr 18, 2024Updated last year
- Resources for O'Reilly Online Learning course, "First Steps with Power Query for Microsoft Excel"☆11Sep 22, 2021Updated 4 years ago
- shifts hg19/38 genomic position for feasible input format.☆12Jun 8, 2023Updated 2 years ago
- PDF Diff Viewer, a side-by-side, visual highlight, sync-scroll, PDF comparer, written in Python. Open source, mostly powered by PyMuPDF a…☆36Jan 31, 2026Updated 2 weeks ago
- These are a compilation of basic Python operations.☆11Sep 5, 2022Updated 3 years ago
- Suspension telemetry system for mountain bike or dirt bike☆10Apr 9, 2024Updated last year
- Repository for the dbt Semantic Layer course☆11Nov 13, 2025Updated 3 months ago
- functional genomic data integration☆10Sep 22, 2019Updated 6 years ago
- ☆11Dec 14, 2019Updated 6 years ago
- 🧾 Let's automate Invoice generation from CSV file (@jakobowsky YouTube tutorial)☆12Sep 12, 2020Updated 5 years ago
- QuickLook text preview and icon thumbnailing app extensions for macOS Catalina and beyond☆16May 31, 2025Updated 8 months ago
- I saw this [Blog Post](https://www.morling.dev/blog/one-billion-row-challenge/) on a Billion Row challenge for Java so naturally I tried …☆14Jan 10, 2024Updated 2 years ago
- ☆12Jan 27, 2026Updated 2 weeks ago
- Python wrapper for the Strava (http://www.strava.com) API☆25Nov 25, 2017Updated 8 years ago
- Links to PowerBi Tutorials☆14Updated this week
- This project sets up a real-time data pipeline utilizing Change Data Capture (CDC) to stream changes from a PostgreSQL database to a Clic…☆12May 9, 2024Updated last year
- Python module for measure the degree of association between variables☆13Apr 20, 2022Updated 3 years ago
- Appscript to pull stats from gmail and store in a google sheet☆10Mar 20, 2021Updated 4 years ago
- Nord Deep stylesheets for Matplotlib☆11Jul 24, 2023Updated 2 years ago
- This is the repo for CROssBARv2 Knowledge Graph data. CROssBARv2 is a heterogeneous general-purpose biomedical KG-based system.☆11Feb 4, 2026Updated last week
- DBT and clickhouse test project with dagster☆12Aug 29, 2023Updated 2 years ago
- My PowerApps that I practice and share some insights on LinkedIn☆13Updated this week
- BLE to Direct Connect bridge for bike trainers adding virtual shifting for Zwift☆14Nov 2, 2025Updated 3 months ago
- Reanalysis of the repressive capacity of promoter DNA methylation☆11Feb 14, 2019Updated 7 years ago
- A Polars plugin for encrypting and decrypting data using AES-GSM-CIV algorithm in Rust☆11Jan 8, 2025Updated last year
- This construct builds some elements for you to quickly launch an EMR Serverless application. After submitting the Emr Serverless job, you…☆11Nov 18, 2025Updated 2 months ago
- Some simple apps in solara☆15Nov 14, 2023Updated 2 years ago
- небольшая надстройка, предназначенная для оценки скорости выполнения запросов PQ и формул на листах в среде MS Excel в среде MS Excel☆16Sep 6, 2025Updated 5 months ago
- Data Analysis of the UCI MTB DH World Cup☆10Jul 15, 2018Updated 7 years ago
- PowerShell script that gives an Excel output of all Power BI workspace, Dataset, App, Report, and Page info (leveraging Power BI REST API…☆21Oct 13, 2024Updated last year
- UQ4DD: Uncertainty Quantification for Drug Discovery☆17Aug 4, 2025Updated 6 months ago
- Building a poor man's data lake: Exploring the Power of Polars and Delta Lake☆11Dec 6, 2025Updated 2 months ago
- A lightwight Framework for the Respiratory Sound Classification☆11Feb 12, 2025Updated last year
- ☆12Nov 18, 2024Updated last year
- End-to-End ELT data pipeline with Postgres, Airbyte, dbt, Dagster, Snowflake and Metabase☆11Jul 13, 2023Updated 2 years ago