shauryashaurya/learn-data-munging

Notes on Data Engineering with Pandas, PySpark, Dask, Ray, Arrow DataFusion, Polars etc.

☆53

Alternatives and similar repositories for learn-data-munging

Users who are interested in learn-data-munging are comparing it to the repositories listed below.

dipankarmazumdar / iceberg-in-production
A repository of blogs/videos that presents how Apache Iceberg is being used in Production by various orgs
☆20 · Jul 31, 2023 · Updated 2 years ago
paulreimer / esp32-simulator
FreeRTOS-Sim (FreeRTOS POSIX port), and collection of stubbed esp-idf components to allow running esp-idf apps on POSIX OSes (macOS)
☆11 · Sep 21, 2018 · Updated 7 years ago
deepyaman / jaffle-shop
Example project for building scalable data pipelines with Kedro and Ibis.
☆14 · Dec 10, 2025 · Updated 6 months ago
sfrechette / stream-sequelize-node
Ingest sample Market Orders Data feed from PubNub to Postgres with TimescaleDB extension installed and enabled for time series analysis.
☆14 · Jan 21, 2026 · Updated 5 months ago
k2datascience / twitter_filter
Streaming Twitter Filter with Python & Redis
☆18 · Oct 25, 2018 · Updated 7 years ago
capwan / Stopwatch_timer
Stopwatch timer on JS
☆11 · Sep 4, 2023 · Updated 2 years ago
mensink / arduino-lib-MCP42010
Arduino library for using the MCP42010 digital potentiometer with SPI
☆12 · May 23, 2015 · Updated 11 years ago
jtec / prx
☆18 · Updated this week
MeirKaD / unified-search
☆19 · Feb 8, 2026 · Updated 5 months ago
ettorerizza / aat_reconcile
OpenRefine reconciliation service with Getty AAT (Art & Architecture Thesaurus)
☆12 · May 1, 2023 · Updated 3 years ago
sqlchick / Presentations
Presentation materials from community (public) presentations
☆11 · May 6, 2021 · Updated 5 years ago
Amarjit-ph / software-engineering
Documentation of my software engineering journey
☆12 · May 19, 2026 · Updated last month
sfrechette / adventureworks-neo4j
Importing AdventureWorks (SQL Server Sample Database) to Neo4j
☆15 · Jun 17, 2025 · Updated last year
thisisjeffsnow / project-euler-archives
My solutions for the first 100 problems in Project Euler.
☆10 · May 6, 2023 · Updated 3 years ago
thisisjeffsnow / leetcode-solutions
My solutions to LeetCode exercises.
☆13 · May 14, 2023 · Updated 3 years ago
capwan / Password-Generator
Password Generator
☆11 · Jun 20, 2026 · Updated 2 weeks ago
probabl-ai / calibration-cost-sensitive-learning
Tutorial on probabilistic classification and cost-sensitive learning.
☆13 · Aug 19, 2025 · Updated 10 months ago
espressif / esp-toolchain-docs
Repository with documentation related to toolchains and debuggers maintained by Espressif
☆24 · Sep 11, 2025 · Updated 9 months ago
intentional-ai / intentional
Intentional is an open-source framework to build reliable LLM chatbots that actually talk and behave as you expect.
☆12 · Dec 31, 2024 · Updated last year
nextcloud / context_chat_backend
☆21 · Jun 29, 2026 · Updated last week
SliceCast / Bullet-Force-Source-Code
Bullet Force Source Code for iOS & Android
☆13 · Dec 31, 2021 · Updated 4 years ago
anaconda / state-of-data-science
Data from the state of data science survey released by Anaconda each year.
☆17 · Aug 15, 2024 · Updated last year
zjffdu / zeppelin-notebook
☆12 · Jul 10, 2022 · Updated 3 years ago
aidevnn / FastGoat
What C# can do for studying Finite Groups, quotient groups, semi-direct products, homomorphisms, automorphisms group, characters table, m…
☆15 · Jul 1, 2026 · Updated last week
emilycodestar / cash-earning-game-cpp
Thing what most of the developers just dream of
☆17 · Aug 31, 2023 · Updated 2 years ago
PioneersHub / pytanis
🔱 Python client for Pretalx
☆29 · Jul 1, 2026 · Updated last week
AppServiceProvider / mini-blog-project
Uses Laravel Breeze, CRUD, soft delete, factories
☆14 · Oct 14, 2022 · Updated 3 years ago
ayyucekizrak / pratik-derin-ogrenme-uygulamalari
Practical deep learning applications built with various libraries, with code explanations in Turkish.
☆10 · Nov 20, 2017 · Updated 8 years ago
julialintern / intro_to_machine_learning
☆13 · May 8, 2023 · Updated 3 years ago
Enjoyop / coc-assets
Assets Clash of Clans.
☆14 · Apr 3, 2024 · Updated 2 years ago
natanast / 30DayChartChallenge
This repository contains my contributions to the #30DayChartChallenge
☆10 · May 2, 2026 · Updated 2 months ago
datastax-archive / workshop-intro-quarkus-cassandra
☆21 · Mar 11, 2023 · Updated 3 years ago
astrojuanlu / talk-kedro-huggingface
☆16 · Dec 13, 2023 · Updated 2 years ago
ikantkode / qwen2.5VLM-OCR
A simple streamlit app, dockerized, to do OCR on documents. I'm lazy, idk.
☆26 · Aug 18, 2025 · Updated 10 months ago
cosmohacker / cosmohacker
About me
☆20 · Mar 11, 2026 · Updated 3 months ago
NowinskiK / ssdt-training
All code from full ssdt training
☆18 · Dec 5, 2024 · Updated last year
tdpetrou / Machine-Learning-Tutorials
☆12 · Nov 23, 2022 · Updated 3 years ago
milaabl / milaabl
Oh that snake is eating my contributions! 🐍
☆15 · Sep 9, 2025 · Updated 10 months ago
abunuwas / fastapi-jwt-tutorial
Code for my JWT auth for FastAPI tutorial (https://youtu.be/C92mjEKUfNQ)
☆16 · Jan 24, 2022 · Updated 4 years ago