System Design, Solution Architecture, Data Systems Practice
☆70Aug 14, 2025Updated 6 months ago
Alternatives and similar repositories for data-systems-design
Users that are interested in data-systems-design are comparing it to the libraries listed below
- adidas Data Mesh implementation☆12May 13, 2022Updated 3 years ago
- Learn Scala by practicing on LeetCode.☆30May 4, 2018Updated 7 years ago
- Python Package to Share/Edit Pandas/Polars DF with web interface!☆11Jun 10, 2025Updated 8 months ago
- How to customize Tableau authentication using the AWS Athena JDBC Credentials Provider capabilities.☆14Jun 8, 2020Updated 5 years ago
- ☆11Nov 26, 2024Updated last year
- ☆15Feb 11, 2026Updated 2 weeks ago
- Rust And Delta Demo. Explanation and walkthrough on delta-rs☆10Aug 21, 2023Updated 2 years ago
- Associated blog post - https://tristanrhodes.com/blog/Adventures-in-Algorithmic-Trading-on-the-Runescape-Grand-Exchange☆10Oct 14, 2024Updated last year
- Learn how to apply design principles, patterns, and architectures to create reusable, flexible, and maintainable software applications an…☆11Nov 9, 2023Updated 2 years ago
- Sample Spring Boot project implementing a REST CRUD application☆16Oct 19, 2021Updated 4 years ago
- ☆12Jun 3, 2023Updated 2 years ago
- This repository provides end-to-end IaC code to build an intelligent GenAI chatbot based on Amazon Bedrock☆12Jun 13, 2025Updated 8 months ago
- Automatically generate DBML files from Snowflake databases for quickly reverse engineer interactive ER diagrams and documentation from yo…☆17May 5, 2024Updated last year
- Go wrapper around SSH that speaks AWS API☆16Aug 15, 2023Updated 2 years ago
- Helper for handling PySpark DataFrame partition size 📑🎛️☆12Mar 8, 2024Updated last year
- End to end data engineering project with kafka, airflow, spark, postgres and docker.☆108Jan 8, 2026Updated last month
- The project XOS or Experimental Operating System is a platform to help in developing a toy operating system.☆66Apr 11, 2018Updated 7 years ago
- an end-to-end data pipeline extracting music listening habits and producing an insightful dashboard☆17Mar 31, 2024Updated last year
- Build a Full stack Q&A Chatbot with Langchain, and LLM Models on Amazon Sagemaker☆12Nov 10, 2023Updated 2 years ago
- Deploy an AWS ECS Cluster of EC2 Instances with Terraform☆13Dec 26, 2023Updated 2 years ago
- Apache Arrow Flight example☆11Nov 9, 2020Updated 5 years ago
- In this project I have built an ETL pipeline which scrapes the trending repositories based on month, week and day LIVE, extract other related info…☆12Sep 9, 2023Updated 2 years ago
- Support for multiple broker hosts and basic "failover" on the client side.☆22Feb 20, 2013Updated 13 years ago
- Capstone project for Dataengineer.io bootcamp (public repo)☆12Feb 20, 2024Updated 2 years ago
- Code for Apache Hudi, Apache Iceberg and Delta Lake analysis☆10Feb 2, 2024Updated 2 years ago
- Streaming analytics project with eventsim and Kafka☆13Dec 23, 2022Updated 3 years ago
- An intelligent predictive text entry platform. Mirror of git://git.code.sf.net/p/presage/presage Please send reports to the SourceForge b…☆11Aug 17, 2015Updated 10 years ago
- A tool for comparing large S3 buckets☆17Jan 22, 2026Updated last month
- A no-dependency library to send standardized events to observability and data platforms. Based on plugins, Stratum enables the cataloging…☆26Updated this week
- rb_status_plugin : Data confidence tool for Airflow☆12Jan 7, 2023Updated 3 years ago
- Ssebowa is a free and open source Python library that provides generative-ai models.☆14Jan 31, 2024Updated 2 years ago
- Spark Structured Streaming data pipeline that processes movie ratings data in real-time.☆13Feb 11, 2026Updated 2 weeks ago
- https://www.distributedpython.com/2018/05/01/unit-testing-celery-tasks☆13Jun 26, 2018Updated 7 years ago
- I self petitioned my EB1A and got approved. This repository contains my original petition, RFE response, and link to resources I used.☆23Sep 15, 2025Updated 5 months ago
- The Cloud Squad is a web application that helps users assess their AWS certification exam readiness through a 30-minute test with real-ti…☆14Mar 24, 2025Updated 11 months ago
- How to run DBT on AWS Fargate☆13Oct 15, 2019Updated 6 years ago
- Try and measure tokio task overhead☆11Dec 20, 2021Updated 4 years ago
- 🌟 An end-to-end full-stack data science project, including modelling, MLOps, and data storytelling. ✨☆16Aug 30, 2025Updated 6 months ago
- Birgitta is a Python ETL test and schema framework, providing automated tests for pyspark notebooks/recipes.☆14Nov 9, 2023Updated 2 years ago