realpython / data-version-control
☆27Updated last year
Alternatives and similar repositories for data-version-control:
Users that are interested in data-version-control are comparing it to the libraries listed below
- A conda-smithy repository for python-duckdb.☆13Updated 2 weeks ago
- Workshop "From zero to MLOps: An open source stack to fight spaghetti ML"☆24Updated 8 months ago
- This project is created to promote and advocate the use of FOSS machine learning.☆43Updated last month
- A lightweight tool to measure the full memory of a Python session☆19Updated last month
- Demo code and other hand-out materials for our Python for Decision Makers and Business Leaders course☆24Updated 3 years ago
- Official Python client SDK for Iggy.rs message streaming.☆24Updated 3 weeks ago
- Feature flags for python.☆18Updated 7 years ago
- Git scrapers for scraping the fediverse☆15Updated this week
- Build a directory full of files into a SQLite database☆12Updated last year
- Deploying a simple FastAPI app to Fly.io >> https://fly-fastapi.fly.dev/docs <<☆14Updated last year
- Terraform on Localstack Examples☆13Updated 8 months ago
- Basic tutorial of using Apache Airflow☆36Updated 6 years ago
- Pandas Training © MetaSnake 2022, CC BY-NC☆18Updated 3 years ago
- Graphinate. Data to Graphs.☆24Updated last week
- A curated list of awesome open source tools and commercial products to catalog, version, and manage data 🚀☆32Updated 2 years ago
- Add Google and Python documentation links to the bottom of exceptions.☆27Updated last year
- Data exchange and persistence based on human-readable files☆22Updated 3 months ago
- Making Time Speak! 🎙️☆29Updated last month
- Swiple enables you to easily observe, understand, validate and improve the quality of your data☆82Updated this week
- Public source code for the Batch Processing with Apache Beam (Python) online course☆18Updated 4 years ago
- Full stack data engineering tools and infrastructure set-up☆50Updated 4 years ago
- Demo code exploring Python's memory models and collection algorithms from the Talk Python Training course.☆44Updated 4 years ago
- Documentation and resources for deploying JupyterHub on Hadoop☆18Updated 5 years ago
- DataHub on AWS demonstration resources☆10Updated 2 years ago
- Prism is the easiest way to develop, orchestrate, and execute data pipelines in Python.☆84Updated 4 months ago
- Tutorial: Exploratory Data Analysis, the Polars Way☆16Updated 8 months ago
- Project templates for Snowflake CLI init command☆13Updated this week
- Tracebacks for Humans (in Jupyter notebooks)☆12Updated 2 months ago
- hooqu is a library built on top of Pandas-like Dataframes for defining "unit tests for data". This is a spiritual port of Apache Deequ to…☆29Updated 3 months ago
- Your new best friend. Puppy is the easiest way to get started with modern python on any platform, install packages in virtual environmen…☆46Updated 2 weeks ago