awesome-mlops / awesome-data-management
A curated list of awesome open source tools and commercial products to catalog, version, and manage data 🚀
☆25 Updated 2 years ago
Related projects
Alternatives and complementary repositories for awesome-data-management
- This repository auto-configures an Apache Pinot and Superset cluster for analyzing IRA tweets from FiveThirtyEight. ☆11 Updated 4 years ago
- Apache Spark-based framework for analyzing A/B experiments ☆11 Updated this week
- Triptych for data exchange and persistence ☆22 Updated 7 months ago
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution. ☆51 Updated last week
- A toy DuckDB-based time-series database ☆14 Updated 4 years ago
- Awesome Orchest projects, both official and submitted by the community. ☆25 Updated last year
- ☆12 Updated last year
- This repository contains code to build an MVP search engine with a Google-like interface. ☆16 Updated 4 years ago
- Awesome list of dataops products, open source and resources ☆24 Updated 2 years ago
- ☆22 Updated 2 years ago
- Functional composable pipelines allowing clean separation of the business logic and its implementation ☆11 Updated 5 months ago
- Common Paper Service Level Agreement ☆13 Updated 7 months ago
- Build a directory full of files into a SQLite database ☆12 Updated 9 months ago
- A few end to end examples that use data-describe ☆16 Updated last year
- Astronomer Vendor Images ☆12 Updated this week
- Demonstration of how to perform continuous model monitoring on CML using Model Metrics and Evidently.ai dashboards ☆12 Updated 7 months ago
- A collection of tools that can be used for LLM function calling ☆32 Updated 8 months ago
- My dot files in one place - extensively edited over time. Your mileage may vary ☆2 Updated 8 years ago
- Ssebowa is a free and open source Python library that provides generative AI models. ☆14 Updated 9 months ago
- Using the Parquet file format with Python ☆14 Updated last year
- pysh-db - The Data Science Toolkit (DSK) ☆14 Updated 5 years ago
- This repository contains example implementations for KNIME Analytics Platform. ☆16 Updated 3 months ago
- Events about the open source data stack ☆13 Updated 2 years ago
- Batteries included toolkit for data engineering. ☆32 Updated 2 months ago
- A conda-smithy repository for python-duckdb. ☆13 Updated this week
- ☀️🦶 A lightweight framework for collaborative, open-source feature engineering ☆32 Updated 3 years ago
- Orchest quickstart pipeline ☆17 Updated 2 years ago
- A Pythonic API for the Amazon States Language for defining AWS Step Functions ☆8 Updated last year
- Provides an easy way to protect your data sources with Python by searching their metadata. 🛡️ ☆17 Updated last week