awesome-mlops / awesome-data-management
A curated list of awesome open source tools and commercial products to catalog, version, and manage data
☆27 · Updated 2 years ago
Related projects
Alternatives and complementary repositories for awesome-data-management
- Apache Spark based framework for analysis A/B experiments ☆11 · Updated 3 weeks ago
- This repository auto-configures an Apache Pinot and Superset cluster for analyzing IRA tweets from FiveThirtyEight. ☆11 · Updated 4 years ago
- Demonstration of how to perform continuous model monitoring on CML using Model Metrics and Evidently.ai dashboards ☆12 · Updated 7 months ago
- Batteries included toolkit for data engineering. ☆32 · Updated this week
- Generate Hive CREATE TABLE statements from json data ☆10 · Updated 7 years ago
- ☆22 · Updated 2 years ago
- Ssebowa is free and open source library in Python that provides generative-ai models. ☆14 · Updated 9 months ago
- Triptych for data exchange and persistence ☆23 · Updated 8 months ago
- a toy duckdb based timeseries database ☆14 · Updated 4 years ago
- Documentation and resources for deploying JupyterHub on Hadoop ☆18 · Updated 5 years ago
- Common Paper Service Level Agreement ☆13 · Updated 7 months ago
- Awesome Orchest projects, both official and submitted by the community. ☆25 · Updated last year
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution. ☆52 · Updated 3 weeks ago
- Clone of chatgpt built with Bytewax, Streamlit and NATS ☆15 · Updated last year
- Astronomer Vendor Images ☆12 · Updated this week
- Connect to your customer data using any LLM and gain actionable insights. IdentityRAG creates a single comprehensive customer 360 view (g… ☆25 · Updated this week
- 💻 CLI for reporting events to Faros platform ☆14 · Updated last month
- My dot files in one place - extensively edited over time. Your mileage may vary ☆2 · Updated 8 years ago
- a graph definition and execution library for python ☆16 · Updated last year
- bamboolib - template for creating your own binder notebook ☆21 · Updated 2 years ago
- Plugin for Intake to read from SQL servers ☆15 · Updated last year
- Build a directory full of files into a SQLite database ☆13 · Updated 10 months ago
- Example of a Streamlit data app powered by Vaex ☆10 · Updated 2 years ago
- Datamallet is a python library which contains several helper functions and module for the common tasks in a typical data science workflow… ☆11 · Updated 2 years ago
- A collection of tools that can be used for LLM function calling ☆32 · Updated 8 months ago
- Awesome list of dataops products, open source and resources ☆24 · Updated 2 years ago
- Python context manager to communicate with a subprocess using iterables: for when data is too big to fit in memory and has to be streamed ☆9 · Updated last month
- Orchest quickstart pipeline ☆17 · Updated 2 years ago
- Datasette plugin for authenticating access using API tokens ☆12 · Updated 2 months ago