awesome-mlops / awesome-data-management
A curated list of awesome open source tools and commercial products to catalog, version, and manage data
★36 · Updated 3 years ago
Alternatives and similar repositories for awesome-data-management
Users who are interested in awesome-data-management are comparing it to the libraries listed below.
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution. ★62 · Updated this week
- dbd is a database prototyping tool that enables data analysts and engineers to quickly load and transform data in SQL databases. ★57 · Updated 3 years ago
- Awesome Orchest projects, both official and submitted by the community. ★25 · Updated 2 years ago
- Data Catalog for Databases and Data Warehouses ★35 · Updated last year
- Provides an easy way to protect your data sources with Python by searching their metadata. 🛡️ ★18 · Updated this week
- A small Python module containing quick utility functions for standard ETL processes. ★36 · Updated 2 weeks ago
- Repo demonstrating a Dagster pipeline that generates a Neo4j graph ★22 · Updated 4 years ago
- Glue is an enterprise data model for the buy side, tailored for Wealth and Asset Managers and covering key entities such as Party, Busine… ★23 · Updated 2 years ago
- KNOTS is an intuitive desktop application built to simplify the configuration of Singer pipelines ★67 · Updated 2 years ago
- Data pipelines from re-usable components ★107 · Updated 2 years ago
- A monorepo of many Rill example projects ★42 · Updated last week
- A spreadsheet-like data preparation web app that works over Optimus (Pandas, Dask, cuDF, Dask-cuDF, Spark and Vaex) ★141 · Updated 2 years ago
- Tools for working with Singer Taps and Targets ★59 · Updated last year
- Python ELT Studio, an application for building ELT (and ETL) data flows. ★58 · Updated 3 years ago
- ★10 · Updated 3 years ago
- Beneath is a serverless real-time data platform ⚡️ ★84 · Updated 3 years ago
- CLI for creating databases for Data Quality Dashboards. ★19 · Updated 5 years ago
- Async bulk data ingestion and querying in various document, graph and vector databases via their Python clients ★38 · Updated last year
- A maximum-strength name parser for record linkage. ★38 · Updated last week
- ★11 · Updated last year
- This project was created to promote and advocate the use of FOSS machine learning. ★46 · Updated 4 months ago
- This project provides an example of consolidating Milvus (vector search engine) and PostgreSQL (relational database) to carry out the hyb… ★11 · Updated 4 years ago
- A curated list of dagster code snippets for data engineers ★57 · Updated last year
- A curated list of data wrangling resources ★39 · Updated 6 years ago
- Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code. ★126 · Updated 4 years ago
- ODD Specification is a universal open standard for collecting metadata. ★144 · Updated 10 months ago
- A lightweight framework for collaborative, open-source feature engineering ★33 · Updated 3 years ago
- Documentation and resources for deploying JupyterHub on Hadoop ★19 · Updated 6 years ago
- ★16 · Updated last year
- MLOps simplified. One-stop AI delivery platform, all the features you need. ★100 · Updated last week