awesome-mlops / awesome-data-management
A curated list of awesome open source tools and commercial products to catalog, version, and manage data
★33 · Updated 3 years ago
Alternatives and similar repositories for awesome-data-management
Users who are interested in awesome-data-management are comparing it to the repositories listed below
- Taking normal text as input and generating SQL commands using OpenAI's GPT-3 ★15 · Updated 4 years ago
- Functional composable pipelines allowing clean separation of the business logic and its implementation ★11 · Updated last year
- Apache Spark based framework for analyzing A/B experiments ★15 · Updated 7 months ago
- Scrape various open data directories to create an index of what's available out there ★37 · Updated 4 months ago
- Awesome Orchest projects, both official and submitted by the community. ★25 · Updated last year
- This repository contains code to build an MVP search engine with a Google-like interface. ★15 · Updated last week
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution. ★59 · Updated last week
- Cookiecutter for community-maintained Jupyter Docker images ★15 · Updated 2 weeks ago
- ★11 · Updated 4 months ago
- Batteries included toolkit for data engineering. ★34 · Updated 5 months ago
- This repository auto-configures an Apache Pinot and Superset cluster for analyzing IRA tweets from FiveThirtyEight. ★11 · Updated 4 years ago
- Build a directory full of files into a SQLite database ★12 · Updated last year
- Tools for building SQLite databases from files and directories ★12 · Updated last year
- Data exchange and persistence based on human-readable files ★22 · Updated 6 months ago
- Glue is an enterprise data model for the buy side, tailored for Wealth and Asset Managers and covering key entities such as Party, Busine… ★22 · Updated 2 years ago
- Awesome list of dataops products, open source and resources ★24 · Updated 3 years ago
- A Python library to generate static data catalog sites. Carte scrapes metadata from your data assets and generates a fully searchable fro… ★27 · Updated 2 years ago
- Git scrapers for scraping the fediverse ★17 · Updated this week
- A Datasette plugin that adds UI elements to edit, insert, or delete rows in SQLite tables ★19 · Updated 5 months ago
- Async bulk data ingestion and querying in various document, graph and vector databases via their Python clients ★36 · Updated last year
- Provide an easy way with Python to protect your data sources by searching their metadata. 🛡️ ★17 · Updated last month
- A set of tools to accelerate work in Jupyter notebooks. ★11 · Updated 5 years ago
- A conda-smithy repository for python-duckdb. ★13 · Updated this week
- Supported datasources for MindsDB ★16 · Updated last month
- dbd is a database prototyping tool that enables data analysts and engineers to quickly load and transform data in SQL databases. ★57 · Updated 3 years ago
- This project is created to promote and advocate the use of FOSS machine learning. ★46 · Updated last month
- Python API, Dynamic source, Dynamic target, N targets, Prometheus exporter, realtime transformation for Singer ETL ★10 · Updated 4 years ago
- NoETL (Not Only ETL) is a workflow management system designed to enable AI and machine learning functionality. ★11 · Updated this week
- GraphRAG vs Embeddings ★14 · Updated 11 months ago
- A swarm of LLM agents that will help you test, document, and productionize your code! ★17 · Updated 3 weeks ago