awesome-mlops / awesome-data-managementLinks
A curated list of awesome open source tools and commercial products to catalog, version, and manage data π
β33Updated 3 years ago
Alternatives and similar repositories for awesome-data-management
Users that are interested in awesome-data-management are comparing it to the libraries listed below
Sorting:
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution.β59Updated last month
- Functional composable pipelines allowing clean separation of the business logic and its implementationβ11Updated last year
- This repository auto-configures an Apache Pinot and Superset cluster for analyzing IRA tweets from FiveThirtyEight.β10Updated 4 years ago
- Async bulk data ingestion and querying in various document, graph and vector databases via their Python clientsβ36Updated last year
- Scrape various open data directories to create an index of what's available out thereβ37Updated 4 months ago
- Documentation repository for RudderStack - the Customer Data Platform for Developers.β25Updated 8 months ago
- NetworkX-like Python experience for Postgres, SQLite, MongoDB, and Neo4Jβ23Updated 4 months ago
- Repository to allow collaboration between Cycle Labs Cloud community in support of the community.β9Updated 3 years ago
- Batteries included toolkit for data engineering.β34Updated 6 months ago
- A collection of tools that can be used for LLM function callingβ33Updated last month
- dbd is a database prototyping tool that enables data analysts and engineers to quickly load and transform data in SQL databases.β57Updated 3 years ago
- Awesome Orchest projects, both official and submitted by the community.β25Updated last year
- Apache Spark based framework for analysis A/B experimentsβ15Updated 8 months ago
- βοΈ Export Ploomber pipelines to Kubernetes (Argo), Airflow, AWS Batch, SLURM, and Kubeflow.β45Updated 4 months ago
- β11Updated last year
- Awesome list of dataops products, open source and resourcesβ24Updated 3 years ago
- A Python library to generate static data catalog sites. Carte scrapes metadata from your data assets and generates a fully searchable froβ¦β27Updated 3 years ago
- β11Updated 5 months ago
- Glue is an enterprise data model for the buy side, tailored for Wealth and Asset Managers and covering key entities such as Party, Busineβ¦β22Updated 2 years ago
- Stardog Helm Chartsβ10Updated last week
- Tools for building SQLite databases from files and directoriesβ12Updated last year
- This repository contains example implementations for KNIME Analytics Platform.β18Updated last week
- Git scrapers for scraping the fediverseβ17Updated this week
- My dot files in one place - extensively edited over time. Your mileage may varyβ2Updated 9 years ago
- d3 plugin to create a temporal network visualizationβ18Updated 2 years ago
- Geniusrise: Framework for building geniusesβ60Updated last year
- π» CLI for reporting events to Faros platformβ14Updated 2 months ago
- β11Updated 5 months ago
- Datasette plugin for authenticating access using API tokensβ12Updated 10 months ago
- A few end to end examples that use data-describeβ16Updated 2 years ago