awesome-mlops / awesome-data-management
A curated list of awesome open source tools and commercial products to catalog, version, and manage data
☆39 · Updated 3 years ago
Alternatives and similar repositories for awesome-data-management
Users who are interested in awesome-data-management are comparing it to the repositories listed below
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution. ☆66 · Updated last week
- Awesome list of dataops products, open source and resources ☆24 · Updated 3 years ago
- dbd is a database prototyping tool that enables data analysts and engineers to quickly load and transform data in SQL databases. ☆57 · Updated 4 years ago
- A curated list of data wrangling resources ☆39 · Updated 7 years ago
- Support for jupyter notebook templates in jupyterlab ☆25 · Updated last week
- Provide an easy way with Python to protect your data sources by searching its metadata. 🛡️ ☆18 · Updated last week
- CLI for creating databases for Data Quality Dashboards. ☆19 · Updated 6 years ago
- Generating Realistic Synthetic Data ☆41 · Updated last year
- Omnipy is a high level Python library for type-driven data wrangling and scalable workflow orchestration (under development) ☆25 · Updated last week
- Example of a Streamlit data app powered by Vaex ☆11 · Updated 3 years ago
- Functional composable pipelines allowing clean separation of the business logic and its implementation ☆11 · Updated 5 months ago
- A small Python module containing quick utility functions for standard ETL processes. ☆37 · Updated this week
- ☆11 · Updated 2 years ago
- Awesome Orchest projects, both official and submitted by the community. ☆26 · Updated 2 years ago
- A Python library to generate static data catalog sites. Carte scrapes metadata from your data assets and generates a fully searchable fro… ☆29 · Updated 3 years ago
- Notebooks Academy: Write Production-Ready Code From Jupyter. ☆13 · Updated 3 years ago
- A spreadsheet-like data preparation web app that works over Optimus (Pandas, Dask, cuDF, Dask-cuDF, Spark and Vaex) ☆141 · Updated 2 years ago
- Scrape various open data directories to create an index of what's available out there ☆37 · Updated last year
- AgRec is an open source collection of agriculture recommendations from the Cooperative Extension Services. ☆12 · Updated 4 years ago
- Techniques for Scraping the Web in Python ☆27 · Updated 7 years ago
- Tools for working with Singer Taps and Targets ☆61 · Updated last year
- Repo demonstrating a Dagster pipeline to generate Neo4j Graph ☆22 · Updated 4 years ago
- Data pipelines from re-usable components ☆107 · Updated 3 months ago
- KNIME Python Integration ☆86 · Updated this week
- Framework for processing data packages in pipelines of modular components. ☆123 · Updated 7 months ago
- ☆20 · Updated last week
- Supercharged pandas indexing ☆11 · Updated 4 years ago
- Glue is an enterprise data model for the buy side, tailored for Wealth and Asset Managers and covering key entities such as Party, Busine… ☆23 · Updated 2 years ago
- Python bindings for the Stardog Knowledge Graph platform ☆41 · Updated 3 months ago
- KNOTS is an intuitive desktop application built to simplify the configuration of Singer pipelines ☆67 · Updated 3 years ago