iterative / dvc-dataLinks

DVC's data management subsystem

☆18

Alternatives and similar repositories for dvc-data

Users that are interested in dvc-data are comparing it to the libraries listed below

Sorting:

iterative / ldb-resources
☆27Updated 3 years ago
cheind / pydantic-numpy
Seamlessly integrate numpy arrays into pydantic models.
☆58Updated 2 years ago
justindujardin / pathy
simple, flexible, offline capable, cloud storage with a Python path-like interface
☆175Updated 6 months ago
iterative / dvclive
📈 Log and track ML metrics, parameters, models with Git and/or DVC
☆182Updated this week
iterative / gto
🏷️ Git Tag Ops. Turn your Git repository into Artifact Registry or Model Registry.
☆153Updated last week
deepyaman / kedro-accelerator
Kedro-Accelerator speeds up pipelines by parallelizing I/O in the background.
☆36Updated 3 years ago
data-apis / dataframe-api-compat
Extremely lightweight compatibility layer between pandas and Polars
☆41Updated last year
koaning / mktestdocs
Run pytest against markdown files/docstrings.
☆145Updated last week
tomcatling / black-nb
Runs black on code cells in a Jupyter notebook
☆50Updated 3 years ago
koaning / pytest-duration-insights
A mini dashboard to help find slow tests in pytest.
☆83Updated last year
dask / dask-gateway
A multi-tenant server for securely deploying and managing Dask clusters.
☆142Updated 2 weeks ago
dask / dask-cloudprovider
Cloud provider cluster managers for Dask. Supports AWS, Google Cloud Azure and more...
☆145Updated last month
areshytko / typedframe
Typed wrappers over pandas DataFrames with schema validation
☆102Updated 2 years ago
dojeda / poetry2conda
Convert pyproject.toml to environment.yaml
☆132Updated 2 years ago
iterative / dvc-bench
Benchmarks for DVC
☆21Updated this week
pola-rs / dask-polars
Coming soon
☆62Updated 2 years ago
jupyter / nbclient
A client library for executing notebooks. Formally nbconvert's ExecutePreprocessor
☆178Updated 4 months ago
dask-contrib / dask-databricks
Cluster tools for running Dask on Databricks
☆15Updated last year
tamsanh / kedro-great
The easiest way to integrate Kedro and Great Expectations
☆54Updated 2 years ago
deppen8 / pandas-vet
A plugin for Flake8 that checks pandas code
☆170Updated 2 years ago
MatsMoll / aligned
The DBT of ML, as Aligned describes data dependencies in ML systems, and reduce technical data debt
☆60Updated this week
ianozsvald / dtype_diet
Tries to shrink your Pandas column dtypes with no data loss so you have more spare RAM
☆86Updated last year
nebari-dev / nebari-docs
📖 Documentation for Nebari
☆19Updated last week
data-apis / array-api-tests
Test suite for Python array API standard compliance
☆68Updated last week
n8henrie / jupyter-black
A simple extension for Jupyter Notebook and Jupyter Lab to beautify Python code automatically using Black. Fork of dnanhkhoa/nb_black.
☆59Updated last year
iterative / workshop-uncool-mlops
Accompanies the uncool MLOps workshop
☆26Updated 3 years ago
Itayazolay / featureclass
Feature engineering library that helps you keep track of feature dependencies, documentation and schema
☆28Updated 3 years ago
scientific-python / lazy-loader
Populate library namespace without incurring immediate import costs
☆194Updated 3 weeks ago
conda-incubator / conda-store
Data science environments, for collaboration. ✨
☆155Updated this week
AndrewRook / prefect_ds
Tools for making Prefect work better for typical data science workflows
☆18Updated 3 years ago