datarootsio / databooks
A CLI tool to reduce the friction between data scientists by reducing git conflicts removing notebook metadata and gracefully resolving git conflicts.
☆112Updated last year
Alternatives and similar repositories for databooks:
Users that are interested in databooks are comparing it to the libraries listed below
- Make your Kedro experience snazzy☆35Updated 2 years ago
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou…☆113Updated last year
- 🚀 Stream inferences of real-time ML models in production to any data lake (Experimental)☆80Updated 2 years ago
- ☆16Updated last year
- Plugins, extensions, case studies, articles, and video tutorials for Kedro☆74Updated 3 months ago
- A kedro plugin to use pandera in your kedro projects☆35Updated 5 months ago
- Monitor the stability of a Pandas or Spark dataframe ⚙︎☆498Updated 2 months ago
- Fast Data Science, AKA fds, is a CLI for Data Scientists to version control data and code at once, by conveniently wrapping git and dvc☆387Updated 9 months ago
- Kedro Plugin to support running workflows on Kubeflow Pipelines☆53Updated 7 months ago
- Kedro plugin to support running workflows on Microsoft Azure ML Pipelines☆36Updated this week
- UnionML: the easiest way to build and deploy machine learning microservices☆335Updated last year
- Automated Jupyter notebook testing. 📙☆41Updated last year
- A kedro-plugin for integration of mlflow capabilities inside kedro projects (especially machine learning model versioning and packaging)☆210Updated 3 weeks ago
- Write python locally, execute SQL in your data warehouse☆270Updated 2 years ago
- IbisML is a library for building scalable ML pipelines using Ibis.☆108Updated 3 months ago
- First-party plugins maintained by the Kedro team.☆98Updated this week
- 🏬 modelstore is a Python library that allows you to version, export, and save a machine learning model to your filesystem or a cloud sto…☆387Updated 2 months ago
- A GitHub Action that makes it easy to use Great Expectations to validate your data pipelines in your CI workflows.☆80Updated 10 months ago
- ☁️ Export Ploomber pipelines to Kubernetes (Argo), Airflow, AWS Batch, SLURM, and Kubeflow.☆45Updated 3 weeks ago
- A toolbox 🧰 for Jupyter notebooks 📙: testing, experiment tracking, debugging, profiling, and more!☆68Updated 6 months ago
- DagsHub client libraries☆93Updated this week
- 🏷️ Git Tag Ops. Turn your Git repository into Artifact Registry or Model Registry.☆146Updated 2 weeks ago
- Kedro Plugin to support running workflows on GCP Vertex AI Pipelines☆36Updated this week
- The only inspect tool you need for Kedro☆9Updated 8 months ago
- ☁️ Terraform plugin for machine learning workloads: spot instance recovery & auto-termination | AWS, GCP, Azure, Kubernetes☆293Updated 3 months ago
- Move fast from data science prototype to pipeline. Capture, analyze, and transform messy notebooks into data pipelines with just two line…☆666Updated last month
- Convert monolithic Jupyter notebooks 📙 into maintainable Ploomber pipelines. 📊☆78Updated 6 months ago
- A kedro plugin that enables logging to the ml experiment tracker aim☆10Updated 2 years ago
- The easiest way to integrate Kedro and Great Expectations☆53Updated 2 years ago
- Kedro extension for VSCode including LSP and other features☆20Updated this week