datarootsio / databooks
A CLI tool to reduce the friction between data scientists by reducing git conflicts removing notebook metadata and gracefully resolving git conflicts.
☆112Updated last year
Alternatives and similar repositories for databooks:
Users that are interested in databooks are comparing it to the libraries listed below
- Write python locally, execute SQL in your data warehouse☆270Updated 2 years ago
- Make your Kedro experience snazzy☆35Updated 2 years ago
- ☁️ Terraform plugin for machine learning workloads: spot instance recovery & auto-termination | AWS, GCP, Azure, Kubernetes☆293Updated 3 months ago
- UnionML: the easiest way to build and deploy machine learning microservices☆336Updated last year
- A kedro plugin to use pandera in your kedro projects☆35Updated 4 months ago
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou…☆113Updated 11 months ago
- A toolbox 🧰 for Jupyter notebooks 📙: testing, experiment tracking, debugging, profiling, and more!☆68Updated 5 months ago
- Typed wrappers over pandas DataFrames with schema validation☆101Updated last year
- A GitHub Action that makes it easy to use Great Expectations to validate your data pipelines in your CI workflows.☆80Updated 10 months ago
- First-party plugins maintained by the Kedro team.☆97Updated this week
- IbisML is a library for building scalable ML pipelines using Ibis.☆102Updated 2 months ago
- Great Expectations Airflow operator☆160Updated this week
- Plugins, extensions, case studies, articles, and video tutorials for Kedro☆71Updated 3 months ago
- ☁️ Export Ploomber pipelines to Kubernetes (Argo), Airflow, AWS Batch, SLURM, and Kubeflow.☆45Updated this week
- Write your dbt models using Ibis☆63Updated 2 months ago
- DagsHub client libraries☆93Updated this week
- A best-practices first project template that allows you to get started on a new machine learning project.☆142Updated 3 years ago
- ☆16Updated last year
- Dask integration for Snowflake☆30Updated 3 months ago
- Kedro Plugin to support running workflows on Kubeflow Pipelines☆53Updated 6 months ago
- A data modelling layer built on top of polars and pydantic☆194Updated last year
- The easiest way to integrate Kedro and Great Expectations☆53Updated 2 years ago
- Soda SQL and Soda Spark have been deprecated and replaced by Soda Core. docs.soda.io/soda-core/overview.html☆61Updated 2 years ago
- Build and deploy a serverless data pipeline on AWS with no effort.☆111Updated 2 years ago
- LRU caching with expiration period.☆18Updated 8 months ago
- A kedro-plugin for integration of mlflow capabilities inside kedro projects (especially machine learning model versioning and packaging)☆209Updated 3 weeks ago
- fsspec-compatible Azure Datake and Azure Blob Storage access☆187Updated 2 months ago
- 🪴 Nebari - your open source data science platform☆289Updated this week
- Machine learning experiment tracking and data versioning with DVC extension for VS Code☆202Updated this week
- Combinator.ml's central repo, documentation and website☆30Updated 3 years ago