datarootsio / databooks
A CLI tool to reduce the friction between data scientists by reducing git conflicts removing notebook metadata and gracefully resolving git conflicts.
☆112Updated last year
Alternatives and similar repositories for databooks:
Users that are interested in databooks are comparing it to the libraries listed below
- Write python locally, execute SQL in your data warehouse☆269Updated 2 years ago
- Make your Kedro experience snazzy☆35Updated 2 years ago
- A kedro plugin to use pandera in your kedro projects☆35Updated 6 months ago
- A best-practices first project template that allows you to get started on a new machine learning project.☆142Updated 3 years ago
- A toolbox 🧰 for Jupyter notebooks 📙: testing, experiment tracking, debugging, profiling, and more!☆68Updated 7 months ago
- IbisML is a library for building scalable ML pipelines using Ibis.☆108Updated 4 months ago
- Kedro Plugin to support running workflows on Kubeflow Pipelines☆53Updated 7 months ago
- Possibly the fastest DataFrame-agnostic quality check library in town.☆187Updated this week
- 🏃♀️ Minimalist SQL orchestrator☆243Updated this week
- A data modelling layer built on top of polars and pydantic☆194Updated last year
- Write your dbt models using Ibis☆64Updated last month
- prefect integration for running dbt☆60Updated 7 months ago
- Assessing whether data from database complies with reference information.☆42Updated this week
- 🏷️ Git Tag Ops. Turn your Git repository into Artifact Registry or Model Registry.☆146Updated last month
- fst: flow state tool | smooth where you want it, friction where you need it when data engineering☆34Updated last year
- Automated Jupyter notebook testing. 📙☆41Updated last year
- Typed wrappers over pandas DataFrames with schema validation☆101Updated last year
- A kedro plugin that enables logging to the ml experiment tracker aim☆10Updated 2 years ago
- The TimescaleDB adapter plugin for dbt☆41Updated last week
- 🚀 Get started in our repos☆12Updated this week
- ☁️ Terraform plugin for machine learning workloads: spot instance recovery & auto-termination | AWS, GCP, Azure, Kubernetes☆293Updated 4 months ago
- RFC document, tooling and other content related to the dataframe API standard☆108Updated last year
- DagsHub client libraries☆93Updated this week
- ☁️ Export Ploomber pipelines to Kubernetes (Argo), Airflow, AWS Batch, SLURM, and Kubeflow.☆45Updated last month
- rust-for-data☆45Updated last year
- Repo for orienting dbt users to the Dagster asset framework☆54Updated 2 years ago
- Alto is a versatile data integration tool that allows you to easily run Singer plugins, build and cache PEX files encapsulating those plu…☆60Updated 2 years ago
- Anomstack - Painless open source anomaly detection for your metrics 📈📉🚀☆98Updated last week
- Data Tools Subjective List☆83Updated last year
- Cloud provider cluster managers for Dask. Supports AWS, Google Cloud Azure and more...☆140Updated 3 months ago