kevin-hanselman / dud
A lightweight CLI tool for versioning data alongside source code and building data pipelines.
☆201 Updated last month
Alternatives and similar repositories for dud:
Users who are interested in dud are comparing it to the libraries listed below.
- 📈 Log and track ML metrics, parameters, models with Git and/or DVC ☆169 Updated last week
- 🏷️ Git Tag Ops. Turn your Git repository into Artifact Registry or Model Registry. ☆144 Updated 2 weeks ago
- Reproducible Jupyter notebooks, powered by uv. ☆185 Updated this week
- Execute a jupyter notebook, fast, without needing jupyter ☆127 Updated last month
- spock is a framework that helps manage complex parameter configurations during research and development of Python applications ☆128 Updated last year
- A toolbox 🧰 for Jupyter notebooks 📙: testing, experiment tracking, debugging, profiling, and more! ☆68 Updated 5 months ago
- Python Jupyter kernel using Poetry for reproducible notebooks ☆247 Updated last year
- RFC document, tooling and other content related to the dataframe API standard ☆105 Updated 10 months ago
- VisiData interface for databases ☆66 Updated last year
- Super lightweight function registries for your library ☆177 Updated 8 months ago
- A fun party trick to run Python code from another venv into this one. ☆178 Updated last month
- General Purpose Data Manipulation Library ☆322 Updated last year
- Render Jupyter notebook in the terminal ☆183 Updated 2 years ago
- Fast Data Science, AKA fds, is a CLI for Data Scientists to version control data and code at once, by conveniently wrapping git and dvc ☆385 Updated 7 months ago
- 📝 Pytest plugin for testing notebooks ☆192 Updated last month
- Confection: the sweetest config system for Python ☆182 Updated 8 months ago
- Typed wrappers over pandas DataFrames with schema validation ☆100 Updated last year
- Add DuckDB, Parquet, CSV and JSON lines support to Datasette ☆51 Updated 6 months ago
- Convert pyproject.toml to environment.yaml ☆129 Updated last year
- Python Jupyter kernel provisioner using pyproject environment managers like Uv, Rye, PDM, Poetry, Hatch etc. ☆35 Updated 2 months ago
- Automatic registry design-pattern library for mapping string names to code functionality. ☆44 Updated 3 weeks ago
- ☁️ Terraform plugin for machine learning workloads: spot instance recovery & auto-termination | AWS, GCP, Azure, Kubernetes ☆292 Updated 2 months ago
- a scalable data profiler ☆296 Updated 2 weeks ago
- sshfs - SSH/SFTP implementation for fsspec ☆63 Updated 2 weeks ago
- Toolkit for graph-relational data across space and time ☆113 Updated 5 months ago
- Turn Pydantic defined Data Models into CLI Tools ☆147 Updated 5 months ago
- Jupyter + Marimo = ❤️ ☆37 Updated 7 months ago
- A Jupyter server based on FastAPI ☆255 Updated this week