lambdal / lambda-stack-dockerfilesLinks
☆278Updated 7 months ago
Alternatives and similar repositories for lambda-stack-dockerfiles
Users that are interested in lambda-stack-dockerfiles are comparing it to the libraries listed below
Sorting:
- A top-like tool for monitoring GPUs in a cluster☆85Updated last year
- The command line interface for Gradient - https://gradient.paperspace.com☆67Updated 2 months ago
- Docker images for fastai☆181Updated 3 years ago
- GPU environment and cluster management with LLM support☆642Updated last year
- Recipes are a standard, well supported set of blueprints for machine learning engineers to rapidly train models using the latest research…☆330Updated this week
- NVIDIA Data Science stack tools☆393Updated 2 years ago
- MLCube® is a project that reduces friction for machine learning by ensuring that models are easily portable and reproducible.☆157Updated last year
- Lightning HPO & Training Studio App☆18Updated 2 years ago
- Plugin for deploying MLflow models to TorchServe☆110Updated 2 years ago
- ClearML - Model-Serving Orchestration and Repository Solution☆157Updated 2 weeks ago
- TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and sup…☆394Updated last week
- Benchmark Suite for Deep Learning☆276Updated this week
- ClearML Fractional GPU - Run multiple containers on the same GPU with driver level memory limitation ✨ and compute time-slicing☆80Updated last year
- aim-mlflow integration☆221Updated 2 years ago
- ☆100Updated 3 months ago
- Dataset registry DVC project☆82Updated last year
- Examples of Machine Learning code using Comet.ml☆165Updated last week
- Train fastai models faster (and other useful tools)☆71Updated 4 months ago
- Swarm training framework using Haiku + JAX + Ray for layer parallel transformer language models on unreliable, heterogeneous nodes☆241Updated 2 years ago
- ClearML Agent - ML-Ops made easy. ML-Ops scheduler & orchestration solution☆278Updated 2 months ago
- Automatic GPU+CPU memory profiling, re-use and memory leaks detection using jupyter/ipython experiment containers☆225Updated last year
- Easy, fast and very cheap training and inference on AWS Trainium and Inferentia chips.☆244Updated last week
- Module, Model, and Tensor Serialization/Deserialization☆270Updated last month
- Where GPUs get cooked 👩🍳🔥☆293Updated last month
- Template for nbdev projects☆321Updated 3 years ago
- TitanML Takeoff Server is an optimization, compression and deployment platform that makes state of the art machine learning models access…☆114Updated last year
- A JupyterLab extension for displaying dashboards of GPU usage.☆660Updated last month
- Lightweight Experiment & Resource Monitoring 📺☆189Updated 2 years ago
- A collection of YAML files, Helm Charts, Operator code, and guides to act as an example reference implementation for NVIDIA NIM deploymen…☆194Updated last week
- ☆84Updated last week