determined-ai / environmentsLinks

Determined AI public environments

☆49

Alternatives and similar repositories for environments

Users that are interested in environments are comparing it to the libraries listed below

Sorting:

HFAiLab / jupyterlab_tensorboard_pro
Tensorboard extension for Jupyterlab all in one
☆90Updated 11 months ago
determined-ai / works-with-determined
This repository contains example integrations between Determined and other ML products
☆48Updated last year
coreweave / ml-containers
☆37Updated this week
run-ai / rntop
A top-like tool for monitoring GPUs in a cluster
☆85Updated last year
mlcommons / logging
MLPerf™ logging library
☆37Updated 2 months ago
determined-ai / determined-examples
Example ML projects that use the Determined library.
☆32Updated 10 months ago
ryantd / veloce
WIP. Veloce is a low-code Ray-based parallelization library that makes machine learning computation novel, efficient, and heterogeneous.
☆18Updated 2 years ago
ray-project / mlflow-ray-serve
MLFlow Deployment Plugin for Ray Serve
☆46Updated 3 years ago
triton-inference-server / pytorch_backend
The Triton backend for the PyTorch TorchScript models.
☆156Updated last week
nod-ai / transformer-benchmarks
benchmarking some transformer deployments
☆26Updated 2 years ago
Michaelvll / llm-ie-benchmarks
A collection of reproducible inference engine benchmarks
☆32Updated 2 months ago
pytorch / torchsnapshot
A performant, memory-efficient checkpointing library for PyTorch applications, designed with large, complex distributed workloads in mind…
☆158Updated 3 weeks ago
HabanaAI / Model-References
Reference models for Intel(R) Gaudi(R) AI Accelerator
☆166Updated last week
pytorch / torchdistx
Torch Distributed Experimental
☆116Updated 11 months ago
NVIDIA / ngc-container-replicator
NGC Container Replicator
☆28Updated 2 years ago
hpcaitech / SkyComputing
Sky Computing: Accelerating Geo-distributed Computing in Federated Learning
☆91Updated 2 years ago
tensorchord / inference-benchmark
Benchmark for machine learning model online serving (LLM, embedding, Stable-Diffusion, Whisper)
☆28Updated 2 years ago
foundation-model-stack / fms-acceleration
🚀 Collection of libraries used with fms-hf-tuning to accelerate fine-tuning and training of large models.
☆11Updated last month
triton-inference-server / onnxruntime_backend
The Triton backend for the ONNX Runtime.
☆155Updated last week
hpcaitech / CachedEmbedding
A memory efficient DLRM training solution using ColossalAI
☆105Updated 2 years ago
gpuopenanalytics / pynvml
Provide Python access to the NVML library for GPU diagnostics
☆241Updated 7 months ago
GoogleCloudPlatform / slurm-gcp
☆50Updated this week
rapidsai / ucx-py
Python bindings for UCX
☆137Updated last week
intel / llm-on-ray
Pretrain, finetune and serve LLMs on Intel platforms with Ray
☆129Updated last week
FrancescoSaverioZuppichini / pytorch-2.0-benchmark
Benchmarking PyTorch 2.0 different models
☆21Updated 2 years ago
lambdal / deeplearning-benchmark
Benchmark Suite for Deep Learning
☆271Updated 4 months ago
run-ai / vscode-genv
GPU Environment Management for Visual Studio Code
☆38Updated last year
coreweave / tensorizer
Module, Model, and Tensor Serialization/Deserialization
☆248Updated last week
mlcommons / mlcube
MLCube® is a project that reduces friction for machine learning by ensuring that models are easily portable and reproducible.
☆157Updated 10 months ago
pytorch / torchx
TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and sup…
☆370Updated 2 weeks ago