determined-ai / determined-examplesLinks

Example ML projects that use the Determined library.

☆32

Alternatives and similar repositories for determined-examples

Users that are interested in determined-examples are comparing it to the libraries listed below

Sorting:

tanyuqian / redco
NAACL '24 (Best Demo Paper RunnerUp) / MlSys @ NeurIPS '23 - RedCoast: A Lightweight Tool to Automate Distributed Training and Inference
☆68Updated 10 months ago
Michaelvll / llm-ie-benchmarks
A collection of reproducible inference engine benchmarks
☆34Updated 5 months ago
siyan-zhao / prepacking
The source code of our work "Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models" [AISTATS …
☆60Updated last year
UmerHA / triton_util
Make triton easier
☆47Updated last year
axonn-ai / axonn
Parallel framework for training and fine-tuning deep neural networks
☆65Updated 6 months ago
foundation-model-stack / bamba
Train, tune, and infer Bamba model
☆133Updated 4 months ago
NVIDIA / LDDL
Distributed preprocessing and data loading for language datasets
☆39Updated last year
HabanaAI / Megatron-DeepSpeed
Intel Gaudi's Megatron DeepSpeed Large Language Models for training
☆13Updated 9 months ago
softmax1 / Flash-Attention-Softmax-N
CUDA and Triton implementations of Flash Attention with SoftmaxN.
☆73Updated last year
anyscale / llm-continuous-batching-benchmarks
☆121Updated last year
foundation-model-stack / foundation-model-stack
🚀 Collection of components for development, training, tuning, and inference of foundation models leveraging PyTorch native components.
☆214Updated this week
drisspg / transformer_nuggets
A place to store reusable transformer components of my own creation or found on the interwebs
☆60Updated this week
prateeky2806 / ComPEFT
☆26Updated last year
IST-DASLab / SparseFinetuning
Repository for Sparse Finetuning of LLMs via modified version of the MosaicML llmfoundry
☆42Updated last year
deepspeedai / DeepSpeed-Kernels
☆72Updated 6 months ago
triton-inference-server / pytorch_backend
The Triton backend for the PyTorch TorchScript models.
☆160Updated this week
pytorch / torchdistx
Torch Distributed Experimental
☆117Updated last year
open-lm-engine / lm-engine
LM engine is a library for pretraining/finetuning LLMs
☆69Updated last week
meta-pytorch / torchsnapshot
A performant, memory-efficient checkpointing library for PyTorch applications, designed with large, complex distributed workloads in mind…
☆161Updated 3 weeks ago
eth-easl / fmengine
Utilities for Training Very Large Models
☆58Updated last year
abacusai / gh200-llm
Docker image NVIDIA GH200 machines - optimized for vllm serving and hf trainer finetuning
☆50Updated 7 months ago
fw-ai / benchmark
Benchmark suite for LLMs from Fireworks.ai
☆83Updated last week
HabanaAI / Model-References
Reference models for Intel(R) Gaudi(R) AI Accelerator
☆165Updated 2 weeks ago
NetEase-FuXi / EETQ
Easy and Efficient Quantization for Transformers
☆203Updated 3 months ago
gnovack / distributed-training-and-deepspeed
☆17Updated 2 years ago
srush / triton-autodiff
Experiment of using Tangent to autodiff triton
☆80Updated last year
apple / ml-hypercloning
☆52Updated 11 months ago
lessw2020 / transformer_central
Various transformers for FSDP research
☆38Updated 2 years ago
rasbt / pytorch-memory-optim
This code repository contains the code used for my "Optimizing Memory Usage for Training LLMs and Vision Transformers in PyTorch" blog po…
☆92Updated 2 years ago
mayank31398 / ladder-residual-inference
☆14Updated 3 months ago