foundation-model-stack / fms-acceleration
🚀 Collection of libraries used with fms-hf-tuning to accelerate fine-tuning and training of large models.
⭐ 13 · Updated 3 weeks ago
Alternatives and similar repositories for fms-acceleration
Users interested in fms-acceleration are comparing it to the libraries listed below.
- Experimental scripts for researching data adaptive learning rate scheduling. ⭐ 22 · Updated 2 years ago
- Utilities for Training Very Large Models ⭐ 58 · Updated last year
- ⭐ 34 · Updated last year
- JAX Scalify: end-to-end scaled arithmetics ⭐ 16 · Updated last year
- Fork of Flame repo for training of some new stuff in development ⭐ 18 · Updated this week
- PyTorch centric eager mode debugger ⭐ 48 · Updated 10 months ago
- ⭐ 21 · Updated 8 months ago
- Train, tune, and infer Bamba model ⭐ 135 · Updated 5 months ago
- ⭐ 22 · Updated 10 months ago
- ⭐ 46 · Updated last year
- ⭐ 26 · Updated last year
- Source-to-Source Debuggable Derivatives in Pure Python ⭐ 15 · Updated last year
- Repository for Sparse Finetuning of LLMs via modified version of the MosaicML llmfoundry ⭐ 42 · Updated last year
- vLLM adapter for a TGIS-compatible gRPC server. ⭐ 43 · Updated this week
- Using FlexAttention to compute attention with different masking patterns ⭐ 47 · Updated last year
- The source code of our work "Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models" [AISTATS …] ⭐ 60 · Updated last year
- [ICLR'25] Code for KaSA, an official implementation of "KaSA: Knowledge-Aware Singular-Value Adaptation of Large Language Models" ⭐ 20 · Updated 9 months ago
- Demo of the unit_scaling library, showing how a model can be easily adapted to train in FP8. ⭐ 45 · Updated last year
- Implementation of Hyena Hierarchy in JAX ⭐ 10 · Updated 2 years ago
- A scalable implementation of diffusion and flow-matching with XGBoost models, applied to calorimeter data. ⭐ 18 · Updated last year
- FlexAttention w/ FlashAttention3 Support ⭐ 27 · Updated last year
- ⭐ 26 · Updated last month
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data ⭐ 21 · Updated last year
- Triton Implementation of HyperAttention Algorithm ⭐ 48 · Updated last year
- A new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found here… ⭐ 31 · Updated 2 years ago
- Intel Gaudi's Megatron DeepSpeed Large Language Models for training ⭐ 14 · Updated 10 months ago
- Minimum Description Length probing for neural network representations ⭐ 20 · Updated 9 months ago
- CUDA implementation of autoregressive linear attention, with all the latest research findings ⭐ 45 · Updated 2 years ago
- ⭐ 15 · Updated last year
- MEXMA: Token-level objectives improve sentence representations ⭐ 42 · Updated 10 months ago