foundation-model-stack / fms-fsdp
🚀 Efficiently (pre)training foundation models with native PyTorch features, including FSDP for training and SDPA implementation of Flash attention v2.
☆279 · Updated 2 months ago
Alternatives and similar repositories for fms-fsdp
Users who are interested in fms-fsdp are comparing it to the libraries listed below.
- 🚀 Collection of components for development, training, tuning, and inference of foundation models leveraging PyTorch native components. ☆219 · Updated this week
- This repository contains the experimental PyTorch native float8 training UX ☆227 · Updated last year
- Triton-based implementation of Sparse Mixture of Experts. ☆263 · Updated 4 months ago
- Load compute kernels from the Hub ☆389 · Updated last week
- Applied AI experiments and examples for PyTorch ☆315 · Updated 5 months ago
- Implementation of 💍 Ring Attention, from Liu et al. at Berkeley AI, in PyTorch