foundation-model-stack / fms-accelerationLinks
π Collection of libraries used with fms-hf-tuning to accelerate fine-tuning and training of large models.
β10Updated last month
Alternatives and similar repositories for fms-acceleration
Users that are interested in fms-acceleration are comparing it to the libraries listed below
Sorting:
- π Collection of tuning recipes with HuggingFace SFTTrainer and PyTorch FSDP.β44Updated this week
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found heβ¦β31Updated last year
- β21Updated 5 months ago
- Minimum Description Length probing for neural network representationsβ19Updated 4 months ago
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.β20Updated last year
- Training hybrid models for dummies.β21Updated 4 months ago
- JAX Scalify: end-to-end scaled arithmeticsβ16Updated 7 months ago
- β18Updated last year
- β25Updated last year
- Utilities for Training Very Large Modelsβ58Updated 8 months ago
- [Oral; Neurips OPT2024 ] ΞΌLO: Compute-Efficient Meta-Generalization of Learned Optimizersβ12Updated 2 months ago
- Using FlexAttention to compute attention with different masking patternsβ43Updated 8 months ago
- β13Updated this week
- β44Updated last year
- Train, tune, and infer Bamba modelβ127Updated last month
- A collection of reproducible inference engine benchmarksβ31Updated last month
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open dataβ21Updated 10 months ago
- Example ML projects that use the Determined library.β32Updated 8 months ago
- vLLM adapter for a TGIS-compatible gRPC server.β30Updated this week
- β15Updated 2 months ago
- Transformer with Mu-Parameterization, implemented in Jax/Flax. Supports FSDP on TPU pods.β30Updated last week
- Experimental scripts for researching data adaptive learning rate scheduling.β23Updated last year
- MatFormer repoβ26Updated 5 months ago
- Efficiently computing & storing token n-grams from large corporaβ23Updated 7 months ago
- Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.β17Updated 2 months ago
- Self contained pytorch implementation of a sinkhorn based router, for mixture of experts or otherwiseβ35Updated 9 months ago
- Fork of Flame repo for training of some new stuff in developmentβ13Updated last week
- β28Updated 4 months ago
- β38Updated last month
- [NeurIPS 2023] Sparse Modular Activation for Efficient Sequence Modelingβ35Updated last year