foundation-model-stack / fms-accelerationLinks
π Collection of libraries used with fms-hf-tuning to accelerate fine-tuning and training of large models.
β11Updated last week
Alternatives and similar repositories for fms-acceleration
Users that are interested in fms-acceleration are comparing it to the libraries listed below
Sorting:
- π Collection of tuning recipes with HuggingFace SFTTrainer and PyTorch FSDP.β47Updated this week
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.β20Updated last year
- Experimental scripts for researching data adaptive learning rate scheduling.β23Updated last year
- Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.β17Updated 3 months ago
- vLLM adapter for a TGIS-compatible gRPC server.β32Updated this week
- Train, tune, and infer Bamba modelβ127Updated 3 weeks ago
- β29Updated 5 months ago
- β37Updated this week
- Minimum Description Length probing for neural network representationsβ18Updated 4 months ago
- A sample pattern for running CI tests on Modalβ18Updated 2 months ago
- β24Updated 9 months ago
- β44Updated last year
- Utilities for Training Very Large Modelsβ58Updated 9 months ago
- Exploration into the Firefly algorithm in Pytorchβ40Updated 4 months ago
- Example ML projects that use the Determined library.β32Updated 9 months ago
- Library for the Test-based Calibration Error (TCE) metric to quantify the degree to classifier calibration.β13Updated last year
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found heβ¦β31Updated last year
- β23Updated 6 months ago
- Code for the examples presented in the talk "Training a Llama in your backyard: fine-tuning very large models on consumer hardware" givenβ¦β14Updated last year
- π· Build compute kernelsβ68Updated this week
- Transformer with Mu-Parameterization, implemented in Jax/Flax. Supports FSDP on TPU pods.β30Updated 3 weeks ago
- LM engine is a library for pretraining/finetuning LLMsβ57Updated this week
- β15Updated 2 months ago
- β13Updated 9 months ago
- A scalable implementation of diffusion and flow-matching with XGBoost models, applied to calorimeter data.β18Updated 7 months ago
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open dataβ21Updated 10 months ago
- Visualize multi-model embedding spaces. The first goal is to quickly get a lay of the land of any embedding space. Then be able to scrollβ¦β27Updated last year
- MatFormer repoβ31Updated 6 months ago
- β41Updated 2 weeks ago
- DPO, but faster πβ43Updated 6 months ago