Example of using Epochraft to train HuggingFace transformers models with PyTorch FSDP
☆11 Jan 29, 2024 Updated 2 years ago
Alternatives and similar repositories for epochraft-hf-fsdp
Users that are interested in epochraft-hf-fsdp are comparing it to the libraries listed below
- Support Continual pre-training & Instruction Tuning forked from llama-recipes ☆34 Feb 17, 2024 Updated 2 years ago
- Checkpointable dataset utilities for foundation model training ☆32 Jan 29, 2024 Updated 2 years ago
- Ongoing research training Mixture of Expert models. ☆21 Sep 16, 2024 Updated last year
- ☆20 Aug 28, 2024 Updated last year
- Code for the paper: https://arxiv.org/pdf/2309.06979.pdf ☆21 Jul 29, 2024 Updated last year
- ☆53 May 20, 2024 Updated last year
- ☆22 Sep 18, 2023 Updated 2 years ago
- The tool facilitates debugging convergence issues and testing new algorithms and recipes for training LLMs using Nvidia libraries such as… ☆18 Sep 17, 2025 Updated 5 months ago
- The robust text processing pipeline framework enabling customizable, efficient, and metric-logged text preprocessing. ☆125 Nov 13, 2025 Updated 3 months ago
- ☆36 Feb 26, 2024 Updated 2 years ago
- Advanced block device testing/file system testing, targeting SNIA compatible reporting ☆12 Oct 15, 2025 Updated 4 months ago
- Mamba training library developed by kotoba technologies ☆71 Feb 11, 2024 Updated 2 years ago
- Lustre Repository with MS patches ☆14 Updated this week
- Python wrappers for the FirecREST API ☆12 Updated this week
- ☆48 Jan 5, 2026 Updated 2 months ago
- A framework for few-shot evaluation of autoregressive language models. ☆153 Sep 13, 2024 Updated last year
- Cloyster HPC is a turnkey HPC cluster solution with a user-friendly installer ☆10 Oct 2, 2025 Updated 5 months ago
- Auto detection of apt proxies in the LAN, caching and checking status ☆10 Feb 13, 2025 Updated last year
- Lustre HSM tools ☆10 Feb 19, 2024 Updated 2 years ago
- Information about Wantedly internships and new-graduate recruitment ☆11 Apr 5, 2022 Updated 3 years ago
- This repository holds all material related to the Ory Summit, specifically the presentations. ☆12 Oct 22, 2025 Updated 4 months ago
- ☆12 Jul 7, 2022 Updated 3 years ago
- extended benchmarking automation tool for HPC applications ☆16 Updated this week
- Crawl & Visualize NeurIPS 2022 Data from OpenReview ☆14 Nov 8, 2022 Updated 3 years ago
- Statistical discontinuous constituent parsing ☆11 Feb 15, 2018 Updated 8 years ago
- Large language models to diffusion finetuning code ☆24 Jun 2, 2025 Updated 9 months ago
- This is an isometric game developed using React/Hooks, TypeScript, LESS. Inspired by Fallout 2/Fallout Tactics and was developed as an e…