FreedomIntelligence / FastLLMLinks
Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];
☆40Updated 2 years ago
Alternatives and similar repositories for FastLLM
Users that are interested in FastLLM are comparing it to the libraries listed below
Sorting:
- This is a personal reimplementation of Google's Infini-transformer, utilizing a small 2b model. The project includes both model and train…☆58Updated last year
- Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models☆139Updated last year
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆32Updated last year
- ☆36Updated last year
- FuseAI Project☆87Updated last year
- ☆96Updated last year
- code for Scaling Laws of RoPE-based Extrapolation☆73Updated 2 years ago
- An Experiment on Dynamic NTK Scaling RoPE