chrisociepa / allamo
Simple, hackable and fast implementation for training/finetuning medium-sized LLaMA-based models
☆153 · Updated this week
Related projects
Alternatives and complementary repositories for allamo
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA