unslothai / unsloth-zoo
Utils for Unsloth
☆63Updated last week
Alternatives and similar repositories for unsloth-zoo:
Users that are interested in unsloth-zoo are comparing it to the libraries listed below
- Train, tune, and infer Bamba model☆87Updated 2 months ago
- ☆112Updated 6 months ago
- Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.☆195Updated 8 months ago
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆91Updated 3 weeks ago
- ☆126Updated 7 months ago
- ☆47Updated 7 months ago
- Train your own SOTA deductive reasoning model☆81Updated 3 weeks ago
- PyTorch building blocks for the OLMo ecosystem☆177Updated this week
- A toolkit for fine-tuning, inferencing, and evaluating GreenBitAI's LLMs.☆81Updated 3 weeks ago
- vLLM performance dashboard☆23Updated 11 months ago
- Micro Llama is a small Llama based model with 300M parameters trained from scratch with $500 budget☆145Updated last year
- My fork os allen AI's OLMo for educational purposes.☆30Updated 3 months ago
- minimal GRPO implementation from scratch☆65Updated 3 weeks ago
- Google TPU optimizations for transformers models☆104Updated 2 months ago
- Data preparation code for Amber 7B LLM☆86Updated 10 months ago
- Repo hosting codes and materials related to speeding LLMs' inference using token merging.☆35Updated 11 months ago
- RWKV-7: Surpassing GPT☆82Updated 4 months ago
- Collection of autoregressive model implementation☆83Updated last month
- An efficent implementation of the method proposed in "The Era of 1-bit LLMs"☆155Updated 5 months ago
- A pipeline for LLM knowledge distillation☆99Updated last week
- LLM reads a paper and produce a working prototype☆51Updated 2 weeks ago
- ☆53Updated 10 months ago
- Exploring Applications of GRPO☆145Updated this week
- Fine-tunes a student LLM using teacher feedback for improved reasoning and answer quality. Implements GRPO with teacher-provided evaluati…☆39Updated last month
- DPO, but faster 🚀☆40Updated 3 months ago
- Block Transformer: Global-to-Local Language Modeling for Fast Inference (NeurIPS 2024)☆150Updated 3 months ago
- Model Activity VIsualiser☆109Updated this week
- Benchmark suite for LLMs from Fireworks.ai☆70Updated last month
- ☆48Updated 4 months ago
- An extension of the nanoGPT repository for training small MOE models.☆109Updated 3 weeks ago