LLM360 / k2-train
☆50Updated 9 months ago
Alternatives and similar repositories for k2-train:
Users that are interested in k2-train are comparing it to the libraries listed below
- ☆48Updated 4 months ago
- A repository for research on medium sized language models.☆76Updated 10 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆55Updated 6 months ago
- This is the official repository for Inheritune.☆109Updated last month
- Code for RATIONALYST: Pre-training Process-Supervision for Improving Reasoning https://arxiv.org/pdf/2410.01044☆32Updated 5 months ago
- EvaByte: Efficient Byte-level Language Models at Scale☆85Updated this week
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"☆46Updated last year
- Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning"☆53Updated 5 months ago
- The first dense retrieval model that can be prompted like an LM☆67Updated 6 months ago
- ☆74Updated 7 months ago
- My fork os allen AI's OLMo for educational purposes.☆30Updated 3 months ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆33Updated last year
- Co-LLM: Learning to Decode Collaboratively with Multiple Language Models☆110Updated 10 months ago
- This repo is based on https://github.com/jiaweizzhao/GaLore☆26Updated 6 months ago
- ☆47Updated 6 months ago
- ☆52Updated 6 months ago
- ☆76Updated 2 months ago
- ☆32Updated 9 months ago
- One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation☆38Updated 5 months ago
- Aioli: A unified optimization framework for language model data mixing☆22Updated 2 months ago
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl…☆71Updated 7 months ago
- ☆111Updated last month
- Repo hosting codes and materials related to speeding LLMs' inference using token merging.☆35Updated 10 months ago
- ☆44Updated 10 months ago
- ☆40Updated 3 weeks ago
- ☆59Updated last week
- Advancing Language Model Reasoning through Reinforcement Learning and Inference Scaling☆95Updated 2 months ago
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples'☆75Updated last year
- ☆32Updated 3 weeks ago
- The source code of our work "Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models"☆59Updated 5 months ago