mistralai / mistral-evalsLinks
☆78Updated 3 months ago
Alternatives and similar repositories for mistral-evals
Users that are interested in mistral-evals are comparing it to the libraries listed below
Sorting:
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆60Updated last year
- ☆88Updated last week
- Verifiers for LLM Reinforcement Learning☆79Updated 7 months ago
- Repo hosting codes and materials related to speeding LLMs' inference using token merging.☆37Updated last month
- ☆52Updated last year
- A repository for research on medium sized language models.☆78Updated last year
- ☆48Updated last year
- Official implementation of the ICML 2024 paper RoSA (Robust Adaptation)☆44Updated last year
- Evaluating LLMs with fewer examples☆168Updated last year
- ☆55Updated last year
- The official repo for "LLoCo: Learning Long Contexts Offline"☆118Updated last year
- This is the official repository for Inheritune.☆115Updated 9 months ago
- ☆82Updated this week
- The source code of our work "Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models" [AISTATS …☆60Updated last year
- My fork os allen AI's OLMo for educational purposes.☆30Updated 11 months ago
- SLED: Self Logits Evolution Decoding for Improving Factuality in Large Language Model https://arxiv.org/pdf/2411.02433☆110Updated 11 months ago
- Simple repository for training small reasoning models☆45Updated 9 months ago
- ☆88Updated last year
- ReBase: Training Task Experts through Retrieval Based Distillation☆29Updated 9 months ago
- Mask-Enhanced Autoregressive Prediction: Pay Less Attention to Learn More☆33Updated 6 months ago
- Replicating O1 inference-time scaling laws☆90Updated 11 months ago
- KV Cache Steering for Inducing Reasoning in Small Language Models☆42Updated 3 months ago
- Make reasoning models scalable☆47Updated 5 months ago
- A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM).☆297Updated last week
- ☆65Updated last year
- Implementation of the paper: "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" from Google in pyTO…☆56Updated last month
- Train, tune, and infer Bamba model☆136Updated 5 months ago
- The code for the paper ROUTERBENCH: A Benchmark for Multi-LLM Routing System☆150Updated last year
- minimal GRPO implementation from scratch☆99Updated 8 months ago
- Aioli: A unified optimization framework for language model data mixing☆28Updated 10 months ago