mistralai / mistral-evals

☆78

Alternatives and similar repositories for mistral-evals

Users who are interested in mistral-evals are comparing it to the repositories listed below

ContextualAI / CLAIR_and_APO
Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment
☆60Updated last year
allenai / IFBench
☆88Updated last week
bespokelabsai / verifiers
Verifiers for LLM Reinforcement Learning
☆79Updated 7 months ago
samchaineau / llm_slerp_generation
Repo hosting code and materials related to speeding up LLM inference using token merging.
☆37Updated last month
LLM360 / k2-train
☆52Updated last year
TRI-ML / linear_open_lm
A repository for research on medium sized language models.
☆78Updated last year
SeunghyunSEO / optimized_hf_llama_class_for_training
☆48Updated last year
IST-DASLab / RoSA
Official implementation of the ICML 2024 paper RoSA (Robust Adaptation)
☆44Updated last year
felipemaiapolo / tinyBenchmarks
Evaluating LLMs with fewer examples
☆168Updated last year
arcee-ai / DAM
☆55Updated last year
jeffreysijuntan / lloco
The official repo for "LLoCo: Learning Long Contexts Offline"
☆118Updated last year
sanyalsunny111 / LLM-Inheritune
This is the official repository for Inheritune.
☆115Updated 9 months ago
allenai / infinigram-api
☆82Updated this week
siyan-zhao / prepacking
The source code of our work "Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models" [AISTATS …
☆60Updated last year
thepowerfuldeez / OLMo
My fork of Allen AI's OLMo for educational purposes.
☆30Updated 11 months ago
JayZhang42 / SLED
SLED: Self Logits Evolution Decoding for Improving Factuality in Large Language Model https://arxiv.org/pdf/2411.02433
☆110Updated 11 months ago
tyler-romero / microR1
Simple repository for training small reasoning models
☆45Updated 9 months ago
RobertCsordas / moeut
☆88Updated last year
para-lost / ReBase
ReBase: Training Task Experts through Retrieval Based Distillation
☆29Updated 9 months ago
scitix / MEAP
Mask-Enhanced Autoregressive Prediction: Pay Less Attention to Learn More
☆33Updated 6 months ago
hughbzhang / o1_inference_scaling_laws
Replicating O1 inference-time scaling laws
☆90Updated 11 months ago
MaxBelitsky / cache-steering
KV Cache Steering for Inducing Reasoning in Small Language Models
☆42Updated 3 months ago
SalesforceAIResearch / Elastic-Reasoning
Make reasoning models scalable
☆47Updated 5 months ago
facebookresearch / RAM
A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM).
☆297Updated last week
locuslab / scaling_laws_data_filtering
☆65Updated last year
kyegomez / Infini-attention
Implementation of the paper: "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" from Google in pyTO…
☆56Updated last month
foundation-model-stack / bamba
Train, tune, and infer Bamba model
☆136Updated 5 months ago
withmartian / routerbench
The code for the paper ROUTERBENCH: A Benchmark for Multi-LLM Routing System
☆150Updated last year
fangyuan-ksgk / Tiny-GRPO
minimal GRPO implementation from scratch
☆99Updated 8 months ago
HazyResearch / aioli
Aioli: A unified optimization framework for language model data mixing
☆28Updated 10 months ago