MiuLab / LLM-EvalLinks

☆15

Alternatives and similar repositories for LLM-Eval

Users that are interested in LLM-Eval are comparing it to the libraries listed below

Sorting:

foundation-model-stack / bamba
Train, tune, and infer Bamba model
☆134Updated 4 months ago
ContextualAI / CLAIR_and_APO
Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment
☆60Updated last year
facebookresearch / matrix
Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…
☆96Updated last week
TheDuckAI / arb
Advanced Reasoning Benchmark Dataset for LLMs
☆46Updated last year
HishamAlyahya / semantic_backprop
Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖
☆76Updated 10 months ago
LLM360 / amber-data-prep
Data preparation code for Amber 7B LLM
☆92Updated last year
arcee-ai / DAM
☆55Updated 11 months ago
akjindal53244 / Arithmo
Small and Efficient Mathematical Reasoning LLMs
☆72Updated last year
SeunghyunSEO / optimized_hf_llama_class_for_training
☆48Updated last year
LLM360 / crystalcoder-train
Pre-training code for CrystalCoder 7B LLM
☆55Updated last year
allenai / DataDecide
☆35Updated last month
nexusflowai / NexusBench
Nexusflow function call, tool use, and agent benchmarks.
☆29Updated 10 months ago
UbiquitousLearning / SLM_Survey
☆97Updated last year
bespokelabsai / verifiers
Verifiers for LLM Reinforcement Learning
☆74Updated 6 months ago
LLM360 / crystalcoder-data-prep
Data preparation code for CrystalCoder 7B LLM
☆45Updated last year
salesforce / summary-of-a-haystack
Codebase accompanying the Summary of a Haystack paper.
☆79Updated last year
allenai / infinigram-api
☆80Updated this week
HazyResearch / aioli
Aioli: A unified optimization framework for language model data mixing
☆27Updated 9 months ago
kyegomez / Infini-attention
Implementation of the paper: "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" from Google in pyTO…
☆56Updated this week
patronus-ai / Lynx-hallucination-detection
☆43Updated last year
GAIR-NLP / Entropy-ABF
Official implementation for 'Extending LLMs’ Context Window with 100 Samples'
☆80Updated last year
tval2 / contextual-pruning
Library to facilitate pruning of LLMs based on context
☆32Updated last year
TRI-ML / linear_open_lm
A repository for research on medium sized language models.
☆78Updated last year
UpstageAI / evalverse-IFEval
Submodule of evalverse forked from [google-research/instruction_following_eval](https://github.com/google-research/google-research/tree/m…
☆14Updated last year
metal-chart-generation / metal
☆40Updated 4 months ago
JacobPfau / fillerTokens
☆72Updated last year
sanyalsunny111 / LLM-Inheritune
This is the official repository for Inheritune.
☆115Updated 8 months ago
felipemaiapolo / tinyBenchmarks
Evaluating LLMs with fewer examples
☆163Updated last year
mistralai / mistral-evals
☆77Updated last month
padas-lab-de / ir-rag-sigir24-persona-rag
☆50Updated last year