VILA-Lab / Open-LLM-Leaderboard
Open-LLM-Leaderboard: Open-Style Question Evaluation. Paper at https://arxiv.org/abs/2406.07545
☆48 Updated last year
Alternatives and similar repositories for Open-LLM-Leaderboard
Users who are interested in Open-LLM-Leaderboard are comparing it to the repositories listed below.
- [NAACL 2025] A Closer Look into Mixture-of-Experts in Large Language Models ☆55 Updated 9 months ago
- Code and Data for "Long-context LLMs Struggle with Long In-context Learning" [TMLR2025] ☆110 Updated 9 months ago
- ☆65 Updated last year
- ☆104 Updated 11 months ago
- [NeurIPS 2024] A Novel Rank-Based Metric for Evaluating Large Language Models ☆54 Updated 5 months ago
- [ACL'25 Oral] What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective ☆74 Updated 4 months ago
- Codebase for Instruction Following without Instruction Tuning ☆36 Updated last year
- ☆30 Updated last year
- RL Scaling and Test-Time Scaling (ICML'25) ☆112 Updated 10 months ago
- ☆131 Updated 8 months ago
- [ICLR 2025] SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction ☆83 Updated 8 months ago
- This is the official repo of "QuickLLaMA: Query-aware Inference Acceleration for Large Language Models" ☆55 Updated last year
- Co-LLM: Learning to Decode Collaboratively with Multiple Language Models ☆123 Updated last year
- [ACL 2025] Are Your LLMs Capable of Stable Reasoning? ☆31 Updated 3 months ago
- ☆136 Updated 2 months ago
- [NeurIPS 2024] Knowledge Circuits in Pretrained Transformers ☆159 Updated last week
- A dataset of LLM-generated chain-of-thought steps annotated with mistake location. ☆83 Updated last year
- Official PyTorch Implementation of EMoE: Unlocking Emergent Modularity in Large Language Models [main conference @ NAACL2024] ☆36 Updated last year
- [ICML 2025] Teaching Language Models to Critique via Reinforcement Learning ☆116 Updated 6 months ago
- [ICML 2025] Predictive Data Selection: The Data That Predicts Is the Data That Teaches ☆57 Updated 8 months ago
- A curated list of the role of small models in the LLM era ☆109 Updated last year
- This is the implementation for the paper "LARGE LANGUAGE MODEL CASCADES WITH MIXTURE OF THOUGHT REPRESENTATIONS FOR COST-EFFICIENT REA… ☆27 Updated last year
- [NeurIPS 2024 Main Track] Code for the paper titled "Instruction Tuning With Loss Over Instructions" ☆38 Updated last year
- [ICLR 2025] InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales ☆131 Updated 9 months ago
- Scalable Meta-Evaluation of LLMs as Evaluators ☆42 Updated last year
- Official repository for MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models [NeurIPS 2024] ☆76 Updated last year
- Code implementation of synthetic continued pretraining ☆138 Updated 10 months ago
- [ACL 2025 Main] Repository for the paper: 500xCompressor: Generalized Prompt Compression for Large Language Models ☆52 Updated 5 months ago
- ☆41 Updated 2 years ago
- [ICLR 2025 Oral] "Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free" ☆82 Updated last year