VILA-Lab / Open-LLM-LeaderboardLinks
Open-LLM-Leaderboard: Open-Style Question Evaluation. Paper at https://arxiv.org/abs/2406.07545
☆45Updated 11 months ago
Alternatives and similar repositories for Open-LLM-Leaderboard
Users that are interested in Open-LLM-Leaderboard are comparing it to the libraries listed below
Sorting:
- [NAACL 2025] A Closer Look into Mixture-of-Experts in Large Language Models☆52Updated 4 months ago
- [ICML 2025] Teaching Language Models to Critique via Reinforcement Learning☆98Updated last month
- [ACL 2025] Are Your LLMs Capable of Stable Reasoning?☆25Updated 2 months ago
- Official PyTorch Implementation of EMoE: Unlocking Emergent Modularity in Large Language Models [main conference @ NAACL2024]☆31Updated last year
- SLED: Self Logits Evolution Decoding for Improving Factuality in Large Language Model https://arxiv.org/pdf/2411.02433☆25Updated 6 months ago
- What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective☆64Updated 3 months ago
- ☆105Updated 2 months ago
- [ICLR 2025] LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization☆37Updated 3 months ago
- ☆100Updated last week
- Long Context Extension and Generalization in LLMs☆56Updated 8 months ago
- HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models☆45Updated 6 months ago
- ☆37Updated last year
- [NeurIPS 2024] A Novel Rank-Based Metric for Evaluating Large Language Models☆46Updated last week
- Interpretable Contrastive Monte Carlo Tree Search Reasoning☆48Updated 7 months ago
- ☆107Updated 2 weeks ago
- This the implementation of LeCo☆31Updated 4 months ago
- [ICLR 2025] SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction☆70Updated 2 months ago
- Advancing Language Model Reasoning through Reinforcement Learning and Inference Scaling☆104Updated 4 months ago
- Official repository for MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models [NeurIPS 2024]☆70Updated 6 months ago
- Code for preprint "Metadata Conditioning Accelerates Language Model Pre-training (MeCo)"☆39Updated last month
- ☆61Updated 7 months ago
- ☆17Updated 5 months ago
- Codebase for Instruction Following without Instruction Tuning☆34Updated 8 months ago
- [ACL-25] We introduce ScaleQuest, a scalable, novel and cost-effective data synthesis method to unleash the reasoning capability of LLMs.☆63Updated 7 months ago
- A Sober Look at Language Model Reasoning☆63Updated last week
- [NeurIPS 2024] Knowledge Circuits in Pretrained Transformers☆148Updated 3 months ago
- The source code of "Merging Experts into One: Improving Computational Efficiency of Mixture of Experts (EMNLP 2023)":☆38Updated last year
- EMNLP 2024: Model Editing Harms General Abilities of Large Language Models: Regularization to the Rescue☆35Updated 2 weeks ago
- [NeurIPS 2024] CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs☆115Updated last month
- [NeurIPS'24] Weak-to-Strong Search: Align Large Language Models via Searching over Small Language Models☆61Updated 5 months ago