waltonfuture / Diff-eRank
[NeurIPS 2024] A Novel Rank-Based Metric for Evaluating Large Language Models
☆32Updated last month
Alternatives and similar repositories for Diff-eRank:
Users that are interested in Diff-eRank are comparing it to the libraries listed below
- An Easy-to-use Hallucination Detection Framework for LLMs.☆48Updated 7 months ago
- Source code for MMEvalPro, a more trustworthy and efficient benchmark for evaluating LMMs☆22Updated 2 months ago
- The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint"☆33Updated 11 months ago
- Official Implementation for EMNLP 2024 (main) "AgentReview: Exploring Academic Peer Review with LLM Agent."☆39Updated last month
- This is the official repo of "QuickLLaMA: Query-aware Inference Acceleration for Large Language Models"☆41Updated 5 months ago
- [NeurIPS 2024 Spotlight] Code and data for the paper "Finding Transformer Circuits with Edge Pruning".☆29Updated last week
- [NeurIPS2024] Twin-Merging: Dynamic Integration of Modular Expertise in Model Merging☆43Updated 2 weeks ago
- [ICML 2024] Unveiling and Harnessing Hidden Attention Sinks: Enhancing Large Language Models without Training through Attention Calibrati…☆27Updated 5 months ago
- ☆34Updated last month
- Codebase for Instruction Following without Instruction Tuning☆33Updated 2 months ago
- [EMNLP 2023 Main] Sparse Low-rank Adaptation of Pre-trained Language Models☆70Updated 9 months ago
- MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation☆22Updated last week
- The source code of "Merging Experts into One: Improving Computational Efficiency of Mixture of Experts (EMNLP 2023)":☆34Updated 8 months ago
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.☆73Updated last month
- MATH-Vision dataset and code to measure Multimodal Mathematical Reasoning capabilities.☆73Updated 2 months ago
- This the implementation of LeCo☆29Updated 4 months ago
- [EMNLP 2024] mDPO: Conditional Preference Optimization for Multimodal Large Language Models.☆48Updated last month
- [ICLR 2024 Spotlight] Code for the paper "Merge, Then Compress: Demystify Efficient SMoE with Hints from Its Routing Policy"☆67Updated 6 months ago
- ☆34Updated 2 months ago
- The this is the official implementation of "DAPE: Data-Adaptive Positional Encoding for Length Extrapolation"☆33Updated 2 months ago
- We introduce ScaleQuest, a scalable, novel and cost-effective data synthesis method to unleash the reasoning capability of LLMs.☆54Updated last month
- Large Language Models Can Self-Improve in Long-context Reasoning☆56Updated 3 weeks ago
- Enhancing Large Vision Language Models with Self-Training on Image Comprehension.☆61Updated 6 months ago
- code for ACL24 "MELoRA: Mini-Ensemble Low-Rank Adapter for Parameter-Efficient Fine-Tuning"☆15Updated 7 months ago
- ☆19Updated 5 months ago
- This is the repo for our paper "Mr-Ben: A Comprehensive Meta-Reasoning Benchmark for Large Language Models"☆43Updated last month
- ☆20Updated 5 months ago
- ☆27Updated last year
- Code and Data Repo for NeurIPS 2024 Paper "Embedding Trajectory for Out-of-Distribution Detection in Mathematical Reasoning"☆19Updated 6 months ago