percent4 / llm_evaluation_4_mmluLinks
Using LLM to evaluate MMLU dataset.
☆33Updated last year
Alternatives and similar repositories for llm_evaluation_4_mmlu
Users that are interested in llm_evaluation_4_mmlu are comparing it to the libraries listed below
Sorting:
- 🔥 How to efficiently and effectively compress the CoTs or directly generate concise CoTs during inference while maintaining the reasonin…☆55Updated last month
- Paper list for Efficient Reasoning.☆548Updated 3 weeks ago
- Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It contains…☆235Updated last month
- awesome papers in LLM interpretability☆522Updated 3 weeks ago
- 《EasyOffer》(<大模型面经合集>)是针对LLM宝宝们量身打造的大模型暑期实习Offer指南,主要记录大模型暑期实习和秋招准备的一些常见大厂手撕代码、大厂面经经验、常见大厂思考题等;小白一个,正在学习ing......有问题各位大佬随时指正,希望大家都能拿到心仪Of…☆273Updated 3 months ago
- ☆21Updated 4 months ago
- ☆51Updated last month
- Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models☆519Updated 2 weeks ago
- Awesome list for LLM pruning.☆241Updated 7 months ago
- LongBench v2 and LongBench (ACL 25'&24')☆927Updated 6 months ago
- ☆144Updated 10 months ago
- personal chatgpt☆377Updated 7 months ago
- pytorch distribute tutorials☆142Updated last month
- Chain of Thoughts (CoT) is so hot! so long! We need short reasoning process!☆63Updated 3 months ago
- LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment☆357Updated last year
- A series of technical report on Slow Thinking with LLM☆708Updated last month
- 📰 Must-read papers and blogs on Speculative Decoding ⚡️☆828Updated 3 weeks ago
- A live reading list for LLM-synthetic-data.☆308Updated last week
- ☆44Updated last year
- Fast inference from large lauguage models via speculative decoding☆779Updated 10 months ago
- [ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning☆463Updated 8 months ago
- 😎 A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond☆268Updated last week
- ☆348Updated 5 months ago
- Official implementation of the ICLR paper "Streamlining Redundant Layers to Compress Large Language Models"☆30Updated 2 months ago
- [ICLR 2025] Code and Data Repo for Paper "Latent Space Chain-of-Embedding Enables Output-free LLM Self-Evaluation"☆68Updated 7 months ago
- Spec-Bench: A Comprehensive Benchmark and Unified Evaluation Platform for Speculative Decoding (ACL 2024 Findings)☆285Updated 2 months ago
- ⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training (EMNLP 2024)☆973Updated 7 months ago
- 对llama3进行全参微调、lora微调以及qlora微调。☆203Updated 9 months ago
- Latest Advances on Long Chain-of-Thought Reasoning☆432Updated last week
- Awesome RL-based LLM Reasoning☆561Updated 2 months ago