Using an LLM to evaluate the MMLU dataset.
☆42 · Mar 8, 2024 · Updated 2 years ago
Alternatives and similar repositories for llm_evaluation_4_mmlu
Users who are interested in llm_evaluation_4_mmlu are comparing it to the libraries listed below.
- Code for our paper "Decomposing The Dark Matter of Sparse Autoencoders" ☆23 · Feb 6, 2025 · Updated last year
- A website for accessing many models through an API (DeepSeek, Qwen, Hunyuan, etc.) ☆17 · Jul 12, 2025 · Updated 7 months ago
- [arxiv: 2512.19673] Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies ☆61 · Feb 6, 2026 · Updated last month
- Multicast learning in a network programming course ☆10 · Oct 30, 2020 · Updated 5 years ago
- ☆15 · Nov 18, 2025 · Updated 3 months ago
- [CVPR 2026] FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection ☆25 · Feb 10, 2026 · Updated last month
- ☆43 · Nov 1, 2022 · Updated 3 years ago
- NSCSCC "Loongson Cup" 2024 individual competition, LoongArch track, third prize ☆14 · Aug 17, 2024 · Updated last year
- Official Implementation of Avoiding spurious correlations via logit correction ☆17 · May 6, 2023 · Updated 2 years ago
- An Ultra-Long Output Reinforcement Learning Approach ☆23 · Jul 31, 2025 · Updated 7 months ago
- ☆36 · Jan 13, 2026 · Updated last month
- Entry for the 2023 Loongson Cup, MIPS track ☆14 · Dec 23, 2023 · Updated 2 years ago
- Chinese Generation Evaluation ☆13 · Aug 14, 2023 · Updated 2 years ago
- [NAACL'25 🏆 SAC Award] Official code for "Advancing MoE Efficiency: A Collaboration-Constrained Routing (C2R) Strategy for Better Expert… ☆15 · Feb 4, 2025 · Updated last year
- [NeurIPS 2024] Fight Back Against Jailbreaking via Prompt Adversarial Tuning ☆11 · Oct 29, 2024 · Updated last year
- DevKit for SoccerNet Team Action Spotting Challenge 2025 ☆18 · Aug 26, 2025 · Updated 6 months ago
- Third-prize entry in the 2022 Loongson Cup individual competition ☆14 · Oct 11, 2023 · Updated 2 years ago
- Introduction to the workflow for deploying accelerators with Vitis ☆11 · Jan 10, 2025 · Updated last year
- Trains Sparse Autoencoders based on outputs from language models ☆11 · Oct 7, 2024 · Updated last year
- LLM-related material, including code and docs ☆13 · Feb 25, 2025 · Updated last year
- Adapter board exposing SATA M.2 SSD on FMC board-to-board connector ☆15 · Aug 7, 2023 · Updated 2 years ago
- Diverse Demonstrations Improve In-context Compositional Generalization ☆12 · Jul 7, 2023 · Updated 2 years ago
- 2022 Loongson Cup individual competition: single-issue, 110M (with icache) ☆48 · Aug 22, 2022 · Updated 3 years ago
- Diagnostic Framework for LLMs and MLLMs ☆32 · Mar 2, 2026 · Updated last week
- 💬 MCP server for sending notifications to Weixin, Telegram, Bark, Lark, 飞书 (Feishu), 钉钉 (DingTalk) ☆30 · Feb 24, 2026 · Updated 2 weeks ago
- [ICDCS 2023] DeAR: Accelerating Distributed Deep Learning with Fine-Grained All-Reduce Pipelining ☆12 · Dec 4, 2023 · Updated 2 years ago
- Official Implementation of NIPS 2022 paper Pre-activation Distributions Expose Backdoor Neurons ☆15 · Jan 13, 2023 · Updated 3 years ago
- ☆12 · Jul 20, 2022 · Updated 3 years ago
- Fairer Preferences Elicit Improved Human-Aligned Large Language Model Judgments (Zhou et al., EMNLP 2024) ☆14 · Oct 3, 2024 · Updated last year
- Tsinghua University 8th Artificial Intelligence Challenge, Department of Electronic Engineering track (formerly the department's 26th Team Programming Contest, teamstyle26) ☆17 · Feb 27, 2026 · Updated last week
- Mamba support for transformer lens ☆19 · Sep 17, 2024 · Updated last year
- A JPEG-LS plugin for the Python Pillow library ☆16 · Dec 31, 2023 · Updated 2 years ago
- NSCSCC 2020 - Yet Another MIPS Processor ☆14 · Aug 7, 2021 · Updated 4 years ago
- FT2232HL JTAG & UART Downloader ☆20 · Jul 18, 2021 · Updated 4 years ago
- NSCSCC 2023 The Second Prize. TEAM PUA FROM HDU. ☆13 · Mar 29, 2025 · Updated 11 months ago
- Multi-bit language model watermarking (NAACL 24) ☆17 · Sep 20, 2024 · Updated last year
- ☆63 · Jan 26, 2026 · Updated last month
- Code of EMNLP 2025 paper 'UltraIF: Advancing Instruction Following from the Wild'. ☆21 · Apr 3, 2025 · Updated 11 months ago
- Groundhog - Serial ATA Host Bus Adapter ☆24 · Jun 10, 2018 · Updated 7 years ago