☆56Aug 10, 2024Updated last year
Alternatives and similar repositories for MuSR
Users that are interested in MuSR are comparing it to the libraries listed below
Sorting:
- ☆25Apr 10, 2025Updated 10 months ago
- ☆12Mar 5, 2025Updated last year
- Bayesian scaling laws for in-context learning.☆15Mar 12, 2025Updated 11 months ago
- ☆33Updated this week
- ☆21Jun 26, 2025Updated 8 months ago
- ☆34Jan 25, 2026Updated last month
- Collections of RLxLM experiments using minimal codes☆14Feb 17, 2025Updated last year
- [NAACL 2024] CoE-SQL: In-Context Learning for Multi-Turn Text-to-SQL with Chain-of-Editions☆13May 7, 2024Updated last year
- ☆13Jul 14, 2024Updated last year
- 中文大语言模型评测第三期☆35Dec 30, 2025Updated 2 months ago
- ☆70Jun 18, 2025Updated 8 months ago
- [ICLR 2025] LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization☆43Feb 27, 2025Updated last year
- ☆17Feb 22, 2024Updated 2 years ago
- Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation".☆26Aug 9, 2025Updated 6 months ago
- A toy Inspect implementation of the Bliss Attractor eval from Claude 4 System Card Welfare Assessment☆38Jun 5, 2025Updated 9 months ago
- Regressing the Relative Future: Efficient Policy Optimization for Multi-turn RLHF☆24Oct 8, 2024Updated last year
- ☆64Feb 4, 2026Updated last month
- Suri: Multi-constraint instruction following for long-form text generation (EMNLP’24)☆27Oct 3, 2025Updated 5 months ago
- ☆54Oct 24, 2024Updated last year
- ☆52Feb 12, 2025Updated last year
- A Dynamic Visual Benchmark for Evaluating Mathematical Reasoning Robustness of Vision Language Models☆28Nov 25, 2024Updated last year
- This repo contains evaluation code for the paper "AV-Odyssey: Can Your Multimodal LLMs Really Understand Audio-Visual Information?"☆31Dec 23, 2024Updated last year
- Chinese Vision-Language Understanding Evaluation☆23Dec 26, 2024Updated last year
- ☆37Jan 25, 2024Updated 2 years ago
- ☆31Jun 12, 2024Updated last year
- ☆26Mar 17, 2023Updated 2 years ago
- [CVPR 2025] CheXWorld: Exploring Image World Modeling for Radiograph Representation Learning☆37Apr 21, 2025Updated 10 months ago
- Long Context Extension and Generalization in LLMs☆63Sep 21, 2024Updated last year
- ☆55Aug 5, 2025Updated 7 months ago
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w…☆12Jun 28, 2025Updated 8 months ago
- A Text2SQL benchmark for evaluation of Large Language Models☆41Updated this week
- User-friendly implementation of the Mixture-of-Sparse-Attention (MoSA). MoSA selects distinct tokens for each head with expert choice rou…☆28May 3, 2025Updated 10 months ago
- Official Repo for the paper: VCR: Visual Caption Restoration. Check arxiv.org/pdf/2406.06462 for details.☆32Feb 26, 2025Updated last year
- AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories☆40Aug 7, 2025Updated 6 months ago
- ☆33Sep 14, 2025Updated 5 months ago
- [NeurIPS ENLSP Workshop'24] CSKV: Training-Efficient Channel Shrinking for KV Cache in Long-Context Scenarios☆16Oct 18, 2024Updated last year
- ☆18Jun 10, 2025Updated 8 months ago
- LongProc: Benchmarking Long-Context Language Models on Long Procedural Generation☆33Feb 26, 2026Updated last week
- [ACL 2025] Are Your LLMs Capable of Stable Reasoning?☆32Aug 5, 2025Updated 7 months ago