ShuoTang123 / MATRIXLinks
Implementation of the MATRIX framework (ICML 2024)
☆53Updated last year
Alternatives and similar repositories for MATRIX
Users that are interested in MATRIX are comparing it to the libraries listed below
Sorting:
- ☆32Updated 7 months ago
- ☆44Updated 3 months ago
- ☆42Updated 7 months ago
- A Framework for LLM-based Multi-Agent Reinforced Training and Inference☆89Updated last week
- ☆52Updated last week
- ☆131Updated 3 weeks ago
- ☆17Updated last year
- [NeurIPS 2024 Oral] Aligner: Efficient Alignment by Learning to Correct☆177Updated 4 months ago
- ☆26Updated last year
- Official Implementation for EMNLP 2024 (main) "AgentReview: Exploring Academic Peer Review with LLM Agent."☆66Updated 6 months ago
- A Survey on the Honesty of Large Language Models☆57Updated 5 months ago
- [ACL'24] Chain of Thought (CoT) is significant in improving the reasoning abilities of large language models (LLMs). However, the correla…☆46Updated 3 weeks ago
- This is the official implementation of the paper "S²R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning"☆64Updated last month
- ☆57Updated this week
- [ACL 2025] Data and Code for Paper VLSBench: Unveiling Visual Leakage in Multimodal Safety☆40Updated 2 weeks ago
- Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".☆54Updated 6 months ago
- ☆46Updated 7 months ago
- [ACL'24] Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization☆79Updated 9 months ago
- Awesome-Efficient-Inference-for-LRMs is a collection of state-of-the-art, novel, exciting, token-efficient methods for Large Reasoning Mo…☆64Updated last week
- ☆105Updated 2 months ago
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning☆69Updated 3 months ago
- This is a unified platform for implementing and evaluating test-time reasoning mechanisms in Large Language Models (LLMs).☆18Updated 4 months ago
- RWKU: Benchmarking Real-World Knowledge Unlearning for Large Language Models. NeurIPS 2024☆75Updated 8 months ago
- ☆52Updated last week
- A Sober Look at Language Model Reasoning☆52Updated last week
- Code for "CREAM: Consistency Regularized Self-Rewarding Language Models", ICLR 2025.☆22Updated 3 months ago
- Pytorch implementation of Tree Preference Optimization (TPO) (Accepyed by ICLR'25)☆17Updated last month
- Benchmarking LLMs' Gaming Ability in Multi-Agent Environments☆75Updated last month
- Toolkit for evaluating the trustworthiness of generative foundation models.☆101Updated 3 weeks ago
- Accepted LLM Papers in NeurIPS 2024☆37Updated 7 months ago