killthefullmoon / PhyXLinks
PhyX: Does Your Model Have the "Wits" for Physical Reasoning?
☆49Updated 2 weeks ago
Alternatives and similar repositories for PhyX
Users that are interested in PhyX are comparing it to the libraries listed below
Sorting:
- "what, how, where, and how well? a survey on test-time scaling in large language models" repository☆83Updated this week
- ☆346Updated 5 months ago
- A Sober Look at Language Model Reasoning☆92Updated last month
- [ICML'25] Our study systematically investigates massive values in LLMs' attention mechanisms. First, we observe massive values are concen…☆84Updated 6 months ago
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning☆88Updated 10 months ago
- ☆46Updated 9 months ago
- Towards a Unified View of Large Language Model Post-Training☆199Updated 4 months ago
- Official Repository of "Learning what reinforcement learning can't"☆75Updated last week
- [AAAI 2026] Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning".☆94Updated 2 months ago
- 🚀 LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of Mixture-of-Experts with Post-Training☆91Updated last year
- [NeurIPS'25 Spotlight] ARM: Adaptive Reasoning Model☆63Updated 2 months ago
- [NeurIPS 2024] A Novel Rank-Based Metric for Evaluating Large Language Models☆56Updated 7 months ago
- [TMLR 25] SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models☆146Updated 3 months ago
- repo for paper https://arxiv.org/abs/2504.13837☆314Updated 3 weeks ago
- ☆176Updated last month
- ☆114Updated 3 months ago
- [EMNLP 2025] LightThinker: Thinking Step-by-Step Compression☆127Updated 9 months ago
- Official Repository of LatentSeek☆73Updated 7 months ago
- [ICML 2025] M-STAR (Multimodal Self-Evolving TrAining for Reasoning) Project. Diving into Self-Evolving Training for Multimodal Reasoning☆69Updated 5 months ago
- [NeurIPS 2024] MATH-Vision dataset and code to measure multimodal mathematical reasoning capabilities.☆128Updated 7 months ago
- ☆201Updated 2 weeks ago
- ☆143Updated 3 months ago
- [ACL'25] We propose a novel fine-tuning method, Separate Memory and Reasoning, which combines prompt tuning with LoRA.☆82Updated 2 months ago
- L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning☆257Updated 7 months ago
- The official repository of paper "Pass@k Training for Adaptively Balancing Exploration and Exploitation of Large Reasoning Models''☆110Updated 4 months ago
- [NeurIPS 2025] Think Silently, Think Fast: Dynamic Latent Compression of LLM Reasoning Chains☆67Updated 5 months ago
- A curated list of awesome LLM Inference-Time Self-Improvement (ITSI, pronounced "itsy") papers from our recent survey: A Survey on Large …☆98Updated last year
- [TMLR 2025] Efficient Reasoning Models: A Survey☆290Updated last week
- End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning☆348Updated 3 months ago
- ☆138Updated 10 months ago