killthefullmoon / PhyX
PhyX: Does Your Model Have the "Wits" for Physical Reasoning?
☆50 Updated last month
Alternatives and similar repositories for PhyX
Users who are interested in PhyX are comparing it to the repositories listed below
- LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of Mixture-of-Experts with Post-Training ☆91 Updated last year
- [NeurIPS'25 Spotlight] ARM: Adaptive Reasoning Model ☆64 Updated 3 months ago
- [ICML'25] Our study systematically investigates massive values in LLMs' attention mechanisms. First, we observe massive values are concen… ☆85 Updated 7 months ago
- ☆47 Updated 9 months ago
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning ☆89 Updated 11 months ago
- ☆352 Updated 6 months ago
- A Sober Look at Language Model Reasoning ☆92 Updated 2 months ago
- Towards a Unified View of Large Language Model Post-Training ☆199 Updated 4 months ago
- "What, How, Where, and How Well? A Survey on Test-Time Scaling in Large Language Models" repository ☆83 Updated this week
- [AAAI 2026] Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning". ☆95 Updated 2 months ago
- ☆144 Updated 4 months ago
- Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM* ☆109 Updated 8 months ago
- [ICLR2026] Laser: Learn to Reason Efficiently with Adaptive Length-based Reward Shaping ☆62 Updated 8 months ago
- ☆204 Updated last month
- [ICML 2025] M-STAR (Multimodal Self-Evolving TrAining for Reasoning) Project. Diving into Self-Evolving Training for Multimodal Reasoning ☆70 Updated 6 months ago
- ☆177 Updated last month
- Revisiting Mid-training in the Era of Reinforcement Learning Scaling ☆182 Updated 6 months ago
- repo for paper https://arxiv.org/abs/2504.13837 ☆325 Updated last month
- [ICLR 25 Oral] RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style ☆73 Updated 6 months ago
- JudgeLRM: Large Reasoning Models as a Judge ☆40 Updated last month
- [NeurIPS 2024] A Novel Rank-Based Metric for Evaluating Large Language Models ☆57 Updated 8 months ago
- Extrapolating RLVR to General Domains without Verifiers ☆191 Updated 5 months ago
- Pre-trained, Scalable, High-performance Reward Models via Policy Discriminative Learning. ☆164 Updated 4 months ago
- [ICLR 2025] When Attention Sink Emerges in Language Models: An Empirical View (Spotlight) ☆152 Updated 6 months ago
- [EMNLP 2024 Findings 🔥] Official implementation of "LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context In… ☆103 Updated last year
- [ICML 2024] Unveiling and Harnessing Hidden Attention Sinks: Enhancing Large Language Models without Training through Attention Calibrati… ☆46 Updated last year
- [ICLR 2026] End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning ☆351 Updated 3 weeks ago
- A curated list of awesome LLM Inference-Time Self-Improvement (ITSI, pronounced "itsy") papers from our recent survey: A Survey on Large … ☆99 Updated last year
- [EMNLP 2025] LightThinker: Thinking Step-by-Step Compression ☆131 Updated 9 months ago
- ☆141 Updated 10 months ago