killthefullmoon/PhyX

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/killthefullmoon/PhyX)

killthefullmoon / PhyX

PhyX: Does Your Model Have the "Wits" for Physical Reasoning?

☆54

Alternatives and similar repositories for PhyX

Users that are interested in PhyX are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

AIoT-MLSys-Lab / Famba-V
View on GitHub
[ECCV 2024 Workshop Best Paper Award] Famba-V: Fast Vision Mamba with Cross-Layer Token Fusion
☆34Sep 30, 2024Updated last year
dxzxy12138 / PhysReason
View on GitHub
PhysReason Becnhmark
☆19Jul 8, 2025Updated last year
menik1126 / Swing-Bench
View on GitHub
[ICLR2026🔥Oral] SwingArena: Competitive Programming Arena for Long-context GitHub Issue Solving
☆15Feb 26, 2026Updated 4 months ago
drogozhang / Criminal-Intelligence-QA-System
View on GitHub
Demo for advanced Java final project in 18-19 1 of Canghong Jin
☆25Nov 18, 2018Updated 7 years ago
SUSTechBruce / SRPO_MLLMs
View on GitHub
[NeurIPS 2025🔥]Main source code of SRPO framework.
☆192Nov 25, 2025Updated 7 months ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
killthefullmoon / MMSpec
View on GitHub
MMSpec: Benchmarking Speculative Decoding for Vision-Language Models
☆41Jul 2, 2026Updated 3 weeks ago
zzli2022 / TLDR
View on GitHub
Code for Research Project TLDR
☆26Jul 28, 2025Updated 11 months ago
AIoT-MLSys-Lab / MEDA
View on GitHub
[NAACL 2025🔥] MEDA: Dynamic KV Cache Allocation for Efficient Multimodal Long-Context Inference
☆22Jun 19, 2025Updated last year
mchiquier / llm-mutate
View on GitHub
☆15Oct 7, 2024Updated last year
xufangzhi / NLP_HW2
View on GitHub
The second Homework of NLP
☆13Jun 9, 2021Updated 5 years ago
thunlp / OHRE
View on GitHub
Source code of paper 'Open Hierarchical Relation Extraction' (NAACL 2021)
☆22Mar 4, 2022Updated 4 years ago
SUSTechBruce / LOOK-M
View on GitHub
[EMNLP 2024 Findings🔥] Official implementation of ": LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context In…
☆103Nov 9, 2024Updated last year
TIGER-AI-Lab / GenAI-Arena
View on GitHub
Interface for GenAI-Arena [NeurIPS24]
☆16Feb 27, 2024Updated 2 years ago
LARK-AI-Lab / CodeScaler
View on GitHub
The official repo for "CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models"
☆35Mar 26, 2026Updated 3 months ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
RenzeLou / AAAR-1.0
View on GitHub
The source code for running LLMs on the AAAR-1.0 benchmark.
☆20Apr 5, 2025Updated last year
hanxuhu / SeqIns
View on GitHub
The repository of the project "Fine-tuning Large Language Models with Sequential Instructions", code base comes from open-instruct and LA…
☆30Nov 24, 2024Updated last year
chengyou-jia / T2IS
View on GitHub
Official Repo for "Why Settle for One? Text-to-ImageSet Generation and Evaluation"
☆21Oct 1, 2025Updated 9 months ago
VisualSphinx / VisualSphinx
View on GitHub
☆17Jun 3, 2025Updated last year
OSU-NLP-Group / QA4RE
View on GitHub
[ACL'23 Findings] "Aligning Instruction Tasks Unlocks Large Language Models as Zero-Shot Relation Extractors"
☆39Dec 22, 2023Updated 2 years ago
AIoT-MLSys-Lab / D2O
View on GitHub
[ICLR 2025🔥] D2O: Dynamic Discriminative Operations for Efficient Long-Context Inference of Large Language Models
☆27Jul 7, 2025Updated last year
chengyou-jia / ChatGen
View on GitHub
[CVPR 2025] ChatGen: Automatic Text-to-Image Generation From FreeStyle Chatting
☆33Dec 5, 2024Updated last year
xcltql666 / DenseDiT
View on GitHub
Code for "From Ideal to Real: Unified and Data-Efficient Dense Prediction for Real-World Scenarios"
☆27Jun 7, 2026Updated last month
menik1126 / UNComp
View on GitHub
[EMNLP 2025🔥] UNComp: Can Matrix Entropy Uncover Sparsity? -- A Compressor Design from an Uncertainty-Aware Perspective
☆20Jan 7, 2026Updated 6 months ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
KlingAIResearch / PhysMaster
View on GitHub
Official repository of PhysMaster: Mastering Physical Representation for Video Generation via Reinforcement Learning
☆57Oct 16, 2025Updated 9 months ago
penghao-wu / GUI_Reflection
View on GitHub
☆34Sep 19, 2025Updated 10 months ago
CharlesQ9 / Physics-Supernova
View on GitHub
☆32Dec 7, 2025Updated 7 months ago
X-GenGroup / PaCo-RL
View on GitHub
Official Implementation for *PaCo-RL: Advancing Reinforcement Learning for Consistent Image Generation with Pairwise Reward Modeling*
☆42Dec 13, 2025Updated 7 months ago
facebookresearch / reasoning-memory
View on GitHub
Procedural Knowledge at Scale Improves ReasoningThis repository contains the minimal, end-to-end pipeline for reproducing the paper resul…
☆15Apr 1, 2026Updated 3 months ago
alibaba-damo-academy / VL-Cogito
View on GitHub
☆24Nov 4, 2025Updated 8 months ago
ZunhaiSu / Awesome-Attention-Sink
View on GitHub
🚀 First survey on Attention Sink in Transformers — 200+ papers on utilization, interpretation, and mitigation.
☆136Jun 5, 2026Updated last month
lean-dojo / lean4code
View on GitHub
Lean4 Code Editor
☆17Jul 14, 2026Updated last week
TaiMingLu / know-dont-tell
View on GitHub
☆19Oct 14, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
zhoustan / CamSAM2
View on GitHub
[NeurIPS 2025] CamSAM2: Segment Anything Accurately in Camouflaged Videos
☆21Nov 19, 2025Updated 8 months ago
WooooDyy / BMMR
View on GitHub
Code and resources for the NeurIPS 2025 Paper "BMMR: A Large-Scale Bilingual Multimodal Multi-Discipline Reasoning Dataset" by Zhiheng X…
☆18Oct 14, 2025Updated 9 months ago
TIGER-AI-Lab / VisCoder
View on GitHub
The official code of "VisCoder: Fine-Tuning LLMs for Executable Python Visualization Code Generation" [EMNLP25]
☆19Sep 21, 2025Updated 10 months ago
Embodied-Web-Agent / Embodied-Web-Agent
View on GitHub
☆40May 29, 2025Updated last year
yayayacc / MUR
View on GitHub
☆49May 14, 2026Updated 2 months ago
Eleanor-H / DAGN
View on GitHub
Official implementations for Discourse-Aware Graph Networks for Textual Logical Reasoning (TPAMI) and DAGN: Discourse-Aware Graph Network…
☆28Feb 19, 2026Updated 5 months ago
hkust-nlp / GUIMid
View on GitHub
☆22May 3, 2025Updated last year