tongjingqi/Game-RL

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/tongjingqi/Game-RL)

tongjingqi / Game-RL

Game-RL: Synthesizing Multimodal Verifiable Game Data to Boost VLMs' General Reasoning

☆155

Alternatives and similar repositories for Game-RL

Users that are interested in Game-RL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

hkust-nlp / Laser
View on GitHub
[ICLR2026] Laser: Learn to Reason Efficiently with Adaptive Length-based Reward Shaping
☆66May 22, 2025Updated last year
STARE-bench / STARE
View on GitHub
☆19Oct 12, 2025Updated 8 months ago
hkust-nlp / mstar
View on GitHub
[ICML 2025] M-STAR (Multimodal Self-Evolving TrAining for Reasoning) Project. Diving into Self-Evolving Training for Multimodal Reasoning
☆75Jul 13, 2025Updated 11 months ago
zzzhr97 / SpecBench
View on GitHub
Code repository for the ICML 2026 paper "Reasoning over Boundaries: Enhancing Specification Alignment via Test-time Deliberation".
☆24Jun 14, 2026Updated 3 weeks ago
TingchenFu / MathIF
View on GitHub
instruction-following benchmark for large reasoning models
☆48Apr 19, 2026Updated 2 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Mikivishy / FullFront
View on GitHub
The official code repository for the FullFront benchmark
☆27May 16, 2025Updated last year
ssmisya / PRMBench
View on GitHub
[ACL' 25] The official code repository for PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models.
☆92Feb 15, 2025Updated last year
EnVision-Research / TiViBench
View on GitHub
[CVPR 2026] TiViBench: Benchmarking Think-in-Video Reasoning for Video Generative Models
☆67Feb 21, 2026Updated 4 months ago
OpenLMLab / ParallelTokenizer
View on GitHub
Use the tokenizer in parallel to achieve superior acceleration
☆20Mar 21, 2024Updated 2 years ago
OpenSparseLLMs / Skip-DiT
View on GitHub
✈️ [ICCV 2025] Towards Stabilized and Efficient Diffusion Transformers through Long-Skip-Connections with Spectral Constraints
☆80Jul 10, 2025Updated 11 months ago
llmeval / LLMEval-Fair
View on GitHub
[ACL 2026] A large-scale longitudinal study on robust and fair evaluation of LLMs — 200K+ generative questions across 13 disciplines
☆40May 21, 2026Updated last month
tongjingqi / Awesome-Agent-RL
View on GitHub
A curated list of awesome resources about reward construction for AI agents. This repository covers cutting-edge research, and practical …
☆59Sep 1, 2025Updated 10 months ago
llmeval / LLMEval-Med
View on GitHub
[EMNLP 2025] A real-world clinical benchmark for medical LLMs with physician validation — 2,996 questions from EHRs
☆28May 21, 2026Updated last month
DeepSoftwareAnalytics / swe-factory
View on GitHub
[FSE'2026] SWE-Factory: Your Automated Factory for Issue Resolution Training Data and Evaluation Benchmarks
☆182May 12, 2026Updated last month
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
EMMA-Bench / EMMA
View on GitHub
[ICML 2025 Oral] The official repository for the paper "Can MLLMs Reason in Multimodality? EMMA: An Enhanced MultiModal ReAsoning Benchma…
☆69Jul 17, 2025Updated 11 months ago
HumanMLLM / LOVE-R1
View on GitHub
Official repository of paper "LOVE-R1: Advancing Long Video Understanding with Adaptive Zoom-in Mechanism via Multi-Step Reasoning"
☆24Nov 1, 2025Updated 8 months ago
MajorDavidZhang / MCL
View on GitHub
code for Learning the Unlearned: Mitigating Feature Suppression in Contrastive Learning
☆20Jul 16, 2024Updated last year
tongjingqi / Thinking-with-Video
View on GitHub
We introduce 'Thinking with Video', a new paradigm leveraging video generation for multimodal reasoning. Our VideoThinkBench shows that S…
☆311Jun 21, 2026Updated 2 weeks ago
OpenSparseLLMs / LLaMA-MoE-v2
View on GitHub
🚀 LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of Mixture-of-Experts with Post-Training
☆93Dec 3, 2024Updated last year
penghao-wu / GUI_Reflection
View on GitHub
☆34Sep 19, 2025Updated 9 months ago
zhaochen0110 / Timo
View on GitHub
Code and data for "Timo: Towards Better Temporal Reasoning for Language Models" (COLM 2024)
☆26Oct 23, 2024Updated last year
UNITES-Lab / MoE-RBench
View on GitHub
[ICML 2024] Code for the paper "MoE-RBench: Towards Building Reliable Language Models with Sparse Mixture-of-Experts"
☆11Jul 1, 2024Updated 2 years ago
lcqysl / DiffThinker
View on GitHub
[ICML 2026] Official repo for "DiffThinker: Towards Generative Multimodal Reasoning with Diffusion Models"
☆184Jan 4, 2026Updated 6 months ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
NJU-RL / GLIDER
View on GitHub
[ICML 2025] Official Implementation of GLIDER
☆73Oct 9, 2025Updated 9 months ago
Simplified-Reasoning / LUFFY
View on GitHub
Official Repository of "Learning to Reason under Off-Policy Guidance"
☆459Mar 20, 2026Updated 3 months ago
Linzwcs / echos
View on GitHub
Echos is a headless, API-driven DAW engine. It’s the backend for building AI tools that automate the entire music production lifecycle.
☆55Nov 10, 2025Updated 7 months ago
microsoft / SWE-bench-Live
View on GitHub
[NeurIPS 2025 D&B] 🚀 SWE-bench Goes Live!
☆205Jun 11, 2026Updated 3 weeks ago
hewei2001 / ReachQA
View on GitHub
[EMNLP 2025] Distill Visual Chart Reasoning Ability from LLMs to MLLMs
☆61Aug 25, 2025Updated 10 months ago
ssmisya / VLMLT
View on GitHub
[CVPR' 25] Official repo for From Head to Tail: Towards Balanced Representation in Large Vision-Language Models through Adaptive Data Cal…
☆22Jun 6, 2025Updated last year
GuanZhengChen / GGPN
View on GitHub
☆10Dec 11, 2021Updated 4 years ago
BryceZhuo / HybridNorm
View on GitHub
The official implementation of HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization
☆19Mar 7, 2025Updated last year
zhaochen0110 / OpenThinkIMG
View on GitHub
OpenThinkIMG is an end-to-end open-source framework that empowers LVLMs to think with images.
☆394Jun 1, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
Shawn-Guo-CN / Lossless_Text_Compression_with_Transformer
View on GitHub
This repo is to demo the concept of lossless compression with Transformers as encoder and decoder.
☆14May 2, 2024Updated 2 years ago
weigao266 / Awesome-Efficient-Arch
View on GitHub
Speed Always Wins: A Survey on Efficient Architectures for Large Language Models
☆408Nov 11, 2025Updated 7 months ago
rtcatc / wxappUnpacker
View on GitHub
小程序反编译(支持分包)
☆14May 7, 2021Updated 5 years ago
llmeval / Llmeval-Gaokao2024-Math
View on GitHub
LLM evaluation on 2024 Chinese Gaokao Mathematics — zero-contamination benchmark with dual prompt formats
☆21Apr 15, 2026Updated 2 months ago
NJU-RL / T2DA
View on GitHub
[NeurIPS 2025] Official codebase for T2DA: Offline Meta-RL from Natural Language Supervision
☆17Jun 1, 2025Updated last year
Yijia-Xiao / LogicVista
View on GitHub
☆18Aug 1, 2024Updated last year
OpenSparseLLMs / Open-Pandora
View on GitHub
Open-Pandora: On-the-fly Control Video Generation
☆35Nov 28, 2024Updated last year