real-absolute-AI/SynthRL

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/real-absolute-AI/SynthRL)

real-absolute-AI / SynthRL

SynthRL: Scaling Visual Reasoning with Verifiable Data Synthesis

☆70

Alternatives and similar repositories for SynthRL

Users that are interested in SynthRL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

chenyuxin1999 / Abstract_Thought
View on GitHub
[NeurIPS 2025] The implementation of paper "The Emergence of Abstract Thought in Large Language Models Beyond Any Language"
☆19Jun 9, 2025Updated last year
real-absolute-AI / NoisyRollout
View on GitHub
[NeurIPS 2025] NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation
☆112Sep 18, 2025Updated 10 months ago
AlphaLab-USTC / LRM-plans-CoT
View on GitHub
[NeurIPS 2025] The implementation of paper "On Reasoning Strength Planning in Large Reasoning Models"
☆31Jul 6, 2025Updated last year
yunfeixie233 / ViGaL
View on GitHub
☆70Feb 4, 2026Updated 5 months ago
AlphaLab-USTC / Must-Read-LLM-Papers
View on GitHub
☆19Sep 16, 2025Updated 10 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
sail-sg / AnytimeReasoner
View on GitHub
Optimizing Anytime Reasoning via Budget Relative Policy Optimization
☆54Jul 15, 2025Updated last year
JinjieNi / Quokka
View on GitHub
The official github repo for "Training Optimal Large Diffusion Language Models", the first-ever large-scale diffusion language models sca…
☆46Nov 6, 2025Updated 8 months ago
eval-sys / mcpmark
View on GitHub
MCPMark is a comprehensive, stress-testing MCP benchmark designed to evaluate model and agent capabilities in real-world MCP use.
☆452Jun 12, 2026Updated last month
AndrewWTY / UNIT
View on GitHub
☆33Jun 24, 2025Updated last year
sail-sg / tty-use
View on GitHub
☆15Oct 13, 2025Updated 9 months ago
sail-sg / ActivePRM
View on GitHub
☆21Apr 16, 2025Updated last year
real-absolute-AI / Unnatural_Language
View on GitHub
The official repository of 'Unnatural Language Are Not Bugs but Features for LLMs'
☆24May 20, 2025Updated last year
evolvent-ai / ClawMark
View on GitHub
🦞 ClawMark: A Living-World Benchmark for Multi-Day, Multimodal Coworker Agents
☆118May 28, 2026Updated last month
ModalMinds / gym-v
View on GitHub
A unified framework for vision-language environments with Gymnasium-compatible interface
☆35Mar 17, 2026Updated 4 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
sail-sg / feedback-conditional-policy
View on GitHub
Code for "Language Models Can Learn from Verbal Feedback Without Scalar Rewards"
☆65Jan 5, 2026Updated 6 months ago
AlphaLab-USTC / AlphaSteer
View on GitHub
[ICLR 2026] The implementation of paper "AlphaSteer: Learning Refusal Steering with Principled Null-Space Constraint"
☆61Nov 20, 2025Updated 8 months ago
Gray-OREO / MST-Distill
View on GitHub
Official implementation of "MST-Distill: Mixture of Specialized Teachers for Cross-Modal Knowledge Distillation" (ACM MM 2025)
☆35Mar 5, 2026Updated 4 months ago
chenyuxin1999 / S-DPO
View on GitHub
[NeurIPS 2024] The implementation of paper "On Softmax Direct Preference Optimization for Recommendation"
☆101Nov 29, 2024Updated last year
Fu-Dayuan / AgentRefine
View on GitHub
(ICLR 2025) AgentRefine: Enhancing Agent Generalization through Refinement Tuning
☆20Nov 22, 2025Updated 8 months ago
sail-sg / variational-reasoning
View on GitHub
Code for "Variational Reasoning for Language Models"
☆60Sep 29, 2025Updated 9 months ago
Xiaohao-Liu / ModalBed
View on GitHub
[MM 2025] Towards Modality Generalization: A Benchmark and Prospective Analysis
☆31May 22, 2025Updated last year
HappyPointer / LLM2Rec
View on GitHub
[KDD'25] LLM2Rec: Large Language Models Are Powerful Embedding Models for Sequential Recommendation.
☆68Sep 6, 2025Updated 10 months ago
haonan3 / V1
View on GitHub
V1: Toward Multimodal Reasoning by Designing Auxiliary Task
☆36Apr 14, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
AlphaLab-USTC / AutoWiki-skill
View on GitHub
☆61Apr 9, 2026Updated 3 months ago
Xiaohao-Liu / CLHE
View on GitHub
The implementation of paper "Leveraging Multimodal Features and Item-level User Feedback for Bundle Construction", WSDM'24.
☆18Oct 30, 2025Updated 8 months ago
SnowCharmQ / DEP
View on GitHub
[2025 EMNLP Main (oral)] Latent Inter-User Difference Modeling for LLM Personalization
☆17Sep 16, 2025Updated 10 months ago
RUCBM / DeepCritic
View on GitHub
Official repository for paper "DeepCritic: Deliberate Critique with Large Language Models"
☆41Jun 24, 2025Updated last year
sail-sg / SkyLadder
View on GitHub
The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling
☆43Dec 29, 2025Updated 6 months ago
LehengTHU / AlphaRec
View on GitHub
[ICLR 2025 Oral 🏆] The implementation of paper "Language Representations Can be What Recommenders Need: Findings and Potentials"
☆108May 16, 2025Updated last year
sail-sg / FlowReasoner
View on GitHub
☆145May 6, 2025Updated last year
uclanlp / OpenVLThinker
View on GitHub
OpenVLThinker: An Early Exploration to Vision-Language Reasoning via Iterative Self-Improvement
☆155May 25, 2026Updated 2 months ago
TIGER-AI-Lab / VL-Rethinker
View on GitHub
The official code of "VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning" [NeurIPS25]
☆190Jun 5, 2025Updated last year
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
hkust-nlp / RL-Verifier-Robustness
View on GitHub
From Accuracy to Robustness: A Study of Rule- and Model-based Verifiers in Mathematical Reasoning.
☆24Oct 7, 2025Updated 9 months ago
axon-rl / gem
View on GitHub
A Gym for Agentic LLMs
☆502Jan 21, 2026Updated 6 months ago
HappyPointer / MultiCBR
View on GitHub
☆17May 25, 2023Updated 3 years ago
MuyuenLP / AdaSteer
View on GitHub
EMNLP 25 Oral - AdaSteer: Your Aligned LLM is Inherently an Adaptive Jailbreak Defender
☆19Feb 7, 2026Updated 5 months ago
chenllliang / G1
View on GitHub
G1: Bootstrapping Perception and Reasoning Abilities of Vision-Language Model via Reinforcement Learning
☆103May 20, 2025Updated last year
sail-sg / Video-Next-Event-Prediction
View on GitHub
☆28Aug 9, 2025Updated 11 months ago
Lillianwei-h / MMIE
View on GitHub
[ICLR'25 Oral] MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models
☆35Nov 3, 2024Updated last year