SynthRL: Scaling Visual Reasoning with Verifiable Data Synthesis
☆68Jul 24, 2025Updated 7 months ago
Alternatives and similar repositories for SynthRL
Users that are interested in SynthRL are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2025] The implementation of paper "On Reasoning Strength Planning in Large Reasoning Models"☆30Jul 6, 2025Updated 7 months ago
- ☆33Jun 24, 2025Updated 8 months ago
- ☆53Feb 14, 2026Updated 2 weeks ago
- [ICLR'25 Oral] MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models☆35Nov 3, 2024Updated last year
- [ICLR 2025] Weighted-Reward Preference Optimization for Implicit Model Fusion☆14Mar 17, 2025Updated 11 months ago
- [KDD'25] LLM2Rec: Large Language Models Are Powerful Embedding Models for Sequential Recommendation.☆58Sep 6, 2025Updated 5 months ago
- This repo contains code for the paper "Both Text and Images Leaked! A Systematic Analysis of Data Contamination in Multimodal LLM"☆18Oct 17, 2025Updated 4 months ago
- The implementation of paper "Leveraging Multimodal Features and Item-level User Feedback for Bundle Construction", WSDM'24.☆17Oct 30, 2025Updated 4 months ago
- [ACL 2025 Findings] Text2World: Benchmarking Large Language Models for Symbolic World Model Generation☆28Feb 25, 2025Updated last year
- The official repository of 'Unnatural Language Are Not Bugs but Features for LLMs'☆24May 20, 2025Updated 9 months ago
- ☆20Apr 16, 2025Updated 10 months ago
- [ICLR 2026] Fast-Slow Toolpath Agent with Subroutine Mining for Efficient Multi-turn Image Editing☆29Feb 6, 2026Updated 3 weeks ago
- A curated list of papers, tools, and resources on Multi-Token Prediction (MTP) and related techniques in Large Language Models (LLMs), Sp…☆45Feb 7, 2026Updated 3 weeks ago
- R1-like Computer-use Agent☆89Mar 21, 2025Updated 11 months ago
- [ICLR 2025] Drop-Upcycling: Training Sparse Mixture of Experts with Partial Re-initialization☆25Oct 5, 2025Updated 4 months ago
- [ICLR 2026] The implementation of paper "AlphaSteer: Learning Refusal Steering with Principled Null-Space Constraint"☆40Nov 20, 2025Updated 3 months ago
- Official implementation for "ALI-Agent: Assessing LLMs'Alignment with Human Values via Agent-based Evaluation"☆21Jan 31, 2026Updated last month
- [NeurIPS25] Official Implementation (Pytorch) of "DeepVideo-R1"☆31Feb 22, 2026Updated last week
- implementation of dualformer☆24Mar 1, 2025Updated last year
- Code for "[COLM'25] RepoST: Scalable Repository-Level Coding Environment Construction with Sandbox Testing"☆23Mar 18, 2025Updated 11 months ago
- TPDiff: Temporal Pyramid Video Diffusion Model☆25Mar 13, 2025Updated 11 months ago
- [ACM MM 2025] MLLMs for Aesthetics Reasoning☆23Jan 5, 2026Updated last month
- [NeurIPS 2024] The implementation of paper "On Softmax Direct Preference Optimization for Recommendation"☆96Nov 29, 2024Updated last year
- ☆64Feb 4, 2026Updated 3 weeks ago
- [ICLR 2025 Oral 🏆] The implementation of paper "Language Representations Can be What Recommenders Need: Findings and Potentials"☆97May 16, 2025Updated 9 months ago
- The implementation of paper "EliMRec: Eliminating single-modal bias in multimedia recommendation", MM'22.☆22Dec 7, 2023Updated 2 years ago
- Code for "Language Models Can Learn from Verbal Feedback Without Scalar Rewards"☆59Jan 5, 2026Updated last month
- A curated list of research papers, resources, and advancements on Diffusion Cache and related efficient diffusion model acceleration tech…☆73Nov 4, 2025Updated 3 months ago
- ☆34Aug 18, 2025Updated 6 months ago
- SFT+RL boosts multimodal reasoning☆46Jun 27, 2025Updated 8 months ago
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w…☆12Jun 28, 2025Updated 8 months ago
- [MM 2025] Towards Modality Generalization: A Benchmark and Prospective Analysis☆28May 22, 2025Updated 9 months ago
- A Text2SQL benchmark for evaluation of Large Language Models☆41Feb 24, 2026Updated last week
- The official github repo for "Training Optimal Large Diffusion Language Models", the first-ever large-scale diffusion language models sca…☆45Nov 6, 2025Updated 3 months ago
- Dateset Reset Policy Optimization☆31Apr 12, 2024Updated last year
- KV Cache Steering for Inducing Reasoning in Small Language Models☆46Jul 24, 2025Updated 7 months ago
- Official code of "RoboOmni: Proactive Robot Manipulation in Omni-modal Context"☆89Nov 17, 2025Updated 3 months ago
- [NeurIPS ENLSP Workshop'24] CSKV: Training-Efficient Channel Shrinking for KV Cache in Long-Context Scenarios☆16Oct 18, 2024Updated last year
- ☆18Jun 10, 2025Updated 8 months ago