alchemistyzz/PeRL

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/alchemistyzz/PeRL)

alchemistyzz / PeRL

[NeurIPS'25] The official code of "PeRL: Permutation-Enhanced Reinforcement Learning for Interleaved Vision-Language Reasoning"

☆30

Alternatives and similar repositories for PeRL

Users that are interested in PeRL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

zss02 / BiPS
View on GitHub
[CVPR 2026] See Less, See Right: Bi-directional Perceptual Shaping For Multimodal Reasoning
☆21Jun 28, 2026Updated 3 weeks ago
microsoft / PixelCraft
View on GitHub
[ICLR 2026] High-Fidelity Visual Reasoning on Structured Images
☆30Jul 17, 2026Updated last week
qishisuren123 / AnyCap
View on GitHub
A unified framework for controllable caption generation across images, videos, and audio. Supports multi-modal inputs and customizable ca…
☆54Jul 24, 2025Updated last year
kinesiatricssxilm14 / CodeRepoQA
View on GitHub
CodeRepoQA dataset
☆15Feb 19, 2025Updated last year
eternal8080 / MV-MATH
View on GitHub
Description for MV-MATH
☆15Jul 20, 2025Updated last year
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
Code4Agent / codeagent
View on GitHub
☆22Jul 16, 2024Updated 2 years ago
Eshe0922 / PILOT
View on GitHub
[ASE'23] When Less is Enough: Positive-Unlabeled Learning Model for Vulnerability Detection
☆16Jan 12, 2024Updated 2 years ago
MasterVito / DAC-RL
View on GitHub
Official Repo for DAC-RL: Training LLMs for Divide-and-Conquer Reasoning Elevates Test-Time Scalability
☆16Feb 26, 2026Updated 5 months ago
SihengLi99 / SEALONG
View on GitHub
Large Language Models Can Self-Improve in Long-context Reasoning
☆72Nov 24, 2024Updated last year
jiyt17 / ReDiff
View on GitHub
Codebase of 'From Denoising to Refining: A Corrective Framework for Vision-Language Diffusion Model'
☆45Jun 27, 2026Updated 3 weeks ago
Lucanyc / VISTA-Gym
View on GitHub
☆27Mar 17, 2026Updated 4 months ago
Hesse73 / RLVR-Directions
View on GitHub
Source Code for our ICLR'26 paper
☆17Feb 22, 2026Updated 5 months ago
DeepExperience / HyperEyes
View on GitHub
HyperEyes is a parallel multimodal search agent that fuses visual grounding and retrieval into a single atomic action, enabling concurren…
☆70May 23, 2026Updated 2 months ago
Utaotao / ProFit
View on GitHub
☆35Jan 20, 2026Updated 6 months ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
Aries-iai / Manifold_Steering
View on GitHub
The official implementation for "Mitigating Overthinking in Large Reasoning Models via Manifold Steering"
☆15May 29, 2025Updated last year
SihengLi99 / LLM-Honesty-Survey
View on GitHub
[2025-TMLR] A Survey on the Honesty of Large Language Models
☆66Dec 8, 2024Updated last year
lllllllllllll-llll / NSSADNN_IQA
View on GitHub
Pytorch version of IEEE Transactions on Multimedia 2019: "Naturalness-Aware Deep No-Reference Image Quality Assessment."
☆12Jun 30, 2020Updated 6 years ago
yangzhch6 / DARS
View on GitHub
The official implemention of "Depth-Breadth Synergy in RLVR: Unlocking LLM Reasoning Gains with Adaptive Exploration" (ICML 2026)
☆24Feb 4, 2026Updated 5 months ago
v587su / SimPy
View on GitHub
Source code for ISSTA'24 paper "AI Coders Are Among Us: Rethinking Programming Language Grammar Towards Efficient Code Generation"
☆12Oct 21, 2024Updated last year
xinyan-cxy / MINT-CoT
View on GitHub
[NeurIPS 2025] MINT-CoT: Enabling Interleaved Visual Tokens in Mathematical Chain-of-Thought Reasoning
☆107Sep 19, 2025Updated 10 months ago
penghao-wu / GUI_Reflection
View on GitHub
☆34Sep 19, 2025Updated 10 months ago
WikiChao / VisAH
View on GitHub
[CVPR 2025] Pytorch implementation of the paper "Learning to Highlight Audio by Watching Movies"
☆15Oct 1, 2025Updated 9 months ago
MasterVito / SwS
View on GitHub
Official Repo for SwS: A Weakness-driven Problem Synthesis Framework in RL for LLM Reasoning
☆42Nov 11, 2025Updated 8 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
cxcscmu / General-AgentBench
View on GitHub
Benchmark Test-Time Scaling of General LLM Agents
☆20Apr 14, 2026Updated 3 months ago
showlab / FocusUI
View on GitHub
[CVPR 2026] FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection
☆35Jun 7, 2026Updated last month
zhengkd95 / thu_poster_template
View on GitHub
A LaTeX template for academic posters with Tsinghua University logo
☆15Nov 18, 2022Updated 3 years ago
ltzheng / SimpleTIR
View on GitHub
[ICLR 2026] End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning
☆401Mar 30, 2026Updated 3 months ago
daochenzha / autosmote
View on GitHub
[CIKM 2022] Towards Automated Over-Sampling for Imbalanced Classification
☆10Mar 20, 2023Updated 3 years ago
FreedomIntelligence / MyPhoneBench
View on GitHub
MyPhoneBench: Do Phone-Use Agents Respect Your Privacy?
☆24Apr 3, 2026Updated 3 months ago
sail-sg / feedback-conditional-policy
View on GitHub
Code for "Language Models Can Learn from Verbal Feedback Without Scalar Rewards"
☆65Jan 5, 2026Updated 6 months ago
Multimedia-Analytics-Laboratory / dpdmd
View on GitHub
[ICML 2026] The offical code of Diversity-Preserved Distribution Matching Distillation for Fast Visual Synthesis
☆87Jun 2, 2026Updated last month
BitSecret / HyperGNet
View on GitHub
Geometric Problem Solving Integrating FormalGeo Symbolic System and Hypergraph Neural Network.
☆16Sep 23, 2025Updated 10 months ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
shaharl6000 / MoreDocsSameLen
View on GitHub
This repository contains code and datasets for our paper on the effects of document multiplicity while the context size is fixed in Retri…
☆18Mar 13, 2025Updated last year
UMass-Embodied-AGI / Mirage
View on GitHub
[CVPR 2026] Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens
☆294Aug 2, 2025Updated 11 months ago
microsoft / EpiCoder
View on GitHub
Implementation for "EpiCoder: Encompassing Diversity and Complexity in Code Generation" (ICML 2025)
☆27May 16, 2025Updated last year
weizhou-geek / DeepSRQ
View on GitHub
Blind quality assessment for image superresolution using deep two-stream convolutional networks, published in Information Sciences 2020
☆13Sep 19, 2021Updated 4 years ago
rootyJeon / Vision-aligned-Latent-Reasoning
View on GitHub
[ICML 2026] Official implementation of Vision-aligned Latent Reasoning for Multi-modal Large Language Model (VaLR)
☆20Apr 30, 2026Updated 2 months ago
zzfoutofspace / ATPO
View on GitHub
AT2PO: Agentic Turn-based Policy Optimization via Tree Search
☆22May 21, 2026Updated 2 months ago
cpf0079 / CSPT
View on GitHub
Source code for paper "Contrastive Self-Supervised Pre-Training for Video Quality Assessment".
☆12Jun 8, 2022Updated 4 years ago