gyhdog99/RACRO2

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/gyhdog99/RACRO2)

gyhdog99 / RACRO2

Official PyTorch implementation of RACRO (https://www.arxiv.org/abs/2506.04559)

☆19

Alternatives and similar repositories for RACRO2

Users that are interested in RACRO2 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

mahtabbigverdi / Aurora
View on GitHub
☆12Dec 4, 2024Updated last year
pixeli99 / TrackDiffusion
View on GitHub
[WACV2025] Official PyTorch implementation of TrackDiffusion (https://arxiv.org/abs/2312.00651)
☆81Jun 26, 2024Updated 2 years ago
OpenGVLab / VRBench
View on GitHub
[ICCV 2025] A Benchmark for Multi-Step Reasoning in Long Narrative Videos
☆28Jun 4, 2026Updated last month
Jingfeng0705 / LIFT
View on GitHub
The official repo for LIFT: Language-Image Alignment with Fixed Text Encoders
☆43Jun 10, 2025Updated last year
ml-research / deictic-segment-anything
View on GitHub
Segment Anything with Deictic Prompting
☆27May 13, 2025Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
MasterVito / DAC-RL
View on GitHub
Official Repo for DAC-RL: Training LLMs for Divide-and-Conquer Reasoning Elevates Test-Time Scalability
☆16Feb 26, 2026Updated 5 months ago
drilistbox / FlashOCC_on_UniOcc_and_RenderOCC
View on GitHub
☆26Feb 2, 2024Updated 2 years ago
TIGER-AI-Lab / VideoEval-Pro
View on GitHub
VideoEval-Pro: Robust and Realistic Long Video Understanding Evaluation [TMLR26]
☆15Jun 1, 2026Updated last month
mbzuai-oryx / Video-R2
View on GitHub
Video-R2: Reinforcing Consistent and Grounded Reasoning in Multimodal Language Models
☆19Jan 21, 2026Updated 6 months ago
Ranking-VMR / SPR
View on GitHub
☆13Jun 11, 2026Updated last month
RUCAIBox / Event-Bench
View on GitHub
Official code of *Towards Event-oriented Long Video Understanding*
☆12Jul 26, 2024Updated last year
kxfan2002 / SophiaVL-R1
View on GitHub
SophiaVL-R1: Reinforcing MLLMs Reasoning with Thinking Reward
☆94Aug 8, 2025Updated 11 months ago
hkust-nlp / model-task-align-rl
View on GitHub
[ICLR 26] The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions".
☆18Feb 9, 2026Updated 5 months ago
Gabesarch / grounded-rl
View on GitHub
☆133Jul 22, 2025Updated last year
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
Jiaxing-star / LLaVA-Octopus
View on GitHub
☆11Jan 8, 2025Updated last year
LunarShen / DsicoVLA
View on GitHub
[CVPR 2025] DiscoVLA: Discrepancy Reduction in Vision, Language, and Alignment for Parameter-Efficient Video-Text Retrieval
☆22Jun 23, 2025Updated last year
LAION-AI / Conditional-Pretraining-of-Large-Language-Models
View on GitHub
☆37May 7, 2023Updated 3 years ago
DAMO-NLP-SG / LLM-Multilingual-Knowledge-Boundaries
View on GitHub
[ACL 2025] Analyzing LLMs' Multilingual Knowledge Boundary Cognition Across Languages Through the Lens of Internal Representations
☆19Oct 18, 2025Updated 9 months ago
kaiyuhwang / MLLM-Survey
View on GitHub
The paper list of multilingual pre-trained models (Continual Updated).
☆25Jun 18, 2024Updated 2 years ago
maifoundations / GCoT
View on GitHub
Bootstrapping Grounded Chain-of-Thought in Multimodal LLMs for Data-Efficient Model Adaptation
☆15Aug 11, 2025Updated 11 months ago
343gltysprk / ovow
View on GitHub
☆39Nov 25, 2025Updated 8 months ago
DAMO-NLP-SG / LongPO
View on GitHub
[ICLR 2025] LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization
☆43Feb 27, 2025Updated last year
tmlr-group / TriMem
View on GitHub
[arXiv:2605.19952] "Rethinking How to Remember: Beyond Atomic Facts in Lifelong LLM Agent Memory"
☆16May 20, 2026Updated 2 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
DongSky / MR-GDINO
View on GitHub
☆54Dec 23, 2024Updated last year
junkangwu / Dr_DPO
View on GitHub
[ICLR 2025] Official code of "Towards Robust Alignment of Language Models: Distributionally Robustifying Direct Preference Optimization"
☆19Jun 1, 2024Updated 2 years ago
MediaBrain-SJTU / MoLA
View on GitHub
☆21Jul 19, 2024Updated 2 years ago
Jiaxin-Wen / MisleadLM
View on GitHub
Official Code for our paper: "Language Models Learn to Mislead Humans via RLHF""
☆20Oct 11, 2024Updated last year
jxjessieli / contextual-distortion-parser
View on GitHub
[ACL 2023] Contextual Distortion Reveals Constituency: Mask Language Models are Implicit Parsers.
☆14Jun 3, 2023Updated 3 years ago
pengshuai-rin / MultiMath
View on GitHub
MultiMath: Bridging Visual and Mathematical Reasoning for Large Language Models
☆33Jan 22, 2025Updated last year
SparklingH / BloomScene
View on GitHub
BloomScene: Lightweight Structured 3D Gaussian Splatting for Crossmodal Scene Generation (AAAI 2025)
☆19Jan 13, 2025Updated last year
HuiGuanLab / RaTSG
View on GitHub
This is a repository contains the implementation of our NeurIPS'24 paper "Temporal Sentence Grounding with Relevance Feedback in Videos"
☆13Aug 22, 2025Updated 11 months ago
TIGER-AI-Lab / VisualWebInstruct
View on GitHub
The official repo for "VisualWebInstruct: Scaling up Multimodal Instruction Data through Web Search" [EMNLP25]
☆39Feb 1, 2026Updated 5 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
harrylin-hyl / SGROD
View on GitHub
☆14Sep 6, 2024Updated last year
hustvl / GroundingSuite
View on GitHub
[ICCV 2025] GroundingSuite: Measuring Complex Multi-Granular Pixel Grounding
☆77Jun 26, 2025Updated last year
saferlhf-v / saferlhf-v
View on GitHub
☆23Jun 16, 2025Updated last year
ZhenglinZhou / DreamDPO
View on GitHub
[ICML 2025] DreamDPO: Aligning Text-to-3D Generation with Human Preferences via Direct Preference Optimization
☆22May 24, 2025Updated last year
NVlabs / PerVLBenchmark
View on GitHub
☆11Jul 31, 2022Updated 3 years ago
TMIU / iTFA
View on GitHub
Incremental Few-Shot Object Detection via Simple Fine-Tuning Approach (ICRA 2023)
☆10Feb 14, 2023Updated 3 years ago
nusnlp / d2vlm
View on GitHub
[ICCV 2025] Factorized Learning for Temporally Grounded Video-Language Models
☆24Apr 18, 2026Updated 3 months ago