AnonymousDUTAI / SREKCARC-IA-TUD
☆20 · Updated 10 months ago
Alternatives and similar repositories for SREKCARC-IA-TUD
Users interested in SREKCARC-IA-TUD are comparing it to the libraries listed below.
- A collection of vision foundation models unifying understanding and generation. ☆57 · Updated 7 months ago
- A framework for a unified personalized model, achieving mutual enhancement between personalized understanding and generation. Demonstrating… ☆113 · Updated last month
- Survey: https://arxiv.org/pdf/2507.20198 ☆69 · Updated this week
- A Vue-based project page template for academic papers. (in development) https://junyaohu.github.io/academic-project-page-template-vue ☆277 · Updated last month
- Official implementation of MC-LLaVA. ☆130 · Updated 2 months ago
- The official code for the paper: LLaVA-Scissor: Token Compression with Semantic Connected Components for Video LLMs ☆97 · Updated last month
- ☆37 · Updated last week
- Official repo of Omni-R1: Reinforcement Learning for Omnimodal Reasoning via Two-System Collaboration ☆73 · Updated 2 months ago
- ComplexBench-Edit: Benchmarking Complex Instruction-Driven Image Editing via Compositional Dependencies ☆16 · Updated last month
- Reinforcing Spatial Reasoning in Vision-Language Models with Interwoven Thinking and Visual Drawing ☆55 · Updated 2 weeks ago
- 📖 A repository for organizing papers, code, and other resources related to unified multimodal models. ☆268 · Updated last week
- A tiny paper-rating web app ☆39 · Updated 4 months ago
- A paper list for spatial reasoning ☆129 · Updated 2 months ago
- Fundamentals of Digital Media Technology (04713901) | Peking University ECE course materials ☆18 · Updated 3 years ago
- [ICML2025] The code and data of the paper: Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation ☆117 · Updated 9 months ago
- Official implementation of VideoGen-of-Thought: step-by-step generation of multi-shot videos with minimal manual intervention ☆39 · Updated 3 months ago
- TokLIP: Marry Visual Tokens to CLIP for Multimodal Comprehension and Generation ☆103 · Updated 2 months ago
- 【COLING 2025🔥】Code for the paper "Is Parameter Collision Hindering Continual Learning in LLMs?" ☆35 · Updated 8 months ago
- Official repository of the paper: Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing ☆79 · Updated 3 weeks ago
- A collection of the world's best computer vision labs and lecture materials. ☆14 · Updated 5 months ago
- WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation ☆143 · Updated this week
- [ICCV2025] Code release of Harmonizing Visual Representations for Unified Multimodal Understanding and Generation ☆151 · Updated 2 months ago
- ☆31 · Updated last month
- ☆59 · Updated last month
- Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations ☆131 · Updated last month
- ACTIVE-O3: Empowering Multimodal Large Language Models with Active Perception via GRPO ☆68 · Updated 2 months ago
- [CVPR2025] BOLT: Boost Large Vision-Language Model Without Training for Long-form Video Understanding ☆27 · Updated 4 months ago
- SpaceR: The first MLLM empowered by SG-RLVR for video spatial reasoning ☆71 · Updated last month
- TStar: a unified temporal search framework for long-form video question answering ☆59 · Updated 4 months ago
- ☆99 · Updated 4 months ago