AnonymousDUTAI / SREKCARC-IA-TUDLinks
☆20Updated last year
Alternatives and similar repositories for SREKCARC-IA-TUD
Users that are interested in SREKCARC-IA-TUD are comparing it to the libraries listed below
Sorting:
- [arxiv 2025] RewardMap: Tackling Sparse Rewards in Fine-grained Visual Reasoning via Multi-Stage Reinforcement Learning☆33Updated last month
- We introduce 'Thinking with Video', a new paradigm leveraging video generation for multimodal reasoning. Our VideoThinkBench shows that S…☆199Updated last week
- Thinking with Videos from Open-Source Priors. We reproduce chain-of-frames visual reasoning by fine-tuning open-source video models. Give…☆185Updated last month
- ☆60Updated 4 months ago
- ☆134Updated last week
- The official code for the paper: LLaVA-Scissor: Token Compression with Semantic Connected Components for Video LLMs☆113Updated 4 months ago
- A tiny paper rating web☆38Updated 8 months ago
- A framework for unified personalized model, achieving mutual enhancement between personalized understanding and generation. Demonstrating…☆124Updated last month
- Reinforcing Spatial Reasoning in Vision-Language Models with Interwoven Thinking and Visual Drawing☆79Updated 4 months ago
- Official implementation of MC-LLaVA.☆139Updated 2 weeks ago
- Fundamentals of Digital Media Technology(04713901) | Peking University ECE Course Materials☆23Updated 3 years ago
- Survey: https://arxiv.org/pdf/2507.20198☆218Updated last month
- A collection of vision foundation models unifying understanding and generation.☆59Updated 10 months ago
- This is a collection of recent papers on reasoning in video generation models.☆38Updated this week
- This repository provides a comprehensive library for parallel training and LoRA algorithm implementations, supporting multiple parallel s…☆52Updated this week
- BranchGRPO: Stable and Efficient GRPO with Structured Branching in Diffusion Models☆38Updated last month
- 【COLING 2025🔥】Code for the paper "Is Parameter Collision Hindering Continual Learning in LLMs?".☆36Updated 11 months ago
- Assessing Context-Aware Creative Intelligence in MLLMs☆23Updated 4 months ago
- VCode: SVG as Symbolic Visual Representation☆111Updated last week
- ViewSpatial-Bench:Evaluating Multi-perspective Spatial Localization in Vision-Language Models☆66Updated 6 months ago
- This is an early exploration to introduce Interleaving Reasoning to Text-to-image Generation field and achieve the SoTA benchmark perform…☆76Updated 2 months ago
- ☆55Updated 3 months ago
- Collection of Highlight papers☆42Updated last year
- A collection of Video Agent (Think-with-Videos) papers☆48Updated this week
- Code for paper: Reinforced Vision Perception with Tools☆62Updated last month
- A vue-based project page template for academic papers. (in development) https://junyaohu.github.io/academic-project-page-template-vue☆297Updated 4 months ago
- A Benchmark for Evaluating MLLMs' Geometry Performance on Long-Step Problems Requiring Auxiliary Lines☆30Updated 2 months ago
- The official repository of our paper "Reinforcing Video Reasoning with Focused Thinking"☆29Updated 5 months ago
- [NeurIPS 2025] VeriThinker: Learning to Verify Makes Reasoning Model Efficient☆62Updated 2 months ago