AnonymousDUTAI / SREKCARC-IA-TUD
☆19 · Updated 10 months ago
Alternatives and similar repositories for SREKCARC-IA-TUD
Users that are interested in SREKCARC-IA-TUD are comparing it to the libraries listed below
- A collection of vision foundation models unifying understanding and generation. ☆57 · Updated 6 months ago
- A vue-based project page template for academic papers. (in development) https://junyaohu.github.io/academic-project-page-template-vue ☆274 · Updated last week
- ☆55 · Updated last week
- ComplexBench-Edit: Benchmarking Complex Instruction-Driven Image Editing via Compositional Dependencies ☆16 · Updated 3 weeks ago
- Official implementation of MC-LLaVA. ☆32 · Updated last month
- 【COLING 2025🔥】Code for the paper "Is Parameter Collision Hindering Continual Learning in LLMs?". ☆34 · Updated 7 months ago
- Official Repo of Omni-R1: Reinforcement Learning for Omnimodal Reasoning via Two-System Collaboration ☆69 · Updated last month
- (ICLR 2025 spotlight) "Poison-splat: Computation Cost Attack on 3D Gaussian Splatting" ☆56 · Updated 5 months ago
- [ICML2025] The code and data of Paper: Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation ☆114 · Updated 8 months ago
- ☆32 · Updated last month
- ViewSpatial-Bench: Evaluating Multi-perspective Spatial Localization in Vision-Language Models ☆52 · Updated last month
- List of diffusion related active submissions on OpenReview for ICLR 2025. ☆32 · Updated 8 months ago
- A tiny paper rating web ☆38 · Updated 4 months ago
- A Collection of AIGC Research Groups ☆75 · Updated 2 weeks ago
- Reinforcing Spatial Reasoning in Vision-Language Models with Interwoven Thinking and Visual Drawing ☆49 · Updated 3 weeks ago
- [ICML 2025] Streamline Without Sacrifice - Squeeze out Computation Redundancy in LMM ☆19 · Updated last month
- Official repository of DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models ☆85 · Updated 10 months ago
- A paper list for spatial reasoning ☆121 · Updated last month
- [CVPR2025] BOLT: Boost Large Vision-Language Model Without Training for Long-form Video Understanding ☆23 · Updated 3 months ago
- Collection of Highlight papers ☆41 · Updated last year
- SpaceR: The first MLLM empowered by SG-RLVR for video spatial reasoning ☆69 · Updated last week
- [CVPR 2025] Science-T2I: Addressing Scientific Illusions in Image Synthesis ☆59 · Updated 2 months ago
- Provide .bst files for NeurIPS latex template ☆49 · Updated 3 months ago
- [CVPR 2025 (Oral)] Open implementation of "RandAR" ☆178 · Updated this week
- A framework for unified personalized model, achieving mutual enhancement between personalized understanding and generation. Demonstrating… ☆48 · Updated 2 weeks ago
- The official code for the paper: LLaVA-Scissor: Token Compression with Semantic Connected Components for Video LLMs ☆91 · Updated 2 weeks ago
- [ICCV2025] Code Release of Harmonizing Visual Representations for Unified Multimodal Understanding and Generation ☆141 · Updated last month
- Official code for ICLR 2024 paper "Do Generated Data Always Help Contrastive Learning?" ☆31 · Updated last year
- ☆31 · Updated 2 months ago
- WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation ☆129 · Updated last month