AnonymousDUTAI / SREKCARC-IA-TUDLinks
☆20Updated 9 months ago
Alternatives and similar repositories for SREKCARC-IA-TUD
Users that are interested in SREKCARC-IA-TUD are comparing it to the libraries listed below
Sorting:
- A tiny paper rating web☆38Updated 3 months ago
- Fundamentals of Digital Media Technology(04713901) | Peking University ECE Course Materials☆18Updated 3 years ago
- 【COLING 2025🔥】Code for the paper "Is Parameter Collision Hindering Continual Learning in LLMs?".☆34Updated 6 months ago
- WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation☆120Updated 2 weeks ago
- ☆32Updated 2 weeks ago
- Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations☆48Updated this week
- ComplexBench-Edit: Benchmarking Complex Instruction-Driven Image Editing via Compositional Dependencies☆14Updated last week
- A collection of vision foundation models unifying understanding and generation.☆55Updated 5 months ago
- Personal Transformer models training library☆22Updated this week
- Collection of Highlight papers☆41Updated last year
- A curated list of awesome papers on dataset reduction, including dataset distillation (dataset condensation) and dataset pruning (coreset…☆55Updated 5 months ago
- Official Implementation of VideoGen-of-Thought: Step-by-step generating multi-shot video with minimal manual intervention☆39Updated 2 months ago
- GenDoP: Auto-regressive Camera Trajectory Generation as a Director of Photography☆64Updated 3 weeks ago
- paper list, tutorial, and nano code snippet for Diffusion Large Language Models.☆75Updated this week
- [TMLR 2025] Efficient Diffusion Models: A Survey☆66Updated 2 weeks ago
- A paper list for spatial reasoning☆94Updated 2 weeks ago
- ☆100Updated 3 months ago
- SpaceR: The first MLLM empowered by SG-RLVR for video spatial reasoning☆63Updated 2 weeks ago
- [ICML2025] The code and data of Paper: Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation☆113Updated 8 months ago
- ViewSpatial-Bench:Evaluating Multi-perspective Spatial Localization in Vision-Language Models☆48Updated last month
- [NeurIPS 2024] The official implement of research paper "FreeLong : Training-Free Long Video Generation with SpectralBlend Temporal Atten…☆45Updated 4 months ago
- [ICCV2025]Code Release of Harmonizing Visual Representations for Unified Multimodal Understanding and Generation☆126Updated last month
- Collection of recent methods on 3D Scene Generation from Text Description.☆15Updated 3 months ago
- 📖 This is a repository for organizing papers, codes, and other resources related to unified multimodal models.☆246Updated this week
- A vue-based project page template for academic papers. (in development) https://junyaohu.github.io/academic-project-page-template-vue☆268Updated this week
- [CVPR 2025 (Oral)] Open implementation of "RandAR"☆175Updated 3 months ago
- [Arxiv Paper 2504.09130]: VisuoThink: Empowering LVLM Reasoning with Multimodal Tree Search☆19Updated 2 months ago
- BoardCaster 是 CSBAOYAN 相关的数据库,使用JSON文件管理格式化的保研相关信息,并通过 Issue 进行更新以简化参与开源流程的难度。☆19Updated this week
- [ICCV 25]SpectralAR: Spectral Autoregressive Visual Generation☆25Updated 2 weeks ago
- VCR-Bench: A Comprehensive Evaluation Framework for Video Chain-of-Thought Reasoning☆31Updated 2 months ago