AnonymousDUTAI / SREKCARC-IA-TUD
☆20Updated 8 months ago
Alternatives and similar repositories for SREKCARC-IA-TUD
Users that are interested in SREKCARC-IA-TUD are comparing it to the libraries listed below
- A tiny paper rating web☆36Updated last month
- Collection of Highlight papers☆39Updated 11 months ago
- WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation☆86Updated last month
- 📖 This is a repository for organizing papers, codes, and other resources related to unified multimodal models.☆188Updated this week
- A paper list for spatial reasoning☆60Updated last month
- [ICML2025] The code and data of Paper: Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation☆102Updated 6 months ago
- A collection of vision foundation models unifying understanding and generation.☆55Updated 4 months ago
- 【COLING 2025🔥】Code for the paper "Is Parameter Collision Hindering Continual Learning in LLMs?".☆33Updated 5 months ago
- Collection of recent methods on 3D Scene Generation from Text Description.☆14Updated 2 months ago
- [Arxiv Paper 2504.09130]: VisuoThink: Empowering LVLM Reasoning with Multimodal Tree Search☆16Updated 3 weeks ago
- A curated list of awesome papers on dataset reduction, including dataset distillation (dataset condensation) and dataset pruning (coreset…☆56Updated 4 months ago
- GenDoP: Auto-regressive Camera Trajectory Generation as a Director of Photography☆52Updated last month
- [World-Model-Survey-2024] Paper list and projects for World Model☆9Updated 6 months ago
- A python script for downloading huggingface datasets and models.☆19Updated last month
- A curated collection of resources, tools, and frameworks for developing GUI Agents.☆42Updated 3 weeks ago
- A Brief Review for Computer Architecture☆19Updated 3 weeks ago
- VCR-Bench: A Comprehensive Evaluation Framework for Video Chain-of-Thought Reasoning☆26Updated last month
- ☆83Updated last month
- Fundamentals of Digital Media Technology(04713901) | Peking University ECE Course Materials☆18Updated 3 years ago
- Code Release of Harmonizing Visual Representations for Unified Multimodal Understanding and Generation☆97Updated last month
- MC$^2$: Multi-concept Guidance for Customized Multi-concept Generation☆23Updated last year
- Notes from a rookie with a GPA below 3.5☆19Updated 2 years ago
- Official Implementation of VideoGen-of-Thought: Step-by-step generating multi-shot video with minimal manual intervention☆36Updated 3 weeks ago
- Official repository of DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models☆84Updated 8 months ago
- [CVPR 2025] OVO-Bench: How Far is Your Video-LLMs from Real-World Online Video Understanding?☆56Updated last month
- Collections of Papers and Projects for Multimodal Reasoning.☆104Updated 3 weeks ago
- RoboFactory: Exploring Embodied Agent Collaboration with Compositional Constraints☆44Updated last month
- ☆47Updated 5 months ago
- The official implementation of the paper "Exploring the Potential of Encoder-free Architectures in 3D LMMs"☆51Updated this week
- [NeurIPS 2024] The official implementation of the research paper "FreeLong : Training-Free Long Video Generation with SpectralBlend Temporal Atten…☆44Updated 2 months ago