wangjiyuan9 / ECCV2024-Full-PaperListLinks
We used a web scraper to obtain all the papers from ECCV that have not yet been officially announced, making them available for those who need to read the latest papers.
☆21Updated 11 months ago
Alternatives and similar repositories for ECCV2024-Full-PaperList
Users that are interested in ECCV2024-Full-PaperList are comparing it to the libraries listed below
Sorting:
- [MICCAI 24] The official code repository for paper "FairDiff: Fair Segmentation with Point-Image Diffusion".☆58Updated 4 months ago
- ☆26Updated 4 months ago
- [CVPR 2025 Highlight🔥] Official code repository for "Inst3D-LMM: Instance-Aware 3D Scene Understanding with Multi-modal Instruction Tuni…☆102Updated last week
- ☆46Updated 4 months ago
- A curated list of resources focused on Visual AutoRegressive Modeling, makes GPT-style AR models surpass diffusion transformers in image …☆38Updated 5 months ago
- The code for paper 'Learning from Videos for 3D World: Enhancing MLLMs with 3D Vision Geometry Priors'☆94Updated last week
- The official repository for paper "MLLMs Need 3D-Aware Representation Supervision for Scene Understanding"☆79Updated last month
- Official implementation of Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence☆315Updated last month
- A collection of vision foundation models unifying understanding and generation.☆57Updated 7 months ago
- [ACM MM2024] Official implementation of the paper "GOI: Find 3D Gaussians of Interest with an Optimizable Open-vocabulary Semantic-space …☆66Updated 9 months ago
- [ECCV2024] Any2Point: Empowering Any-modality Large Models for Efficient 3D Understanding☆123Updated last year
- [AAAI 2025] GFlow: Recovering 4D World from Monocular Video☆48Updated 3 months ago
- ☆99Updated 4 months ago
- [CVPR 2025] The code for paper ''Video-3D LLM: Learning Position-Aware Video Representation for 3D Scene Understanding''.☆143Updated 2 months ago
- [CVPR 2024] Customize your NeRF: Adaptive Source Driven 3D Scene Editing via Local-Global Iterative Training☆41Updated last year
- A comprehensive surevy on Multimodal Models in 3D☆64Updated last year
- [Arxiv 25'] IR3D-Bench: Evaluating Vision-Language Model Scene Understanding as Agentic Inverse Rendering☆35Updated last month
- Code for "Open Vocabulary Monocular 3D Object Detection"☆60Updated 3 months ago
- Pytorch implementation of GaussianToken: An Effective Image Tokenizer with 2D Gaussian Splatting☆87Updated 4 months ago
- Official implementation of "Ross3D: Reconstructive Visual Instruction Tuning with 3D-Awareness".☆47Updated 2 weeks ago
- Official codes for paper: 3DGS-DET: Empower 3D Gaussian Splatting with Boundary Guidance and Box-Focused Sampling for 3D Object Detecti…☆147Updated 8 months ago
- Generative World Explorer☆153Updated last month
- ☆13Updated 8 months ago
- [CVPR 2025] A Unified Image-Dense Annotation Generation Model for Underwater Scenes☆33Updated 4 months ago
- [CVPR 2025 (Oral)] Open implementation of "RandAR"☆186Updated 3 weeks ago
- Code of our CVPR2024 paper - DiffusionMTL: Learning Multi-Task Denoising Diffusion Model from Partially Annotated Data☆58Updated last year
- [CVPR 2025] GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding☆156Updated last month
- (ECCV'24) Official Implementation of SCP-Diff: Photo-Realistic Semantic Image Synthesis with Spatial-Categorical Joint Prior.☆13Updated 10 months ago
- [CVPR 2025] Official repository of the paper "Mask-Adapter: The Devil is in the Masks for Open-Vocabulary Segmentation"☆106Updated last month
- A comprehensive list of papers investigating physical cognition in video generation, including papers, codes, and related websites.☆148Updated 3 weeks ago