Monad-Cube / CVPR-2024-Highlight-Oral
Collection of Highlight papers
☆37 Updated 11 months ago
Alternatives and similar repositories for CVPR-2024-Highlight-Oral:
Users who are interested in CVPR-2024-Highlight-Oral are comparing it to the repositories listed below
- ☆21 Updated 2 years ago
- PyTorch implementation of GaussianToken: An Effective Image Tokenizer with 2D Gaussian Splatting ☆78 Updated 3 weeks ago
- A collection of vision foundation models unifying understanding and generation. ☆51 Updated 3 months ago
- Curated list of recent visual autoregressive (VAR) modeling works ☆30 Updated last month
- [AAAI 2025] Pre-Training a Density-Aware Pose Transformer for Robust LiDAR-based 3D Human Pose Estimation ☆25 Updated last week
- A paper list for spatial reasoning ☆57 Updated 2 weeks ago
- Spatial-R1: The first MLLM trained using GRPO for spatial reasoning in videos ☆25 Updated last week
- [CVPR 2025 (Oral)] Open implementation of "RandAR" ☆118 Updated last month
- GenDoP: Auto-regressive Camera Trajectory Generation as a Director of Photography ☆43 Updated 2 weeks ago
- [NeurIPS 2023] Implementation of the paper: Explore In-Context Learning for 3D Point Cloud Understanding ☆69 Updated 4 months ago
- Code release of Harmonizing Visual Representations for Unified Multimodal Understanding and Generation ☆69 Updated last week
- Official implementation of "Can Language Understand Depth?" ☆81 Updated 2 years ago
- Code for the paper "CoReS: Orchestrating the Dance of Reasoning and Segmentation" ☆15 Updated last month
- [CVPR 2024] Exploiting Diffusion Prior for Generalizable Dense Prediction ☆73 Updated last year
- [CVPR 2025 Highlight🔥] Official code repository for "Inst3D-LMM: Instance-Aware 3D Scene Understanding with Multi-modal Instruction Tuni… ☆76 Updated this week
- Frequency Autoregressive Image Generation with Continuous Tokens ☆56 Updated last month
- Official repository of DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models ☆84 Updated 7 months ago
- ECCV 2024 paper template ☆50 Updated last year
- [CVPR 2024] Official implementation of "Open-Vocabulary Segmentation with Semantic-Assisted Calibration" ☆70 Updated 7 months ago
- [SIGGRAPH 2025] Official implementation of the paper "Drag Your Gaussian: Effective Drag-Based Editing with Score Distillation for 3D Gaussi… ☆63 Updated this week
- Official implementation of the CVPR 2024 paper: CLIP as RNN: Segment Countless Visual Concepts without Training Endeavor ☆104 Updated 10 months ago
- [CVPR'2022, TPAMI'2024] LAVT: Language-Aware Vision Transformer for Referring Segmentation ☆20 Updated 3 months ago
- [CVPR 2024] The official implementation of the paper "Sculpting Holistic 3D Representation in Contrastive Language-Image-3D Pre-training" ☆35 Updated last year
- The official implementation of JM3D. ☆30 Updated this week
- Accepted by CVPR 2024 ☆33 Updated 11 months ago
- Official code for 'TAR3D: Creating High-Quality 3D Assets via Next-Part Prediction' ☆56 Updated 4 months ago
- Code for our CVPR 2024 paper, DiffusionMTL: Learning Multi-Task Denoising Diffusion Model from Partially Annotated Data ☆55 Updated last year
- [ICCV 2023] CLIP2Point: Transfer CLIP to Point Cloud Classification with Image-Depth Pre-training ☆115 Updated last year
- ☆45 Updated last week
- [CVPR 2023] EDA: Explicit Text-Decoupling and Dense Alignment for 3D Visual Grounding ☆120 Updated last year