Monad-Cube / CVPR-2024-Highlight-Oral
Collection of Highlight papers
☆39Updated 11 months ago
Alternatives and similar repositories for CVPR-2024-Highlight-Oral
Users that are interested in CVPR-2024-Highlight-Oral are comparing it to the libraries listed below
Sorting:
- Code of our CVPR2024 paper - DiffusionMTL: Learning Multi-Task Denoising Diffusion Model from Partially Annotated Data☆56Updated last year
- ☆20Updated 2 months ago
- A collection of vision foundation models unifying understanding and generation.☆55Updated 4 months ago
- The repository contains the official implementation of "Self-Calibrated CLIP for Training-Free Open-Vocabulary Segmentation"☆40Updated 2 months ago
- [CVPR 2025 (Oral)] Open implementation of "RandAR"☆140Updated last month
- Pytorch implementation of GaussianToken: An Effective Image Tokenizer with 2D Gaussian Splatting☆83Updated last month
- Curated list of recent visual autoregressive (VAR) modeling works☆30Updated 2 months ago
- [CVPR 2024] Exploiting Diffusion Prior for Generalizable Dense Prediction☆76Updated last year
- Code Release of Harmonizing Visual Representations for Unified Multimodal Understanding and Generation☆97Updated last month
- Official Code for 'TAR3D: Creating High-Quality 3D Assets via Next-Part Prediction'☆56Updated 4 months ago
- This is the official implementation for ControlVAR.☆107Updated 5 months ago
- [AAAI 2025] Pre-Training a Density-Aware Pose Transformer for Robust LiDAR-based 3D Human Pose Estimation☆26Updated last month
- [CVPR 2024] Official implementation of "Universal Segmentation at Arbitrary Granularity with Language Instruction"☆86Updated last year
- ☆21Updated 2 years ago
- [CVPR 2024] The official implementation of paper "Sculpting Holistic 3D Representation in Contrastive Language-Image-3D Pre-training"☆35Updated last year
- code for the paper "CoReS: Orchestrating the Dance of Reasoning and Segmentation"☆15Updated 2 months ago
- Frequency Autoregressive Image Generation with Continuous Tokens☆67Updated 2 months ago
- Official implementation for "Diffusion Model is Secretly a Training-free Open Vocabulary Semantic Segmenter"☆40Updated last year
- ☆28Updated 11 months ago
- Official implementation of "STAR: Scale-wise Text-to-image generation via Auto-Regressive representations"☆32Updated 2 months ago
- Spatial-R1: The first MLLM trained using GRPO for spatial reasoning in videos☆35Updated last week
- ECCV2024, LAPT: Label-driven Automated Prompt Tuning for OOD Detection with Vision-Language Models☆17Updated 9 months ago
- The official implementation of The paper "Exploring the Potential of Encoder-free Architectures in 3D LMMs"☆51Updated this week
- [NeurIPS2024 Spotlight] The official implementation of MambaTree: Tree Topology is All You Need in State Space Model☆91Updated 11 months ago
- [CVPR 2024] The repository contains the official implementation of "Open-Vocabulary Segmentation with Semantic-Assisted Calibration"☆70Updated 7 months ago
- [ICLR'25] Reconstructive Visual Instruction Tuning☆83Updated last month
- [ECCV2024]The official implementation of the DiffPNG paper in PyTorch.☆12Updated 7 months ago
- A curated list of resources focused on Visual AutoRegressive Modeling, makes GPT-style AR models surpass diffusion transformers in image …☆31Updated 2 months ago
- Posters for all 235 cvpr2023 highlight papers☆30Updated 10 months ago
- [NeurIPS2023] Implementation of the paper: Explore In-Context Learning for 3D Point Cloud Understanding☆69Updated 5 months ago