zh460045050 / V2L-Tokenizer
☆104Updated 2 months ago
Related projects: ⓘ
- [ECCV 2024] Official PyTorch implementation of DreamLIP: Language-Image Pre-training with Long Captions☆85Updated last week
- This repository is the official implementation of our Autoregressive Pretraining with Mamba in Vision☆53Updated 3 months ago
- [BMVC 2024] PlainMamba: Improving Non-hierarchical Mamba in Visual Recognition☆67Updated last month
- ☆106Updated 3 months ago
- The official implementation of "Adapter is All You Need for Tuning Visual Tasks".☆67Updated 3 weeks ago
- ☆71Updated last year
- [ICCV 2023] Generative Prompt Model for Weakly Supervised Object Localization☆53Updated 10 months ago
- [NeurIPS2023] Official implementation and model release of the paper "What Makes Good Examples for Visual In-Context Learning?"☆160Updated 6 months ago
- Official PyTorch implementation for "Diffusion Models and Semi-Supervised Learners Benefit Mutually with Few Labels"☆75Updated 8 months ago
- The official implementation of GrootVL: Tree Topology is All You Need in State Space Model☆58Updated 3 months ago
- Open source implementation of "Vision Transformers Need Registers"☆126Updated last week
- [ICCV2023] DiffuMask: Synthesizing Images with Pixel-level Annotations for Semantic Segmentation Using Diffusion Models☆150Updated 10 months ago
- [CVPR 2024] Official implementation of "Universal Segmentation at Arbitrary Granularity with Language Instruction"☆75Updated 6 months ago
- Implements VAR+CLIP for image generation☆64Updated last month
- Official implementation of SCLIP: Rethinking Self-Attention for Dense Vision-Language Inference☆110Updated 8 months ago
- Simple PyTorch implementation of "Libra: Building Decoupled Vision System on Large Language Models" (accepted by ICML 2024)☆41Updated 3 months ago
- The official code of "U-DiTs: Downsample Tokens in U-Shaped Diffusion Transformers"☆64Updated 3 months ago
- [CVPR2024] The code of "UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory"☆62Updated 4 months ago
- Official code for paper: Auto Cherry-Picker: Learning from High-quality Generative Data Driven by Language☆20Updated 2 months ago
- [CVPR 2024] The repository contains the official implementation of "Open-Vocabulary Segmentation with Semantic-Assisted Calibration"☆45Updated 4 months ago
- Official implementation for paper "Knowledge Diffusion for Distillation", NeurIPS 2023☆72Updated 7 months ago
- [ICLR2024 Spotlight] Code Release of CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction☆161Updated 7 months ago
- 🔥stable, simple, state-of-the-art VQVAE toolkit & cookbook☆34Updated 2 months ago
- ICCV2023-Diffusion-Papers☆110Updated last year
- DenseFusion-1M: Merging Vision Experts for Comprehensive Multimodal Perception☆103Updated 3 weeks ago
- [ICCV 2023 Oral] Official Implementation of "Denoising Diffusion Autoencoders are Unified Self-supervised Learners"☆128Updated 7 months ago
- CLAP: Isolating Content from Style through Contrastive Learning with Augmented Prompts☆38Updated 3 weeks ago
- ☆40Updated 3 months ago
- ☆100Updated last month
- The official implementation of RAR☆61Updated 5 months ago