weijiawu / Awesome-Synthetic-Data-for-Perception-Task
☆44Updated last year
Related projects:
- ☆20Updated last year
- Turning to Video for Transcript Sorting☆44Updated last year
- ☆52Updated last year
- ☆20Updated 9 months ago
- [ECCV 2024] ControlCap: Controllable Region-level Captioning☆49Updated last month
- ☆13Updated last week
- Benchmarking and Analyzing Generative Data for Visual Recognition☆26Updated last year
- Teach-DETR: Better Training DETR with Teachers☆28Updated 6 months ago
- code base for vision transformers☆35Updated 2 years ago
- Code Release of F-LMM: Grounding Frozen Large Multimodal Models☆35Updated last month
- Implementation and checkpoints of Imagen, Google's text-to-image synthesis neural network, in PyTorch☆17Updated last year
- IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model☆18Updated last week
- DiverGen (CVPR 2024) & BSGAL (ICML 2024)☆33Updated 3 weeks ago
- A collection of awesome papers on the alignment of diffusion models.☆21Updated last week
- TOPA: Extend Large Language Models for Video Understanding via Text-Only Pre-Alignment☆13Updated 3 months ago
- Official codes for ConMIM (ICLR 2023)☆57Updated last year
- ☆10Updated 8 months ago
- (ICCV 2023) Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation☆43Updated 2 months ago
- [ICLR 2024] Real-Fake: Effective Training Data Synthesis Through Distribution Matching☆69Updated 9 months ago
- ☆57Updated last year
- Learning to Mask and Permute Visual Tokens for Vision Transformer Pre-Training☆15Updated last year
- Towards a Unified View on Visual Parameter-Efficient Transfer Learning☆26Updated last year
- Accelerating Vision-Language Pretraining with Free Language Modeling (CVPR 2023)☆31Updated last year
- ☆15Updated 4 months ago
- [ECCV 2022] Official PyTorch implementation of "mc-BEiT: Multi-choice Discretization for Image BERT Pre-training" in European Conference …☆22Updated 2 years ago
- ☆41Updated this week
- ☆55Updated 11 months ago
- Not All Steps are Created Equal: Selective Diffusion Distillation for Image Manipulation (ICCV 2023)☆61Updated 11 months ago
- Official implementation of TagAlign☆31Updated 5 months ago
- Official PyTorch implementation of Clover: Towards A Unified Video-Language Alignment and Fusion Model (CVPR 2023)☆40Updated last year