amirziai / rafic
Stanford CS330 project
☆7Updated last year
Alternatives and similar repositories for rafic:
Users that are interested in rafic are comparing it to the libraries listed below
- [ICCVW2023] Robust Asymmetric Loss for Multi-Label Long-Tailed Learning☆18Updated last year
- Official PyTorch code for HILA☆28Updated 2 years ago
- S-CLIP: Semi-supervised Vision-Language Pre-training using Few Specialist Captions☆48Updated last year
- [ICLR 2023] “ Layer Grafted Pre-training: Bridging Contrastive Learning And Masked Image Modeling For Better Representations”, Ziyu Jian…☆24Updated 2 years ago
- [TIP] Exploring Effective Factors for Improving Visual In-Context Learning☆19Updated 5 months ago
- ☆20Updated last year
- Scaling Multi-modal Instruction Fine-tuning with Tens of Thousands Vision Task Types☆13Updated last month
- Knowledge Distillation using Contrastive Language-Image Pretraining (CLIP) without a teacher model.☆12Updated 6 months ago
- TensorFlow implementation of "TokenLearner: What Can 8 Learned Tokens Do for Images and Videos?"☆33Updated 3 years ago
- The official implementation of ADDP (ICLR 2024)☆12Updated last year
- The codes and dataset for the semantic explainable AI (S-XAI)☆15Updated 2 years ago
- PyTorch Implementation of "Your ViT is Secretly a Hybrid Discriminative-Generative Diffusion Model"☆48Updated 2 years ago
- [ICCV 2023] ViLLA: Fine-grained vision-language representation learning from real-world data☆41Updated last year
- ☆16Updated last year
- [ICLR 23] Contrastive Aligned of Vision to Language Through Parameter-Efficient Transfer Learning☆38Updated last year
- ☆32Updated 11 months ago
- ☆30Updated 2 years ago
- ☆18Updated 11 months ago
- Masked Vision-Language Transformer in Fashion☆33Updated last year
- Code for ACL 2023 Oral Paper: ManagerTower: Aggregating the Insights of Uni-Modal Experts for Vision-Language Representation Learning☆11Updated 3 months ago
- Official Code for ICML 2023 Paper: On the Generalization of Multi-modal Contrastive Learning☆25Updated last year
- ☆19Updated last month
- ☆43Updated 2 years ago
- [ICCV23] Official implementation of eP-ALM: Efficient Perceptual Augmentation of Language Models.☆27Updated last year
- Official implementation of CVPR 2024 paper "Retrieval-Augmented Open-Vocabulary Object Detection".☆35Updated 6 months ago
- ☆15Updated 4 months ago
- [CVPRW 2024] LaPA: Latent Prompt Assist Model For Medical Visual Question Answering☆16Updated 9 months ago
- Official implementation of Matrix Variational Masked Autoencoder (M-MAE) for ICML paper "Information Flow in Self-Supervised Learning" (h…☆14Updated 6 months ago
- [CVPR'24] Code for Emergent Open-Vocabulary Semantic Segmentation from Off-the-shelf Vision-Language Models☆17Updated 8 months ago
- 🔥MixPro: Data Augmentation with MaskMix and Progressive Attention Labeling for Vision Transformer [Official, ICLR 2023]☆22Updated last year