nguyentthong / CrossSummOptimalTransport
☆23Updated last year
Alternatives and similar repositories for CrossSummOptimalTransport
Users that are interested in CrossSummOptimalTransport are comparing it to the libraries listed below
Sorting:
- ☆31Updated last year
- Official implementation of POODLE: Improving Few-shot Learning via Penalizing Out-of-Distribution Samples (NeurIPS 2021)☆14Updated 2 years ago
- TISE: Bag of Metrics for Text-to-Image Synthesis Evaluation (ECCV 2022)☆36Updated 6 months ago
- ☆33Updated last year
- ☆35Updated last year
- [ICML 2022] Code and data for our paper "IGLUE: A Benchmark for Transfer Learning across Modalities, Tasks, and Languages"☆49Updated 2 years ago
- codebase for the SIMAT dataset and evaluation☆39Updated 3 years ago
- Source code for the paper "Prefix Language Models are Unified Modal Learners"☆43Updated 2 years ago
- Code and data for ImageCoDe, a contextual vison-and-language benchmark☆39Updated last year
- PyTorch code for Improving Commonsense in Vision-Language Models via Knowledge Graph Riddles (DANCE)☆23Updated 2 years ago
- This is the public github for our paper "Transformer with a Mixture of Gaussian Keys"☆27Updated 2 years ago
- Mixture of Attention Heads☆44Updated 2 years ago
- ☆22Updated 2 years ago
- VaLM: Visually-augmented Language Modeling. ICLR 2023.☆56Updated 2 years ago
- Command-line tool for downloading and extending the RedCaps dataset.☆47Updated last year
- [ACL 2023] Code and data for our paper "Measuring Progress in Fine-grained Vision-and-Language Understanding"☆13Updated last year
- MLPs for Vision and Langauge Modeling (Coming Soon)☆27Updated 3 years ago
- ☆32Updated 3 years ago
- Code and data for "Broaden the Vision: Geo-Diverse Visual Commonsense Reasoning" (EMNLP 2021).☆28Updated 3 years ago
- ☆29Updated 2 years ago
- PyTorch code for "Perceiver-VL: Efficient Vision-and-Language Modeling with Iterative Latent Attention" (WACV 2023)☆33Updated 2 years ago
- LV-BERT: Exploiting Layer Variety for BERT (Findings of ACL 2021)☆18Updated 2 years ago
- Official codebase for ICLR oral paper Unsupervised Vision-Language Grammar Induction with Shared Structure Modeling☆36Updated 3 years ago
- Repository for the paper "Data Efficient Masked Language Modeling for Vision and Language".☆18Updated 3 years ago
- ☆29Updated 2 years ago
- [Findings of EMNLP 2022] AssistSR: Task-oriented Video Segment Retrieval for Personal AI Assistant☆23Updated last year
- Pytorch version of VidLanKD: Improving Language Understanding viaVideo-Distilled Knowledge Transfer (NeurIPS 2021))☆56Updated 2 years ago
- The official repository for our paper "The Dual Form of Neural Networks Revisited: Connecting Test Time Predictions to Training Patterns …☆16Updated last year
- ☆23Updated 4 years ago
- [ICML 2022] Latent Diffusion Energy-Based Model for Interpretable Text Modeling☆65Updated 2 years ago