nguyentthong / CrossSummOptimalTransport
☆21Updated last year
Alternatives and similar repositories for CrossSummOptimalTransport:
Users who are interested in CrossSummOptimalTransport are comparing it to the libraries listed below
- ☆30Updated last year
- Official implementation of POODLE: Improving Few-shot Learning via Penalizing Out-of-Distribution Samples (NeurIPS 2021)☆14Updated 2 years ago
- TISE: Bag of Metrics for Text-to-Image Synthesis Evaluation (ECCV 2022)☆34Updated 2 months ago
- Source code for the paper "Prefix Language Models are Unified Modal Learners"☆43Updated last year
- VaLM: Visually-augmented Language Modeling. ICLR 2023.☆56Updated last year
- ☆15Updated 2 years ago
- [EMNLP’24 Main] Encoding and Controlling Global Semantics for Long-form Video Question Answering☆16Updated 3 months ago
- NAACL 2022: MCSE: Multimodal Contrastive Learning of Sentence Embeddings☆55Updated 7 months ago
- PyTorch code for Improving Commonsense in Vision-Language Models via Knowledge Graph Riddles (DANCE)☆23Updated 2 years ago
- Repo for "Zemi: Learning Zero-Shot Semi-Parametric Language Models from Multiple Tasks" ACL 2023 Findings☆16Updated last year
- Visual and Embodied Concepts evaluation benchmark☆21Updated last year
- Official codebase for ICLR oral paper Unsupervised Vision-Language Grammar Induction with Shared Structure Modeling☆35Updated 2 years ago
- Reading list for research topics in Diffusion models.☆17Updated last year
- This is the public github for our paper "Transformer with a Mixture of Gaussian Keys"☆26Updated 2 years ago
- ☆15Updated 3 years ago
- ☆33Updated last year
- Code and data for "Broaden the Vision: Geo-Diverse Visual Commonsense Reasoning" (EMNLP 2021).☆28Updated 3 years ago
- Code for paper "Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning"☆65Updated 11 months ago
- [2024-ACL]: TextBind: Multi-turn Interleaved Multimodal Instruction-following in the Wild☆47Updated last year
- Public code repo for EMNLP 2024 Findings paper "MACAROON: Training Vision-Language Models To Be Your Engaged Partners"☆13Updated 3 months ago
- MLPs for Vision and Language Modeling (Coming Soon)☆27Updated 3 years ago
- Mixture of Attention Heads☆41Updated 2 years ago
- Vision Large Language Models trained on M3IT instruction tuning dataset☆17Updated last year
- The code for lifelong few-shot language learning☆55Updated 2 years ago
- Code for 'Why is Winoground Hard? Investigating Failures in Visuolinguistic Compositionality', EMNLP 2022☆30Updated last year
- ☆34Updated last year
- [EMNLP'22] Weakly-Supervised Temporal Article Grounding☆14Updated last year
- Learning to Encode Position for Transformer with Continuous Dynamical Model☆59Updated 4 years ago
- [EMNLP 2022] Code for our paper “ZeroGen: Efficient Zero-shot Learning via Dataset Generation”.☆46Updated 2 years ago