☆20May 3, 2025Updated 10 months ago
Alternatives and similar repositories for CgtGAN
Users that are interested in CgtGAN are comparing it to the libraries listed below
Sorting:
- Code for Beyond Generic: Enhancing Image Captioning with Real-World Knowledge using Vision-Language Pre-Training Model☆13Feb 15, 2024Updated 2 years ago
- ☆11Oct 2, 2023Updated 2 years ago
- ☆11Sep 15, 2023Updated 2 years ago
- ☆11Jun 27, 2023Updated 2 years ago
- 🏠🔍 Auto check for new apartments in Hamburg from various real estate provides☆16Jun 2, 2024Updated last year
- ☆11May 30, 2024Updated last year
- Code for "Improving Translation Faithfulness of Large Language Models via Augmenting Instructions"☆12Aug 26, 2023Updated 2 years ago
- [EMNLP 2024] IFCap: Image-like Retrieval and Frequency-based Entity Filtering for Zero-shot Captioning☆15May 13, 2025Updated 9 months ago
- The first unofficial implementation of CLIP4Caption: CLIP for Video Caption (ACMMM 2021)☆15Jan 2, 2023Updated 3 years ago
- SotA text-only image/video method (IJCAI 2023)☆16Jan 9, 2024Updated 2 years ago
- Official implementation for the paper "Transferring Visual Knowledge with Pre-trained Models for Multimodal Machine Translation", publish…☆20Jun 3, 2024Updated last year
- (CVPR2024) MeaCap: Memory-Augmented Zero-shot Image Captioning☆55Aug 16, 2024Updated last year
- [CVPR22] Group Contextualization for Video Recognition☆23Aug 24, 2023Updated 2 years ago
- ☆59Aug 30, 2023Updated 2 years ago
- Implementing ONNX runtime for android to run Segment Anything Model 2☆12Aug 1, 2025Updated 7 months ago
- Towards a Unified View on Visual Parameter-Efficient Transfer Learning☆26Oct 13, 2022Updated 3 years ago
- ☆29Oct 18, 2022Updated 3 years ago
- Counterfactual Reasoning VQA Dataset☆28Nov 23, 2023Updated 2 years ago
- (TIP'2023) Concept-Aware Video Captioning: Describing Videos with Effective Prior Information☆32Dec 26, 2024Updated last year
- This repository houses the code for the paper - "The Neglected of VLMs"☆30Dec 31, 2025Updated 2 months ago
- ☆29Jun 10, 2024Updated last year
- Code for 'Why is Winoground Hard? Investigating Failures in Visuolinguistic Compositionality', EMNLP 2022☆31May 29, 2023Updated 2 years ago
- ☆30Aug 14, 2023Updated 2 years ago
- Repository related to Cranfield's AAI MSCs GDP☆11Apr 8, 2023Updated 2 years ago
- [CVPR 2024] TeachCLIP for Text-to-Video Retrieval☆42May 7, 2025Updated 10 months ago
- [ICML 2024] CrossGET: Cross-Guided Ensemble of Tokens for Accelerating Vision-Language Transformers☆34Dec 30, 2024Updated last year
- ICCV 2023: CLIPN for Zero-Shot OOD Detection: Teaching CLIP to Say No☆142Dec 2, 2023Updated 2 years ago
- ☆10Feb 10, 2026Updated 3 weeks ago
- Belief Revision based Caption Re-ranker with Visual Semantic Information. COLING 2022☆11Apr 13, 2025Updated 10 months ago
- [NIPS2023]Implementation of Foundation Model is Efficient Multimodal Multitask Model Selector☆37Mar 7, 2024Updated 2 years ago
- [ICLR 2024] Test-Time RL with CLIP Feedback for Vision-Language Models.☆99Oct 20, 2025Updated 4 months ago
- Source code for Delving Deeper into the Decoder for Video Captioning☆39Jun 1, 2021Updated 4 years ago
- LED : Light Enhanced Depth Estimation at Night☆14Dec 9, 2025Updated 2 months ago
- Data Programming for Text Detection in Documents using SPEAR☆12Mar 26, 2025Updated 11 months ago
- ☆45Oct 5, 2025Updated 5 months ago
- Some of my practices on Algorithms : ) 这个仓库保存着我在 LeetCode、剑指Offer 上的一些解答,代码中保留了必要的注释。不一定是最优的解答,但力保代码简洁易懂。后续还会整合其他题库,如若发现什么错误,希望你能告诉我或帮助我…☆11Dec 3, 2024Updated last year
- [CVPR2025] Official code for Lost in Translation Found in Context☆23Jan 14, 2026Updated last month
- The code of MetaViewer: Towards A Unified Multi-View Representation (CVPR 2023).☆10Nov 20, 2023Updated 2 years ago
- ☆10Sep 5, 2024Updated last year