Lihr747 / CgtGAN
☆20 May 3, 2025 Updated 9 months ago
Alternatives and similar repositories for CgtGAN
Users who are interested in CgtGAN are comparing it to the libraries listed below.
- [ICCV 2023] With a Little Help from your own Past: Prototypical Memory Networks for Image Captioning. ☆19 Jun 7, 2024 Updated last year
- Code for Beyond Generic: Enhancing Image Captioning with Real-World Knowledge using Vision-Language Pre-Training Model ☆13 Feb 15, 2024 Updated 2 years ago
- Project for SNARE benchmark ☆11 Jun 5, 2024 Updated last year
- ☆11 Sep 15, 2023 Updated 2 years ago
- ☆11 Oct 2, 2023 Updated 2 years ago
- 🏠🔍 Auto check for new apartments in Hamburg from various real estate providers ☆16 Jun 2, 2024 Updated last year
- Retrieval-augmented Image Captioning ☆13 Feb 16, 2023 Updated 2 years ago
- [EMNLP 2024] IFCap: Image-like Retrieval and Frequency-based Entity Filtering for Zero-shot Captioning ☆15 May 13, 2025 Updated 9 months ago
- Code for "Improving Translation Faithfulness of Large Language Models via Augmenting Instructions" ☆12 Aug 26, 2023 Updated 2 years ago
- The first unofficial implementation of CLIP4Caption: CLIP for Video Caption (ACMMM 2021) ☆15 Jan 2, 2023 Updated 3 years ago
- SotA text-only image/video method (IJCAI 2023) ☆16 Jan 9, 2024 Updated 2 years ago
- Official implementation for the paper "Transferring Visual Knowledge with Pre-trained Models for Multimodal Machine Translation", publish… ☆20 Jun 3, 2024 Updated last year
- (CVPR2024) MeaCap: Memory-Augmented Zero-shot Image Captioning ☆55 Aug 16, 2024 Updated last year
- ☆25 Jul 10, 2023 Updated 2 years ago
- ☆59 Aug 30, 2023 Updated 2 years ago
- Awesome Vision-Language Compositionality, a comprehensive curation of research papers in literature. ☆34 Feb 13, 2025 Updated last year
- Implementing ONNX runtime for android to run Segment Anything Model 2 ☆12 Aug 1, 2025 Updated 6 months ago
- Towards a Unified View on Visual Parameter-Efficient Transfer Learning ☆26 Oct 13, 2022 Updated 3 years ago
- ☆29 Oct 18, 2022 Updated 3 years ago
- Counterfactual Reasoning VQA Dataset ☆27 Nov 23, 2023 Updated 2 years ago
- VisualGPTScore for visio-linguistic reasoning ☆27 Oct 7, 2023 Updated 2 years ago
- This repository houses the code for the paper - "The Neglected of VLMs" ☆30 Dec 31, 2025 Updated last month
- (TIP'2023) Concept-Aware Video Captioning: Describing Videos with Effective Prior Information ☆32 Dec 26, 2024 Updated last year
- ☆32 Mar 25, 2024 Updated last year
- ☆30 Aug 14, 2023 Updated 2 years ago
- ☆29 Jun 10, 2024 Updated last year
- Repository related to Cranfield's AAI MSCs GDP ☆11 Apr 8, 2023 Updated 2 years ago
- Code for 'Why is Winoground Hard? Investigating Failures in Visuolinguistic Compositionality', EMNLP 2022 ☆31 May 29, 2023 Updated 2 years ago
- [CVPR 2024] TeachCLIP for Text-to-Video Retrieval ☆42 May 7, 2025 Updated 9 months ago
- [CVPR 2024] Improving language-visual pretraining efficiency by performing cluster-based masking on images. ☆31 May 16, 2024 Updated last year
- [ICML 2024] CrossGET: Cross-Guided Ensemble of Tokens for Accelerating Vision-Language Transformers ☆34 Dec 30, 2024 Updated last year
- Belief Revision based Caption Re-ranker with Visual Semantic Information. COLING 2022 ☆11 Apr 13, 2025 Updated 10 months ago
- [NIPS2023] Implementation of Foundation Model is Efficient Multimodal Multitask Model Selector ☆37 Mar 7, 2024 Updated last year
- [ICLR 2024] Test-Time RL with CLIP Feedback for Vision-Language Models. ☆99 Oct 20, 2025 Updated 3 months ago
- Generalization in Metric Learning: Should the Embedding Layer be the Embedding Layer? ☆11 Jan 3, 2019 Updated 7 years ago
- ☆10 Nov 15, 2023 Updated 2 years ago
- Improving Continuous Sign Language Recognition with Adapted Image Models ☆14 Nov 10, 2025 Updated 3 months ago
- ☆12 Jun 26, 2024 Updated last year
- Data Programming for Text Detection in Documents using SPEAR ☆12 Mar 26, 2025 Updated 10 months ago