boreng0817/IFCap

boreng0817 / IFCap

[EMNLP 2024] IFCap: Image-like Retrieval and Frequency-based Entity Filtering for Zero-shot Captioning

☆15

Alternatives and similar repositories for IFCap

Users who are interested in IFCap are comparing it to the repositories listed below.

taewhankim / VIPCAP
☆15 · Dec 31, 2024 · Updated last year
ytaek-oh / fsc-clip
[EMNLP 2024] Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic Compositionality
☆23 · Oct 8, 2024 · Updated last year
ytaek-oh / vl_compo
☆10 · Jul 5, 2024 · Updated 2 years ago
aimagelab / PMA-Net
[ICCV 2023] With a Little Help from your own Past: Prototypical Memory Networks for Image Captioning.
☆19 · Jun 7, 2024 · Updated 2 years ago
NAVER-INTEL-Co-Lab / gaudi-lavcap
☆15 · Jan 24, 2025 · Updated last year
ytaek-oh / awesome-vl-compositionality
Awesome Vision-Language Compositionality, a comprehensive curation of research papers in literature.
☆40 · Feb 13, 2025 · Updated last year
junyangwang0410 / Knight
SotA text-only image/video method (IJCAI 2023)
☆15 · Jan 9, 2024 · Updated 2 years ago
FeiElysia / ViECap
Transferable Decoding with Visual Entities for Zero-Shot Image Captioning, ICCV 2023
☆167 · Sep 9, 2024 · Updated last year
njucckevin / KnowCap
Code for Beyond Generic: Enhancing Image Captioning with Real-World Knowledge using Vision-Language Pre-Training Model
☆13 · Feb 15, 2024 · Updated 2 years ago
ChenyuHeidiZhang / VL-commonsense
☆14 · May 23, 2022 · Updated 4 years ago
unveiled-the-red-hat / SEE-Few
Code for "SEE-Few: Seed, Expand and Entail for Few-shot Named Entity Recognition", accepted at COLING 2022.
☆12 · Nov 25, 2022 · Updated 3 years ago
RitaRamo / extra
Retrieval-augmented Image Captioning
☆13 · Feb 16, 2023 · Updated 3 years ago
yuhui-zh15 / C3
Official implementation of "Connect, Collapse, Corrupt: Learning Cross-Modal Tasks with Uni-Modal Data" (ICLR 2024)
☆36 · Oct 16, 2024 · Updated last year
Florence365 / GroundVTS
Grounded Visual Token Sampling (GroundVTS), a Vid-LLM architecture designed to enhance VTG performance through adaptive and efficient vis…
☆16 · Jun 12, 2026 · Updated last month
HM4725 / The-Art-of-Multiprocessor-Programming
THE ART of MULTIPROCESSOR PROGRAMMING, Maurice Herlihy & Nir Shavit
☆11 · Feb 12, 2023 · Updated 3 years ago
zhjgao / difformer
The official codebase for "Empowering Diffusion Models on the Embedding Space for Text Generation" (NAACL 2024)
☆56 · Apr 23, 2024 · Updated 2 years ago
art-jang / LiTFiC
[CVPR2025] Official code for Lost in Translation Found in Context
☆24 · Jan 14, 2026 · Updated 6 months ago
dhg-wei / DeCap
ICLR 2023 DeCap: Decoding CLIP Latents for Zero-shot Captioning
☆144 · Mar 16, 2023 · Updated 3 years ago
Hao840 / ADEM-VL
PyTorch code for "ADEM-VL: Adaptive and Embedded Fusion for Efficient Vision-Language Tuning"
☆21 · Oct 28, 2024 · Updated last year
joeyz0z / MeaCap
(CVPR2024) MeaCap: Memory-Augmented Zero-shot Image Captioning
☆56 · Aug 16, 2024 · Updated last year
prajwalkr / transpeller
Code for "Weakly-supervised Fingerspelling Recognition in British Sign Language Videos", BMVC 2022.
☆12 · Jun 22, 2023 · Updated 3 years ago
v-nhandt21 / ViMFA
Montreal Forced Aligner for Vietnamese
☆15 · Oct 23, 2023 · Updated 2 years ago
sejong-rcv / INSANet
[2024] INSANet: INtra-INter Spectral Attention Network for Effective Feature Fusion of Multispectral Pedestrian Detection, Sensors.
☆23 · Mar 20, 2024 · Updated 2 years ago
THUNLP-MT / ActiView
☆11 · Dec 20, 2024 · Updated last year
patrick-0817 / T-MASS-dataleakage
☆10 · Nov 27, 2024 · Updated last year
invhun / NarVid
[CVPR'2025] Narrating the Video: Boosting Text-Video Retrieval via Comprehensive Utilization of Frame-Level Captions
☆19 · Jan 16, 2026 · Updated 6 months ago
mrazhou / SeTa
[CVPR2025] Official code repository for SeTa: "Scale Efficient Training for Large Datasets"
☆24 · Mar 18, 2025 · Updated last year
echogarden-project / text-segmentation
A library for multilingual word, phrase and sentence segmentation.
☆16 · Updated this week
amazon-science / slang-llm-benchmark
☆19 · May 6, 2024 · Updated 2 years ago
fansunqi / AKeyS
Agentic Keyframe Search for Video Question Answering
☆18 · Jun 30, 2026 · Updated 3 weeks ago
google-deepmind / svo_probes
The SVO-Probes Dataset for Verb Understanding
☆29 · Jan 28, 2022 · Updated 4 years ago
Lihr747 / CgtGAN
☆20 · May 3, 2025 · Updated last year
SpeechEE / SpeechEE
☆11 · Aug 20, 2025 · Updated 11 months ago
SHI-Labs / Diffusion-Driven-Test-Time-Adaptation-via-Synthetic-Domain-Alignment
Everything to the Synthetic: Diffusion-driven Test-time Adaptation via Synthetic-Domain Alignment, arXiv 2024 / CVPR 2025
☆46 · Mar 1, 2025 · Updated last year
FarinaMatteo / zero
[NeurIPS '24] Frustratingly easy Test-Time Adaptation of VLMs!!
☆64 · Mar 24, 2025 · Updated last year
DAMO-NLP-SG / SSTuning
Code for ACL paper "Zero-Shot Text Classification via Self-Supervised Tuning"
☆29 · Sep 25, 2023 · Updated 2 years ago
mengzaiqiao / awesome-natural-language-reasoning
A collection of research papers related to Natural Language Reasoning
☆10 · May 27, 2022 · Updated 4 years ago
LunarShen / TempMe
[ICLR 2025] TempMe: Video Temporal Token Merging for Efficient Text-Video Retrieval
☆27 · Feb 13, 2025 · Updated last year
uniglot / korean-word-ipa-dictionary
Dictionary of pairs of Korean word and IPA crawled from Wiktionary (Korean edition)
☆23 · Nov 12, 2025 · Updated 8 months ago