art-jang/LiTFiC

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/art-jang/LiTFiC)

art-jang / LiTFiC

[CVPR2025] Official code for Lost in Translation Found in Context

☆23

Alternatives and similar repositories for LiTFiC

Users that are interested in LiTFiC are comparing it to the libraries listed below

Sorting:

kaistmm / VoxMM
View on GitHub
☆19Apr 18, 2024Updated last year
eddie-euijun-hwang / SpaMo
View on GitHub
☆32Jul 9, 2025Updated 8 months ago
kaistmm / SlowFastSign
View on GitHub
[ICASSP 2024] Official code for Slowfast Network for Continuous Sign Language Recognition
☆61Jul 4, 2025Updated 8 months ago
chojw / genb
View on GitHub
Generative Bias for Robust Visual Question Answering ( CVPR 2023 )
☆28Jul 4, 2023Updated 2 years ago
ryanwongsa / Sign2GPT
View on GitHub
[ICLR2024] Sign2GPT: Leveraging Large Language Models for Gloss-Free Sign Language Translation
☆36Jun 30, 2025Updated 8 months ago
kaistmm / Metric-UD-KWS
View on GitHub
Official code for Metric learning for user-defined keyword spotting
☆38Feb 21, 2024Updated 2 years ago
MinJunKang / DualPixelFace
View on GitHub
Facial Depth and Normal Estimation using Dual-Pixel Camera (ECCV 22)
☆33Updated this week
taeyeopl / Object-Pose-Estimation-CVPR-2024
View on GitHub
☆34Jun 24, 2024Updated last year
JinhuiYE / SignCL
View on GitHub
This is the official code repository for the paper 'Improving Gloss-free Sign Language Translation by Reducing Representation Density'.
☆35Nov 27, 2024Updated last year
ms-dot-k / Visual-Audio-Memory
View on GitHub
PyTorch implementation of "Multi-modality Associative Bridging through Memory: Speech Sound Recollected from Face Video" (ICCV2021)
☆20Apr 11, 2022Updated 3 years ago
kaistmm / SSLalignment
View on GitHub
☆37May 28, 2025Updated 9 months ago
JongSuk1 / EquiAV
View on GitHub
☆36Jan 20, 2025Updated last year
njucckevin / KnowCap
View on GitHub
Code for Beyond Generic: Enhancing Image Captioning with Real-World Knowledge using Vision-Language Pre-Training Model
☆13Feb 15, 2024Updated 2 years ago
hulianyuyy / AdaptSign
View on GitHub
Improving Continuous Sign Language Recognition with Adapted Image Models
☆14Nov 10, 2025Updated 3 months ago
caijunhao / ov9d
View on GitHub
☆40Jun 20, 2024Updated last year
robinyjpark / AutoLabelClassifier
View on GitHub
☆10Oct 24, 2024Updated last year
graphdeco-inria / gaussian-hierarchy
View on GitHub
☆13Jul 17, 2024Updated last year
sejong-rcv / PVLR
View on GitHub
[ACM MM-24] Probabilistic Vision-Language Representation for Weakly Supervised Temporal Action Localization
☆12Oct 8, 2024Updated last year
kaist-ami / AVHBench
View on GitHub
[ICLR'25] Official repository for "AVHBench: A Cross-Modal Hallucination Evaluation for Audio-Visual Large Language Models"
☆20Feb 25, 2026Updated last week
Vinoground / Vinoground
View on GitHub
☆13Aug 7, 2025Updated 7 months ago
david-gimeno / tailored-avsr
View on GitHub
Official source code for the paper "Tailored Design of Audio-Visual Speech Recognition Models using Branchformers"
☆14Feb 24, 2025Updated last year
graphdeco-inria / hierarchy-rasterizer
View on GitHub
☆10Aug 7, 2024Updated last year
zhjohnchan / bert-clip-synesthesia
View on GitHub
[Findings of ACL-2023] This is the official implementation of On the Difference of BERT-style and CLIP-style Text Encoders.
☆14Jun 7, 2023Updated 2 years ago
ytaek-oh / vl_compo
View on GitHub
☆10Jul 5, 2024Updated last year
UkcheolShin / SelfDepth4Thermal
View on GitHub
☆10Mar 30, 2023Updated 2 years ago
JongSuk1 / AVCap
View on GitHub
☆11Sep 1, 2024Updated last year
e-bug / fine-grained-evals
View on GitHub
[ACL 2023] Code and data for our paper "Measuring Progress in Fine-grained Vision-and-Language Understanding"
☆13Jun 11, 2023Updated 2 years ago
ms-dot-k / Image-to-Speech
View on GitHub
Pytorch implementation of "Towards Practical and Efficient Image-to-Speech Captioning with Vision-Language Pre-training and Multi-modal T…
☆12Mar 9, 2024Updated 2 years ago
LHL3341 / ContextBLIP
View on GitHub
☆11May 17, 2024Updated last year
Jam1ezhang / RankCLIP
View on GitHub
Ranking-Consistent Language-Image Pretraining
☆12Oct 24, 2025Updated 4 months ago
ytaek-oh / retriever
View on GitHub
☆11Sep 15, 2023Updated 2 years ago
NJUyued / SoC4SS-FGVC
View on GitHub
"Roll with the Punches: Expansion and Shrinkage of Soft Label Selection for Semi-supervised Fine-Grained Learning" by Yue Duan (AAAI 2024…
☆13Nov 20, 2025Updated 3 months ago
ms-dot-k / LRW_ID
View on GitHub
The speaker-labeled information of LRW dataset, which is the outcome of the paper "Speaker-adaptive Lip Reading with User-dependent Paddi…
☆10Oct 12, 2023Updated 2 years ago
ys-zong / MIRB
View on GitHub
Benchmarking Multi-Image Understanding in Vision and Language Models
☆12Jul 29, 2024Updated last year
minzwon / docker_dl4mir
View on GitHub
☆11Nov 17, 2018Updated 7 years ago
princeton-pli / VLM_S2H
View on GitHub
Generalizing from SIMPLE to HARD Visual Reasoning: Can We Mitigate Modality Imbalance in VLMs?
☆16Jun 3, 2025Updated 9 months ago
Annusha / xmic
View on GitHub
X-MIC: Cross-Modal Instance Conditioning for Egocentric Action Generalization, CVPR 2024
☆11Nov 7, 2024Updated last year
ldynx / SAVE
View on GitHub
☆25Nov 22, 2024Updated last year
uvavision / SyViC
View on GitHub
[ICCV 2023] Going Beyond Nouns With Vision & Language Models Using Synthetic Data
☆14Sep 30, 2023Updated 2 years ago