facebookresearch/Llip

facebookresearch / Llip

Official PyTorch codebase for the paper "Modeling Caption Diversity in Contrastive Vision-Language Pretraining".

☆19

Alternatives and similar repositories for Llip

Users who are interested in Llip are comparing it to the repositories listed below.

Fsoft-AIC / Z-GMOT
[NAACL 2024] Z-GMOT: Zero-shot Generic Multiple Object Tracking
☆12 · May 19, 2026 · Updated 2 months ago
VinAIResearch / SwiftTry
☆15 · Jun 9, 2025 · Updated last year
google-deepmind / scivid
☆18 · Mar 2, 2026 · Updated 4 months ago
EIT-NLP / Layer_Select_Fuse_for_MLLM
[CVPR 2025] Official implementation of the paper "Multi-Layer Visual Feature Fusion in Multimodal LLMs: Methods, Analysis, and Best Practi…
☆48 · Oct 29, 2025 · Updated 9 months ago
ericcristofalo / GeoD
☆11 · Sep 30, 2020 · Updated 5 years ago
guoyongcs / TAPADL
Code of "Robustifying Token Attention for Vision Transformers"
☆20 · Dec 31, 2023 · Updated 2 years ago
ipcng00 / LDM-S
☆18 · Nov 19, 2024 · Updated last year
zef1611 / AIC23_NLRetrieval_HCMIU_CVIP
Official code of the 1st-place solution for the NVIDIA AI City Challenge 2023, Track 2
☆20 · Jul 25, 2023 · Updated 3 years ago
Qualcomm-AI-research / SharpDepth
Code accompanying the paper "SharpDepth: Sharpening Metric Depth Predictions Using Diffusion Distillation"
☆29 · May 8, 2026 · Updated 2 months ago
EIT-NLP / Connector-Selection-for-MLLM
[EMNLP 2024 Main] Official implementation of the paper "To Preserve or To Compress: An In-Depth Study of Connector Selection in Multimoda…
☆16 · Dec 13, 2024 · Updated last year
dbstjswo505 / StarLab-Dialogue-System
Video-based AI dialogue system
☆11 · Aug 16, 2023 · Updated 2 years ago
kevinliang888 / IVR-QA-baselines
[ICCV 2023] Simple Baselines for Interactive Video Retrieval with Questions and Answers
☆20 · Apr 16, 2024 · Updated 2 years ago
gabegrand / battleship
Official repo for "Shoot First, Ask Questions Later?"
☆24 · Apr 23, 2026 · Updated 3 months ago
AmeenAli / VideoMatch
☆14 · Jan 5, 2022 · Updated 4 years ago
minhnhatvt / glamor-net
Global-Local Attention for Emotion Recognition
☆20 · Nov 13, 2020 · Updated 5 years ago
dbstjswo505 / ESD
☆11 · May 1, 2023 · Updated 3 years ago
dbstjswo505 / Multimodal_AI_Video_Dialogue
Multimodal_AI_Video_Dialogue
☆16 · Dec 3, 2024 · Updated last year
AlexeyAB / SPVT-Transformer
☆13 · Nov 7, 2021 · Updated 4 years ago
ninatu / in_style
Official implementation of "In-style: Bridging Text and Uncurated Videos with Style Transfer for Cross-modal Retrieval." ICCV 2023
☆11 · Oct 5, 2023 · Updated 2 years ago
kaist-siit-lab / StarLab_1-1
☆28 · Mar 13, 2025 · Updated last year
gadm21 / Face-recognition-using-PCA-and-SVD
In this project, a facial recognition algorithm is implemented in Python using the PCA and SVD dimensionality reduction tools.
☆11 · Sep 2, 2019 · Updated 6 years ago
ngocquang / logging_system
A guide to building a shared remote logging system for multiple projects/servers
☆15 · Feb 26, 2020 · Updated 6 years ago
ThomasRobertFr / thesis
My PhD manuscript LaTeX code and the slides for the defense
☆11 · Feb 2, 2022 · Updated 4 years ago
SkyworkAI / DAQ-VS
Code for our work: DVIS-DAQ: Improving Video Segmentation via Dynamic Anchor Queries [ECCV 2024]
☆15 · Jul 11, 2024 · Updated 2 years ago
kaist-siit-lab / StarLab_1-2
☆28 · Mar 13, 2025 · Updated last year
sujanshresstha / SAM2-in-video
This repository contains code for deploying a Gradio application using the SAM2 model for video processing. The application allows users …
☆47 · Sep 24, 2024 · Updated last year
lucabarsellotti / awesome-open-vocabulary-semantic-segmentation
☆15 · May 7, 2024 · Updated 2 years ago
layumi / Awesome-Text2Motion-Generation
Awesome-Text2Motion-Generation
☆18 · Oct 26, 2023 · Updated 2 years ago
Qualcomm-AI-research / SwiftEdit
Official PyTorch implementation of our CVPR 2025 paper: "SwiftEdit: Lightning Fast Text-guided Image Editing via One-step Diffusion"
☆53 · Jan 7, 2026 · Updated 6 months ago
Annusha / xmic
X-MIC: Cross-Modal Instance Conditioning for Egocentric Action Generalization, CVPR 2024
☆11 · Nov 7, 2024 · Updated last year
nhtlongcs / AIC2022-VER
Text Query based Traffic Video Event Retrieval with Global-Local Fusion Embedding
☆13 · Aug 2, 2023 · Updated 2 years ago
JindongJiang / SlotSSMs
Official release of the NeurIPS 2024 paper "Slot State Space Models"
☆11 · Mar 22, 2025 · Updated last year
ZhengJun-AI / vfid-metrics
A toolkit for computing Video Fréchet Inception Distance (VFID) metrics.
☆11 · May 28, 2024 · Updated 2 years ago
iLearn-Lab / MM23-RTQ
ACM Multimedia 2023 (Oral) - RTQ: Rethinking Video-language Understanding Based on Image-text Model
☆15 · Apr 7, 2026 · Updated 3 months ago
ex3ndr / supervoice-gpt
GPT-style network for phonemization with durations of text
☆68 · Mar 21, 2024 · Updated 2 years ago
facebookresearch / dmae_st
Directed masked autoencoders
☆14 · Mar 25, 2026 · Updated 4 months ago
Hanjun-Dai / r-hsmm
Implementation of Recurrent Hidden Semi-Markov Model http://www.cc.gatech.edu/~lsong/papers/DaiDaiZhaLietal17.pdf
☆13 · Mar 31, 2019 · Updated 7 years ago
HelloJianHan / P2LR
Code for "Delving into Probabilistic Uncertainty for Unsupervised Domain Adaptive Person Re-Identification" (AAAI 2022)
☆18 · Apr 8, 2022 · Updated 4 years ago
jason-lim26 / DiPEx
Official PyTorch implementation of our paper "Dispersing Prompt Expansion for Class-Agnostic Object Detection" (NeurIPS 2024)
☆14 · Jan 19, 2025 · Updated last year