yuxie11/R2D2

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/yuxie11/R2D2)

yuxie11 / R2D2

☆170

Alternatives and similar repositories for R2D2

Users that are interested in R2D2 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

BAAI-WuDao / BriVL
View on GitHub
Bridging Vision and Language Model
☆286Mar 27, 2023Updated 3 years ago
billjie1 / Chinese-CLIP
View on GitHub
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
☆167Nov 3, 2022Updated 3 years ago
ksOAn6g5 / TaiSu
View on GitHub
TaiSu（太素）--a large-scale Chinese multimodal dataset（亿级大规模中文视觉语言预训练数据集）
☆192Nov 17, 2023Updated 2 years ago
li-xirong / coco-cn
View on GitHub
Enriching MS-COCO with Chinese sentences and tags for cross-lingual multimedia tasks
☆214Feb 12, 2025Updated last year
OFA-Sys / Chinese-CLIP
View on GitHub
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
☆5,980Mar 31, 2026Updated 3 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
salesforce / ALBEF
View on GitHub
Code for ALBEF: a new vision-language pre-training method
☆1,755Sep 20, 2022Updated 3 years ago
MUGE-2021 / image-retrieval-baseline
View on GitHub
☆60Nov 17, 2022Updated 3 years ago
starmemda / CAMoE
View on GitHub
☆100Sep 27, 2021Updated 4 years ago
alibaba / AliceMind
View on GitHub
ALIbaba's Collection of Encoder-decoders from MinD (Machine IntelligeNce of Damo) Lab
☆2,043Mar 19, 2024Updated 2 years ago
OFA-Sys / OFA
View on GitHub
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence L…
☆2,557Apr 24, 2024Updated 2 years ago
applenob / clip_chinese_text_encoder
View on GitHub
CLIP中文encoder
☆26Jun 21, 2022Updated 4 years ago
360CVGroup / 360VL
View on GitHub
Our 2nd-gen LMM
☆34May 22, 2024Updated 2 years ago
jdcomsearch / poeem
View on GitHub
A library for end-to-end learning of embedding index and retrieval model
☆62Jul 7, 2021Updated 5 years ago
salesforce / BLIP
View on GitHub
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
☆5,717Mar 3, 2026Updated 4 months ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
microsoft / UniCL
View on GitHub
[CVPR 2022] Official code for "Unified Contrastive Learning in Image-Text-Label Space"
☆410Nov 10, 2023Updated 2 years ago
mynameischaos / Lion
View on GitHub
Lion: Kindling Vision Intelligence within Large Language Models
☆51Jan 25, 2024Updated 2 years ago
huawei-noah / Pretrained-Language-Model
View on GitHub
Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.
☆3,162Jan 22, 2024Updated 2 years ago
thu-ml / zh-clip
View on GitHub
☆73Jun 28, 2023Updated 3 years ago
IDEA-CCNL / Fengshenbang-LM
View on GitHub
Fengshenbang-LM(封神榜大模型)是IDEA研究院认知计算与自然语言研究中心主导的大模型开源体系，成为中文AIGC和认知智能的基础设施。
☆4,126Jun 8, 2026Updated last month
medhini / clip_it
View on GitHub
CLIP-It! Language-Guided Video Summarization
☆75Jun 21, 2021Updated 5 years ago
racp-submission / racp
View on GitHub
☆15Jun 2, 2021Updated 5 years ago
OpenBGBenchmark / OpenBG-Align
View on GitHub
Baselines for CCKS 2022 Task "Product Knowledge Graph Alignment"
☆31Feb 11, 2023Updated 3 years ago
JunnYu / ChineseBert_pytorch
View on GitHub
huggingface ChineseBert Tokenizer
☆16Apr 16, 2022Updated 4 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
CarnoZhao / GAIIC-Track1
View on GitHub
codes for GAIIC-Track1
☆15Jun 14, 2022Updated 4 years ago
JunnYu / GAU-alpha-pytorch
View on GitHub
GAU-alpha-pytorch
☆20May 11, 2022Updated 4 years ago
linjieli222 / HERO
View on GitHub
Research code for EMNLP 2020 paper "HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training"
☆235Sep 16, 2021Updated 4 years ago
microsoft / GLIP
View on GitHub
Grounded Language-Image Pre-training
☆2,605Jan 24, 2024Updated 2 years ago
megvii-research / protoclip
View on GitHub
📍 Official repository of paper "ProtoCLIP: Prototypical Contrastive Language Image Pretraining" (IEEE TNNLS 2023)
☆56Nov 8, 2023Updated 2 years ago
huggingface / m4-logs
View on GitHub
M4 experiment logbook
☆59Aug 21, 2023Updated 2 years ago
TencentARC-QQ / QA-CLIP
View on GitHub
Chinese CLIP models with SOTA performance.
☆63Aug 28, 2023Updated 2 years ago
yangjianxin1 / CLIP-Chinese
View on GitHub
中文CLIP预训练模型
☆418Dec 5, 2022Updated 3 years ago
microsoft / UniVL
View on GitHub
An official implementation for " UniVL: A Unified Video and Language Pre-Training Model for Multimodal Understanding and Generation"
☆366Jul 25, 2024Updated 2 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
360CVGroup / Bridge_Diffusion_Model
View on GitHub
Chinese-native image generation while compatible with SD eco-system, 1st-gen, AAAI2025
☆13Jun 25, 2024Updated 2 years ago
MUGE-2021 / image-caption-baseline
View on GitHub
☆66Dec 15, 2023Updated 2 years ago
xiaoxing2001 / DeGLA
View on GitHub
[ACM MM25] Official Pytorch implementation of [Decoupled Global-Local Alignment for Improving Compositional Understanding]
☆16Jul 15, 2025Updated last year
zhegan27 / LXMERT-AdvTrain
View on GitHub
Research Code for NeurIPS 2020 Spotlight paper "Large-Scale Adversarial Training for Vision-and-Language Representation Learning": LXMERT…
☆21Oct 20, 2020Updated 5 years ago
ArrowLuo / CLIP4Clip
View on GitHub
An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"
☆1,030Apr 12, 2024Updated 2 years ago
zengyan-97 / X-VLM
View on GitHub
X-VLM: Multi-Grained Vision Language Pre-Training (ICML 2022)
☆507Nov 25, 2022Updated 3 years ago
yandun72 / WBDC_2022_RANK15
View on GitHub
☆16Feb 28, 2023Updated 3 years ago