ksOAn6g5/TaiSu

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ksOAn6g5/TaiSu)

ksOAn6g5 / TaiSu

TaiSu（太素）--a large-scale Chinese multimodal dataset（亿级大规模中文视觉语言预训练数据集）

☆192

Alternatives and similar repositories for TaiSu

Users that are interested in TaiSu are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

yuxie11 / R2D2
View on GitHub
☆170Nov 9, 2023Updated 2 years ago
kakaobrain / coyo-dataset
View on GitHub
COYO-700M: Large-scale Image-Text Pair Dataset
☆1,256Nov 30, 2022Updated 3 years ago
allenai / mmc4
View on GitHub
MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text.
☆953Mar 19, 2025Updated last year
chuhaojin / BriVL-BUA-applications
View on GitHub
Bling's Object detection tool
☆55Jan 9, 2023Updated 3 years ago
billjie1 / Chinese-CLIP
View on GitHub
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
☆167Nov 3, 2022Updated 3 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
kai-wen-yang / IDAA
View on GitHub
[ICML2022] "Identity-Disentangled Adversarial Augmentation for Self-Supervised Learning"
☆10Jul 24, 2022Updated 4 years ago
rom1504 / img2dataset
View on GitHub
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
☆4,436Oct 19, 2025Updated 9 months ago
baaivision / EVA
View on GitHub
EVA Series: Visual Representation Fantasies from BAAI
☆2,686Aug 1, 2024Updated last year
OpenGVLab / OmniCorpus
View on GitHub
[ICLR 2025 Spotlight] OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text
☆425May 5, 2025Updated last year
FuxiaoLiu / LRV-Instruction
View on GitHub
[ICLR'24] Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning
☆297Mar 13, 2024Updated 2 years ago
mlfoundations / datacomp
View on GitHub
DataComp: In search of the next generation of multimodal datasets
☆787Apr 28, 2025Updated last year
zhanxlin / Product1M
View on GitHub
Product1M
☆90Oct 12, 2022Updated 3 years ago
opendatalab / WanJuan1.0
View on GitHub
万卷1.0多模态语料
☆574Oct 20, 2023Updated 2 years ago
OPPO-Mente-Lab / GlyphDraw
View on GitHub
Text-To-Image Generation with Chinese Characters
☆133Jul 20, 2023Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
baaivision / CapsFusion
View on GitHub
[CVPR 2024] CapsFusion: Rethinking Image-Text Data at Scale
☆215Feb 27, 2024Updated 2 years ago
OFA-Sys / Chinese-CLIP
View on GitHub
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
☆5,978Mar 31, 2026Updated 3 months ago
Zeqiang-Lai / Mini-DALLE3
View on GitHub
Mini-DALLE3: Interactive Text to Image by Prompting Large Language Models
☆313Dec 28, 2023Updated 2 years ago
yangjianxin1 / OFA-Chinese
View on GitHub
transformers结构的中文OFA模型
☆138Feb 13, 2023Updated 3 years ago
LAION-AI / Big-Interleaved-Dataset
View on GitHub
Big-Interleaved-Dataset
☆59Jan 21, 2023Updated 3 years ago
benywon / ChiQA
View on GitHub
The implementations of various baselines in our CIKM 2022 paper: ChiQA: A Large Scale Image-based Real-World Question Answering Dataset f…
☆34May 13, 2024Updated 2 years ago
svdbase / SVD-transformer
View on GitHub
This repo is used for generating faking labeled positive videos for SVD dataset.
☆10Aug 16, 2020Updated 5 years ago
BAAI-DCAI / Visual-Instruction-Tuning
View on GitHub
SVIT: Scaling up Visual Instruction Tuning
☆167Jun 20, 2024Updated 2 years ago
salesforce / LAVIS
View on GitHub
LAVIS - A One-stop Library for Language-Vision Intelligence
☆11,253Jun 2, 2026Updated last month
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
IDEA-Research / hana
View on GitHub
Implementation and checkpoints of Imagen, Google's text-to-image synthesis neural network, in Pytorch
☆17Dec 22, 2022Updated 3 years ago
CMMMU-Benchmark / CMMMU
View on GitHub
☆48Sep 5, 2024Updated last year
CASIA-LMC-Lab / Obj2Seq
View on GitHub
Obj2Seq: Formatting Objects as Sequences with Class Prompt for Visual Tasks (NeurIPS2022)
☆85Nov 2, 2022Updated 3 years ago
thu-ml / zh-clip
View on GitHub
☆73Jun 28, 2023Updated 3 years ago
FreddeFrallan / Multilingual-CLIP
View on GitHub
OpenAI CLIP text encoders for multiple languages!
☆833May 15, 2023Updated 3 years ago
X-PLUG / mPLUG-Owl
View on GitHub
mPLUG-Owl: The Powerful Multi-modal Large Language Model Family
☆2,536Apr 2, 2025Updated last year
facebookresearch / diht
View on GitHub
Filtering, Distillation, and Hard Negatives for Vision-Language Pre-Training
☆141Dec 16, 2025Updated 7 months ago
schelotto / Gaussian_Word_Embedding
View on GitHub
PyTorch implementation of Gaussian word embeddings
☆19Apr 7, 2018Updated 8 years ago
phellonchen / X-LLM
View on GitHub
X-LLM: Bootstrapping Advanced Large Language Models by Treating Multi-Modalities as Foreign Languages
☆318Jul 14, 2026Updated last week
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
nlpapereading / nlpapereading
View on GitHub
☆58Sep 23, 2022Updated 3 years ago
salesforce / ALBEF
View on GitHub
Code for ALBEF: a new vision-language pre-training method
☆1,755Sep 20, 2022Updated 3 years ago
google-research-datasets / wit
View on GitHub
WIT (Wikipedia-based Image Text) Dataset is a large multimodal multilingual dataset comprising 37M+ image-text sets with 11M+ unique imag…
☆1,113Sep 27, 2024Updated last year
OFA-Sys / OFA
View on GitHub
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence L…
☆2,557Apr 24, 2024Updated 2 years ago
uta-smile / TCL
View on GitHub
code for TCL: Vision-Language Pre-Training with Triple Contrastive Learning, CVPR 2022
☆271Oct 2, 2024Updated last year
hustvl / MIMDet
View on GitHub
[ICCV 2023] You Only Look at One Partial Sequence
☆343Oct 21, 2023Updated 2 years ago
bwconrad / flexivit
View on GitHub
PyTorch reimplementation of FlexiViT: One Model for All Patch Sizes
☆68May 5, 2024Updated 2 years ago