BAAI-WuDao / CogView
Text-to-Image generation
☆35 Updated 4 years ago
Alternatives and similar repositories for CogView
Users who are interested in CogView are comparing it to the repositories listed below.
- “WuDao” models ☆131 Updated 4 years ago
- Finetune CPM-1 ☆24 Updated 4 years ago
- ☆34 Updated 4 years ago
- Chinese encoder for CLIP ☆22 Updated 3 years ago
- ☆32 Updated 3 years ago
- Source code and checkpoints for legal pre-trained language models. ☆15 Updated 4 years ago
- [LREC] MMChat: Multi-Modal Chat Dataset on Social Media ☆108 Updated 3 years ago
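- Kanchil (mouse-deer) is the world's smallest even-toed ungulate; this open-source project explores whether small models (under 6B parameters) can also be aligned with human preferences. ☆113 Updated 2 years ago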
- WuDaoMM: a data project ☆74 Updated 3 years ago
- Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation. ☆169 Updated 3 years ago
- GLM (General Language Model) ☆24 Updated 3 years ago
- Code for CPM-2 Pre-Train ☆158 Updated 2 years ago
- An open-source multimodal large language model based on baichuan-7b ☆72 Updated 2 years ago
- A unified tokenization tool for Images, Chinese and English. ☆153 Updated 2 years ago
- Multimodal chatbot with integrated computer vision capabilities; our first-generation LMM ☆101 Updated last year
- Finetune CPM-1 ☆75 Updated 2 years ago
- A fine-tuned version of the Stable Diffusion model, trained on a self-translated 10k DiffusionDB Chinese corpus and then "extended" ☆32 Updated 2 years ago
- Bridging Vision and Language Model ☆285 Updated 2 years ago
- The world's first large-scale multi-modal short-video encyclopedia, where the primitive units are items, aspects, and short videos. ☆64 Updated 2 years ago
- ☆65 Updated 2 years ago
- Chinese CLIP models with SOTA performance. ☆59 Updated 2 years ago
- TaiSu (太素): a large-scale Chinese multimodal dataset (a Chinese vision-language pre-training dataset at the hundred-million scale) ☆189 Updated 2 years ago
- Gaokao Benchmark for AI ☆109 Updated 3 years ago
- ☆60 Updated 3 years ago
- The multilingual variant of GLM, a general language model trained with autoregressive blank infilling objective ☆62 Updated 3 years ago
- Bling's Object detection tool ☆56 Updated 2 years ago
- [CVPR 2022] Aesthetic Text Logo Synthesis via Content-aware Layout Inferring ☆274 Updated 3 years ago
- Introduction to CPM ☆17 Updated 4 years ago
- VLE: Vision-Language Encoder (a vision-language multimodal pre-trained model) ☆194 Updated 2 years ago
- ☆168 Updated 2 years ago