km1994 / AwesomeMultiModelLinks
【AIGC 实战入门笔记 —— AIGC 摩天大楼】分享 大语言模型(LLMs),大模型高效微调(SFT),检索增强生成(RAG),智能体(Agent),PPT自动生成, 角色扮演,文生图(Stable Diffusion) ,图像文字识别(OCR),语音识别(ASR),语音合成(TTS),人像分割(SA),多模态(VLM),Ai 换脸(Face Swapping), 文生视频(VD),图生视频(SVD),Ai 动作迁移,Ai 虚拟试衣,数字人,全模态理解(Omni),Ai音乐生成 干货学习 等 实战与经验。
☆14Updated 2 months ago
Alternatives and similar repositories for AwesomeMultiModel
Users that are interested in AwesomeMultiModel are comparing it to the libraries listed below
Sorting:
- 中文原生文生图测评基准☆9Updated 11 months ago
- Chinese CLIP models with SOTA performance.☆55Updated last year
- 中文原生多层次文生视频测评基准☆17Updated 11 months ago
- ☆10Updated 7 months ago
- Our 2nd-gen LMM☆33Updated last year
- Taiyi-Diffusion-XL训练代码☆22Updated last year
- A multimodal large-scale model, which performs close to the closed-source Qwen-VL-PLUS on many datasets and significantly surpasses the p…☆14Updated last year
- CLIP中文encoder☆22Updated 3 years ago
- 集成了LLM与SDXL的AIGC应用程序☆29Updated last year
- A Simple MLLM Surpassed QwenVL-Max with OpenSource Data Only in 14B LLM.☆37Updated 9 months ago
- ☆16Updated 2 years ago
- ☆15Updated 8 months ago
- 本项目使用LLaVA 1.6多模态模型实现以文搜图和以图搜图功能。☆23Updated last year
- ☆28Updated last year
- [CVPR Challenge Rank 2nd] The codes and related files to reproduce the results for Video Similarity Challenge Descriptor Track.☆19Updated 2 months ago
- 本项目是关于Yi的多模态系列模型,如Yi-VL-6B/34B等的实验与应用。☆13Updated last year
- 国内外数据竞赛资讯整理☆18Updated 3 years ago
- ☆29Updated 10 months ago
- [PR 2024] A large Cross-Modal Video Retrieval Dataset with Reading Comprehension☆26Updated last year
- Chinese Stable Diffusion, zh SD,中文文生图,中文SD,中文Stable Diffusion☆49Updated last year
- Empirical Study Towards Building An Effective Multi-Modal Large Language Model☆22Updated last year
- ☆69Updated 3 weeks ago
- Modify-Anything is based on yolov5,yolov8 for video and image detection. Segment-anything,lama_cleaner is applied to segment, modify, era…☆15Updated 2 years ago
- LAVIS - A One-stop Library for Language-Vision Intelligence☆10Updated 2 years ago
- ☆15Updated 5 months ago
- ☆68Updated last year
- 可以成功Lora微调的Qwen-VL模型☆18Updated last year
- Source code and checkpoints for legal pre-trained language models.☆15Updated 4 years ago
- Exploration of the multi modal fuyu-8b model of Adept. 🤓 🔍☆28Updated last year
- image retrieval systems based on CNN feature distance and triplet loss☆31Updated 3 years ago