360CVGroup/SEEChat

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/360CVGroup/SEEChat)

360CVGroup / SEEChat

Multimodal chatbot with computer vision capabilities integrated, our 1st-gen LMM

☆101

Alternatives and similar repositories for SEEChat

Users that are interested in SEEChat are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

360CVGroup / Bridge_Diffusion_Model
View on GitHub
Chinese-native image generation while compatible with SD eco-system, 1st-gen, AAAI2025
☆13Jun 25, 2024Updated 2 years ago
360CVGroup / Inner-Adaptor-Architecture
View on GitHub
LMM solved catastrophic forgetting, AAAI2025
☆45Apr 15, 2025Updated last year
pleisto / yuren-baichuan-7b
View on GitHub
基于baichuan-7b的开源多模态大语言模型
☆72Dec 7, 2023Updated 2 years ago
Pillars-Creation / Visualglm-image-to-text
View on GitHub
补充了一些Visualglm缺少的文件，可以对Visualglm进行训练，实例中是对人脸做了面相的识别
☆13Jun 7, 2023Updated 3 years ago
ChenDelong1999 / polite-flamingo
View on GitHub
🦩 Official repository of paper "Visual Instruction Tuning with Polite Flamingo" (AAAI-24 Oral)
☆65Dec 9, 2023Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
360CVGroup / 360VL
View on GitHub
Our 2nd-gen LMM
☆34May 22, 2024Updated 2 years ago
anonymous0x233 / ReuseAndDiffuse
View on GitHub
Reuse and Diffuse: Iterative Denoising for Text-to-Video Generation
☆38Nov 21, 2023Updated 2 years ago
X-PLUG / Youku-mPLUG
View on GitHub
Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Pre-training Dataset and Benchmarks
☆307Jan 8, 2024Updated 2 years ago
will-singularity / Skywork-MM
View on GitHub
Empirical Study Towards Building An Effective Multi-Modal Large Language Model
☆22Oct 25, 2023Updated 2 years ago
OpenBMB / VisCPM
View on GitHub
[ICLR'24 spotlight] Chinese and English Multimodal Large Model Series (Chat and Paint) | 基于CPM基础模型的中英双语多模态大模型系列
☆1,063Jun 13, 2024Updated 2 years ago
zai-org / VisualGLM-6B
View on GitHub
Chinese and English multimodal conversational language model | 多模态中英双语对话语言模型
☆4,158Aug 23, 2024Updated last year
phellonchen / X-LLM
View on GitHub
X-LLM: Bootstrapping Advanced Large Language Models by Treating Multi-Modalities as Foreign Languages
☆318Jul 14, 2026Updated last week
BayLing-Models / BayLing
View on GitHub
“百聆”是一个基于LLaMA的语言对齐增强的英语/中文大语言模型，具有优越的英语/中文能力，在多语言和通用任务等多项测试中取得ChatGPT 90%的性能。BayLing is an English/Chinese LLM equipped with advanced l…
☆315Dec 3, 2024Updated last year
huizhang0110 / catvision
View on GitHub
A multimodal large-scale model, which performs close to the closed-source Qwen-VL-PLUS on many datasets and significantly surpasses the p…
☆14Feb 5, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
airaria / Visual-Chinese-LLaMA-Alpaca
View on GitHub
多模态中文LLaMA&Alpaca大语言模型（VisualCLA）
☆461Jul 27, 2023Updated 2 years ago
billjie1 / Chinese-CLIP
View on GitHub
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
☆167Nov 3, 2022Updated 3 years ago
LinkSoul-AI / Chinese-LLaVA
View on GitHub
支持中英文双语视觉-文本对话的开源可商用多模态模型。
☆378Sep 23, 2023Updated 2 years ago
buptlihang / CVLM
View on GitHub
☆23Jan 8, 2024Updated 2 years ago
scenarios / WeMM
View on GitHub
☆90Jul 4, 2024Updated 2 years ago
PCIResearch / TransCore-M
View on GitHub
Large Multimodal Model
☆15Apr 8, 2024Updated 2 years ago
kyegomez / PALI3
View on GitHub
Implementation of PALI3 from the paper PALI-3 VISION LANGUAGE MODELS: SMALLER, FASTER, STRONGER"
☆147Jun 22, 2026Updated 3 weeks ago
PaddlePaddle / PaddleMIX
View on GitHub
Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pret…
☆724Mar 6, 2026Updated 4 months ago
SALT-NLP / LLaVAR
View on GitHub
Code/Data for the paper: "LLaVAR: Enhanced Visual Instruction Tuning for Text-Rich Image Understanding"
☆268Jun 12, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
iflytek / VLE
View on GitHub
VLE: Vision-Language Encoder (VLE: 视觉-语言多模态预训练模型)
☆197Mar 13, 2023Updated 3 years ago
Alibaba-NLP / CoFE-RAG
View on GitHub
☆44Apr 11, 2025Updated last year
ArrowLuo / CLIP4Clip
View on GitHub
An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"
☆1,028Apr 12, 2024Updated 2 years ago
qinggangwu / yolov5-Heading
View on GitHub
基于yoloV5进行多类别+关键检测，关键点检测主要是计算车辆航向角
☆18Jun 1, 2022Updated 4 years ago
mightyzau / InfMLLM
View on GitHub
☆19Dec 6, 2023Updated 2 years ago
mlfoundations / open_flamingo
View on GitHub
An open-source framework for training large multimodal models.
☆4,114Aug 31, 2024Updated last year
OpenGVLab / Ask-Anything
View on GitHub
[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
☆3,343Updated this week
OurBluePrint / easy_video
View on GitHub
☆20Mar 3, 2025Updated last year
yxuansu / PandaGPT
View on GitHub
[TLLM'23] PandaGPT: One Model To Instruction-Follow Them All
☆862Jun 1, 2023Updated 3 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
liangheming / yolo_seriesv1
View on GitHub
YOLOv4/v5, clean code, good performance,easy to understand.
☆15Sep 29, 2020Updated 5 years ago
OpenBuddy / OpenBuddy
View on GitHub
Open Multilingual Chatbot for Everyone
☆1,272Jun 8, 2025Updated last year
X-PLUG / mPLUG-Owl
View on GitHub
mPLUG-Owl: The Powerful Multi-modal Large Language Model Family
☆2,535Apr 2, 2025Updated last year
JustinXu0 / AnimateZoo
View on GitHub
☆22Mar 27, 2026Updated 3 months ago
coderonion / awesome-open-world-object-detection
View on GitHub
This repository lists some awesome public Open World object detection series projects.
☆31Feb 22, 2024Updated 2 years ago
UCSB-AI / MiniGPT-5
View on GitHub
Official implementation of paper "MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens"
☆867May 8, 2025Updated last year
FreedomIntelligence / ALLaVA
View on GitHub
Harnessing 1.4M GPT4V-synthesized Data for A Lite Vision-Language Model
☆281Jun 25, 2024Updated 2 years ago