xverse-ai / XVERSE-V-13B

☆79

Alternatives and similar repositories for XVERSE-V-13B

Users who are interested in XVERSE-V-13B are comparing it to the repositories listed below.

WePOINTS / WePOINTS
☆186Updated 8 months ago
360CVGroup / SEEChat
Multimodal chatbot with computer vision capabilities integrated, our 1st-gen LMM
☆101Updated last year
360CVGroup / 360VL
Our 2nd-gen LMM
☆34Updated last year
bytedance / Valley
Valley is a cutting-edge multimodal large model designed to handle a variety of tasks involving text, images, and video data.
☆252Updated 2 months ago
vaew / SkyScript-100M
SkyScript-100M: 1,000,000,000 Pairs of Scripts and Shooting Scripts for Short Drama: https://arxiv.org/abs/2408.09333v2
☆127Updated 11 months ago
MonolithFoundation / Bumblebee
A simple MLLM that surpasses QwenVL-Max using only open-source data, with a 14B LLM.
☆38Updated last year
zai-org / GLM-Edge
GLM Series Edge Models
☆149Updated 4 months ago
pleisto / yuren-baichuan-7b
An open-source multimodal large language model based on baichuan-7b
☆72Updated last year
rednote-hilab / dots.vlm1
The official repository of the dots.vlm1 instruct models proposed by rednote-hilab.
☆260Updated 3 weeks ago
shootime2021 / APUS-xDAN-4.0-moe
An open-source LLM based on an MoE architecture.
☆58Updated last year
cnzzx / VSA
Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines
☆126Updated 11 months ago
thu-ml / zh-clip
☆72Updated 2 years ago
will-singularity / Skywork-MM
Empirical Study Towards Building An Effective Multi-Modal Large Language Model
☆22Updated last year
gpt4video / GPT4Video
Official code for GPT4Video: A Unified Multimodal Large Language Model for Instruction-Followed Understanding and Safety-Aware Generation
☆142Updated 11 months ago
TencentARC-QQ / QA-CLIP
Chinese CLIP models with SOTA performance.
☆58Updated 2 years ago
IEIT-Yuan / Yuan2.0-M32
Mixture-of-Experts (MoE) Language Model
☆189Updated last year
RhapsodyAILab / MiniCPM-V-Embedding
☆29Updated last year
SUSTech-IDEA / SUS-Chat
SUS-Chat: Instruction tuning done right
☆49Updated last year
yuyq96 / TextHawk
Exploring Efficient Fine-Grained Perception of Multimodal Large Language Models
☆63Updated 11 months ago
shiyemin / light-hf-proxy
A light proxy solution for HuggingFace hub.
☆46Updated last year
StarRing2022 / R1-Nature
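The simplest reproduction of R1 results on a small model, illustrating the most important essence of O1-like models and DeepSeek R1. Think is all you need. Experiments support the claim that, for strong reasoning ability, the content of the thinking process is the core of AGI/ASI.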
☆45Updated 8 months ago
AI-Study-Han / Zero-Qwen-VL
Trains a LLaVA model with better Chinese support, and open-sources the training code and data.
☆74Updated last year
xverse-ai / XVERSE-65B
XVERSE-65B: A multilingual large language model developed by XVERSE Technology Inc.
☆140Updated last year
OpenBMB / MobileCPM
A Toolkit for Running On-device Large Language Models (LLMs) in Apps
☆78Updated last year
VectorSpaceLab / MegaPairs
[ACL 2025 Oral] 🔥🔥 MegaPairs: Massive Data Synthesis for Universal Multimodal Retrieval
☆228Updated 5 months ago
opendatalab / image-downloader
☆28Updated last year
xverse-ai / XVERSE-MoE-A4.2B
XVERSE-MoE-A4.2B: A multilingual large language model developed by XVERSE Technology Inc.
☆39Updated last year
westlake-baichuan-mllm / bc-omni
Baichuan-Omni: Towards Capable Open-source Omni-modal LLM 🌊
☆269Updated 8 months ago
modelscope / lite-sora
An initiative to replicate Sora
☆103Updated last year
sterzhang / image-textualization
Image Textualization: An Automatic Framework for Generating Rich and Detailed Image Descriptions (NeurIPS 2024)
☆167Updated last year