Alpha-VLLM / WeMix-LLMLinks

☆17

Alternatives and similar repositories for WeMix-LLM

Users that are interested in WeMix-LLM are comparing it to the libraries listed below

Sorting:

will-singularity / Skywork-MM
Empirical Study Towards Building An Effective Multi-Modal Large Language Model
☆22Updated 2 years ago
SparksJoe / Prism
A Framework for Decoupling and Assessing the Capabilities of VLMs
☆43Updated last year
FudanNLPLAB / MouSi
☆74Updated last year
OFA-Sys / TouchStone
Touchstone: Evaluating Vision-Language Models by Language Models
☆83Updated last year
patrick-tssn / Awesome-Colorful-LLM
Recent advancements propelled by large language models (LLMs), encompassing an array of domains including Vision, Audio, Agent, Robotics,…
☆123Updated 4 months ago
SihengLi99 / TextBind
[2024-ACL]: TextBind: Multi-turn Interleaved Multimodal Instruction-following in the Wildrounded Conversation
☆46Updated 2 years ago
RhapsodyAILab / MiniCPM-V-Embedding
☆29Updated last year
vaew / SkyScript-100M
SkyScript-100M: 1,000,000,000 Pairs of Scripts and Shooting Scripts for Short Drama: https://arxiv.org/abs/2408.09333v2
☆127Updated 11 months ago
GuoqingWang1 / WebFilter
☆28Updated 2 weeks ago
OpenLMLab / scaling-rope
code for Scaling Laws of RoPE-based Extrapolation
☆73Updated 2 years ago
Zheng0428 / COIG-Kun
☆36Updated last year
jdf-prog / LLM-Engines
☆50Updated 4 months ago
DAMO-NLP-SG / CLEX
[ICLR 2024] CLEX: Continuous Length Extrapolation for Large Language Models
☆78Updated last year
MBZUAI-LLM / web2code
Web2Code: A Large-scale Webpage-to-Code Dataset and Evaluation Framework for Multimodal LLMs
☆91Updated last year
YuchuanTian / RethinkTinyLM
[ICML'24] The official implementation of “Rethinking Optimization and Architecture for Tiny Language Models”
☆123Updated 9 months ago
mlfoundations / VisIT-Bench
☆50Updated last year
TIGER-AI-Lab / VisualWebInstruct
The official repo for "VisualWebInstruct: Scaling up Multimodal Instruction Data through Web Search" [EMNLP25]
☆34Updated last month
kq-chen / qwen-vl-utils
helper functions for processing and integrating visual language information with Qwen-VL Series Model
☆15Updated last year
KwaiKEG / CogGPT
Unleashing the Power of Cognitive Dynamics on Large Language Models
☆63Updated last year
apple / ml-mia-bench
This repo contains code and data for ICLR 2025 paper MIA-Bench: Towards Better Instruction Following Evaluation of Multimodal LLMs
☆31Updated 7 months ago
Victorwz / MLM_Filter
Official implementation of our paper "Finetuned Multimodal Language Models are High-Quality Image-Text Data Filters".
☆67Updated 6 months ago
HZQ950419 / Math-LLaVA
Code for Math-LLaVA: Bootstrapping Mathematical Reasoning for Multimodal Large Language Models
☆91Updated last year
GAIR-NLP / ReAlign
Reformatted Alignment
☆112Updated last year
TemporaryLoRA / Temp-LoRA
☆116Updated last year
MiroMindAI / MiroTrain
MiroTrain is an efficient and algorithm-first framework for post-training large agentic models.
☆88Updated last month
hewei2001 / ReachQA
[EMNLP 2025] Distill Visual Chart Reasoning Ability from LLMs to MLLMs
☆56Updated 2 months ago
gpt4video / GPT4Video
Offical Code for GPT4Video: A Unified Multimodal Large Language Model for lnstruction-Followed Understanding and Safety-Aware Generation
☆142Updated 11 months ago
OpenGVLab / V2PE
[ArXiv] V2PE: Improving Multimodal Long-Context Capability of Vision-Language Models with Variable Visual Position Encoding
☆57Updated 10 months ago
cofe-ai / FLM-101B
☆12Updated last year
TIGER-AI-Lab / MEGA-Bench
This repo contains the code for "MEGA-Bench Scaling Multimodal Evaluation to over 500 Real-World Tasks" [ICLR 2025]
☆77Updated 3 months ago