Alpha-VLLM / WeMix-LLM
☆17Updated last year
Related projects
Alternatives and complementary repositories for WeMix-LLM
- A Framework for Decoupling and Assessing the Capabilities of VLMs☆38Updated 4 months ago
- Repo for Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent☆56Updated this week
- Touchstone: Evaluating Vision-Language Models by Language Models☆78Updated 10 months ago
- ☆35Updated 2 months ago
- Empirical Study Towards Building An Effective Multi-Modal Large Language Model☆23Updated last year
- Converting Mixtral-8x7B to Mixtral-[1~7]x7B☆20Updated 8 months ago
- ☆74Updated 8 months ago
- ☆45Updated last year
- ☆35Updated 5 months ago
- SkyScript-100M: 1,000,000,000 Pairs of Scripts and Shooting Scripts for Short Drama: https://arxiv.org/abs/2408.09333v2☆99Updated this week
- ☆22Updated 3 months ago
- MLLM-Bench: Evaluating Multimodal LLMs with Per-sample Criteria☆55Updated last month
- ☆12Updated 10 months ago
- A Simple MLLM Surpassed QwenVL-Max with OpenSource Data Only in 14B LLM.☆36Updated 2 months ago
- This repo contains evaluation code for the paper "MileBench: Benchmarking MLLMs in Long Context"☆26Updated 4 months ago
- ☆30Updated 6 months ago
- code for Scaling Laws of RoPE-based Extrapolation☆70Updated last year
- Video dataset dedicated to portrait-mode video recognition.☆38Updated 7 months ago
- Hammer: Robust Function-Calling for On-Device Language Models via Function Masking☆33Updated this week
- Recent advancements propelled by large language models (LLMs), encompassing an array of domains including Vision, Audio, Agent, Robotics,…☆113Updated last month
- Sparkles: Unlocking Chats Across Multiple Images for Multimodal Instruction-Following Models☆43Updated 5 months ago
- [2024-ACL]: TextBind: Multi-turn Interleaved Multimodal Instruction-following in the Wild☆48Updated last year
- ☆46Updated 2 months ago
- Official repo for StableLLAVA☆91Updated 11 months ago
- [ICLR 2024] CLEX: Continuous Length Extrapolation for Large Language Models☆73Updated 8 months ago
- [ACL 2024] PCA-Bench: Evaluating Multimodal Large Language Models in Perception-Cognition-Action Chain☆99Updated 8 months ago
- Code for Math-LLaVA: Bootstrapping Mathematical Reasoning for Multimodal Large Language Models☆67Updated 4 months ago
- Implementations of online merging optimizers proposed by Online Merging Optimizers for Boosting Rewards and Mitigating Tax in Alignment☆66Updated 5 months ago
- GUI Odyssey is a comprehensive dataset for training and evaluating cross-app navigation agents. GUI Odyssey consists of 7,735 episodes fr…☆69Updated last week
- SuperCLUE-Math6: Exploring a new-generation native Chinese dataset for multi-turn, multi-step mathematical reasoning☆46Updated 9 months ago