Alpha-VLLM / WeMix-LLMLinks
☆17Updated last year
Alternatives and similar repositories for WeMix-LLM
Users that are interested in WeMix-LLM are comparing it to the libraries listed below
Sorting:
- Empirical Study Towards Building An Effective Multi-Modal Large Language Model☆22Updated last year
- ☆29Updated 10 months ago
- ☆36Updated 9 months ago
- Touchstone: Evaluating Vision-Language Models by Language Models☆83Updated last year
- ☆73Updated last year
- A Framework for Decoupling and Assessing the Capabilities of VLMs☆44Updated 11 months ago
- ☆50Updated last year
- Official completion of “Training on the Benchmark Is Not All You Need”.☆34Updated 5 months ago
- Converting Mixtral-8x7B to Mixtral-[1~7]x7B☆22Updated last year
- Our 2nd-gen LMM☆33Updated last year
- LLaVA combines with Magvit Image tokenizer, training MLLM without an Vision Encoder. Unifying image understanding and generation.☆37Updated last year
- ☆48Updated 2 weeks ago
- LMM solved catastrophic forgetting, AAAI2025☆43Updated 2 months ago
- Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM*☆104Updated 3 weeks ago
- code for Scaling Laws of RoPE-based Extrapolation☆73Updated last year
- ☆48Updated last year
- [ArXiv] V2PE: Improving Multimodal Long-Context Capability of Vision-Language Models with Variable Visual Position Encoding☆50Updated 6 months ago
- [ICLR 2024] CLEX: Continuous Length Extrapolation for Large Language Models☆78Updated last year
- ☆37Updated 2 months ago
- Exploring Efficient Fine-Grained Perception of Multimodal Large Language Models☆62Updated 7 months ago
- Official repo for StableLLAVA☆95Updated last year
- This repo contains evaluation code for the paper "MileBench: Benchmarking MLLMs in Long Context"☆35Updated 11 months ago
- Official repository of MMDU dataset☆92Updated 8 months ago
- Recent advancements propelled by large language models (LLMs), encompassing an array of domains including Vision, Audio, Agent, Robotics,…☆122Updated 3 weeks ago
- [2024-ACL]: TextBind: Multi-turn Interleaved Multimodal Instruction-following in the Wildrounded Conversation☆47Updated last year
- ☆12Updated last year
- ☆49Updated 2 months ago
- This repo contains the code for "MEGA-Bench Scaling Multimodal Evaluation to over 500 Real-World Tasks" [ICLR2025]☆68Updated 2 months ago
- ☆64Updated last year
- MM-Interleaved: Interleaved Image-Text Generative Modeling via Multi-modal Feature Synchronizer☆228Updated last year