lamm-mit / Cephalo-Phi-3-Vision-MoELinks
☆12Updated last year
Alternatives and similar repositories for Cephalo-Phi-3-Vision-MoE
Users that are interested in Cephalo-Phi-3-Vision-MoE are comparing it to the libraries listed below
Sorting:
- A minimal implementation of LLaVA-style VLM with interleaved image & text & video processing ability.☆94Updated 7 months ago
- Lightweight toolkit package to train and fine-tune 1.58bit Language models☆83Updated 2 months ago
- Train, tune, and infer Bamba model☆131Updated 2 months ago
- OLA-VLM: Elevating Visual Perception in Multimodal LLMs with Auxiliary Embedding Distillation, arXiv 2024☆60Updated 5 months ago
- Repo hosting codes and materials related to speeding LLMs' inference using token merging.☆36Updated 3 weeks ago
- DPO, but faster 🚀☆44Updated 8 months ago
- (WACV 2025 - Oral) Vision-language conversation in 10 languages including English, Chinese, French, Spanish, Russian, Japanese, Arabic, H…☆83Updated last week
- ☆78Updated 9 months ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆34Updated last year
- This is the repo for the paper "PANGEA: A FULLY OPEN MULTILINGUAL MULTIMODAL LLM FOR 39 LANGUAGES"☆110Updated last month
- [EMNLP 2024] Official PyTorch implementation code for realizing the technical part of Traversal of Layers (TroL) presenting new propagati…☆97Updated last year
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆24Updated 3 weeks ago
- ☆17Updated last year
- ☆75Updated 3 months ago
- a family of highly capabale yet efficient large multimodal models☆187Updated 11 months ago
- ☆56Updated 8 months ago
- Data preparation code for CrystalCoder 7B LLM☆45Updated last year
- Verifiers for LLM Reinforcement Learning☆69Updated 3 months ago
- Improving Text Embedding of Language Models Using Contrastive Fine-tuning☆64Updated last year
- Implementation of the premier Text to Video model from OpenAI☆56Updated 9 months ago
- ☆67Updated last year
- Official implementation of "Gemini in Reasoning: Unveiling Commonsense in Multimodal Large Language Models"☆36Updated last year
- FBI-LLM: Scaling Up Fully Binarized LLMs from Scratch via Autoregressive Distillation☆51Updated last year
- ☆51Updated last year
- ☆27Updated last month
- Code for Paper: Harnessing Webpage Uis For Text Rich Visual Understanding☆53Updated 8 months ago
- Official Pytorch Implementation of Self-emerging Token Labeling☆35Updated last year
- Official repository for the paper "SwitchHead: Accelerating Transformers with Mixture-of-Experts Attention"☆98Updated 10 months ago
- PyTorch implementation of models from the Zamba2 series.☆184Updated 6 months ago
- My fork os allen AI's OLMo for educational purposes.☆30Updated 8 months ago