lamm-mit / Cephalo-Phi-3-Vision-MoELinks
β12Updated last year
Alternatives and similar repositories for Cephalo-Phi-3-Vision-MoE
Users that are interested in Cephalo-Phi-3-Vision-MoE are comparing it to the libraries listed below
Sorting:
- A minimal implementation of LLaVA-style VLM with interleaved image & text & video processing ability.β96Updated 8 months ago
- DPO, but faster πβ44Updated 9 months ago
- Train, tune, and infer Bamba modelβ131Updated 3 months ago
- β27Updated last month
- β56Updated 9 months ago
- Data preparation code for CrystalCoder 7B LLMβ45Updated last year
- My fork os allen AI's OLMo for educational purposes.β30Updated 9 months ago
- PyTorch implementation of models from the Zamba2 series.β184Updated 7 months ago
- β20Updated last year
- Verifiers for LLM Reinforcement Learningβ71Updated 4 months ago
- MatFormer repoβ62Updated 8 months ago
- A repository for research on medium sized language models.β78Updated last year
- Repo hosting codes and materials related to speeding LLMs' inference using token merging.β36Updated last month
- β77Updated 2 weeks ago
- Lightweight toolkit package to train and fine-tune 1.58bit Language modelsβ86Updated 3 months ago
- OLA-VLM: Elevating Visual Perception in Multimodal LLMs with Auxiliary Embedding Distillation, arXiv 2024β61Updated 6 months ago
- [EMNLP 2024] Official PyTorch implementation code for realizing the technical part of Traversal of Layers (TroL) presenting new propagatiβ¦β97Updated last year
- Multi-Layer Key-Value sharing experiments on Pythia modelsβ34Updated last year
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Ayaβ116Updated 3 weeks ago
- Linear Attention Sequence Parallelism (LASP)β86Updated last year
- (WACV 2025 - Oral) Vision-language conversation in 10 languages including English, Chinese, French, Spanish, Russian, Japanese, Arabic, Hβ¦β83Updated last month
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.β34Updated last year
- vLLM adapter for a TGIS-compatible gRPC server.β38Updated this week
- FBI-LLM: Scaling Up Fully Binarized LLMs from Scratch via Autoregressive Distillationβ51Updated last week
- This is the repo for the paper "PANGEA: A FULLY OPEN MULTILINGUAL MULTIMODAL LLM FOR 39 LANGUAGES"β110Updated 2 months ago
- Official implementation of "Gemini in Reasoning: Unveiling Commonsense in Multimodal Large Language Models"β36Updated last year
- Official repository for the paper "SwitchHead: Accelerating Transformers with Mixture-of-Experts Attention"β98Updated 11 months ago
- Data preparation code for Amber 7B LLMβ91Updated last year
- A collection of reproducible inference engine benchmarksβ32Updated 4 months ago
- β51Updated last year