lamm-mit / Cephalo-Phi-3-Vision-MoE
☆11Updated 10 months ago
Alternatives and similar repositories for Cephalo-Phi-3-Vision-MoE:
Users who are interested in Cephalo-Phi-3-Vision-MoE are comparing it to the libraries listed below
- A minimal implementation of LLaVA-style VLM with interleaved image & text & video processing ability.☆91Updated 4 months ago
- Train, tune, and run inference with the Bamba model☆88Updated this week
- Repo hosting code and materials related to speeding up LLM inference using token merging.☆36Updated 11 months ago
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data☆21Updated 8 months ago
- DPO, but faster 🚀☆41Updated 4 months ago
- A repository for research on medium-sized language models.☆76Updated 11 months ago
- ☆20Updated 10 months ago
- Block Transformer: Global-to-Local Language Modeling for Fast Inference (NeurIPS 2024)☆152Updated 2 weeks ago
- PyTorch implementation of HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models☆28Updated last year
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's PyTorch Lightning suite.☆33Updated last year
- OpenVLThinker: An Early Exploration to Vision-Language Reasoning via Iterative Self-Improvement☆73Updated last month
- Unofficial implementation of https://arxiv.org/pdf/2407.14679☆44Updated 7 months ago
- RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best…☆44Updated last month
- A collection of reproducible inference engine benchmarks☆24Updated this week
- QuIP quantization☆51Updated last year
- Parameter-efficient finetuning script for Phi-3-vision, the strong multimodal language model by Microsoft.☆58Updated 10 months ago
- [WIP] Better (FP8) attention for Hopper☆30Updated 2 months ago
- ☆63Updated 7 months ago
- Notebooks and scripts that showcase running quantized diffusion models on consumer GPUs☆38Updated 5 months ago
- Repository for the Q-Filters method (https://arxiv.org/pdf/2503.02812)☆30Updated last month
- ☆48Updated last year
- Implementation of Mind Evolution, Evolving Deeper LLM Thinking, from DeepMind☆48Updated 2 months ago
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM☆54Updated last year
- Official PyTorch implementation of Self-emerging Token Labeling☆33Updated last year
- ☆56Updated last week
- Set of scripts to finetune LLMs☆37Updated last year
- https://x.com/BlinkDL_AI/status/1884768989743882276☆27Updated 2 months ago
- Official implementation of ECCV24 paper: POA☆24Updated 8 months ago
- [EMNLP 2024] Official PyTorch implementation code for realizing the technical part of Traversal of Layers (TroL) presenting new propagati…☆96Updated 10 months ago
- Repository for Sparse Finetuning of LLMs via modified version of the MosaicML llmfoundry☆40Updated last year