qnguyen3 / nanoLLaVALinks
World's Smallest Vision-Language Model
☆27Updated last year
Alternatives and similar repositories for nanoLLaVA
Users that are interested in nanoLLaVA are comparing it to the libraries listed below
Sorting:
- A minimal implementation of LLaVA-style VLM with interleaved image & text & video processing ability.☆93Updated 5 months ago
- Official repository for the paper "NeuZip: Memory-Efficient Training and Inference with Dynamic Compression of Neural Networks". This rep…☆58Updated 7 months ago
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch☆95Updated 5 months ago
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆80Updated last year
- ☆68Updated 11 months ago
- OLA-VLM: Elevating Visual Perception in Multimodal LLMs with Auxiliary Embedding Distillation, arXiv 2024☆59Updated 3 months ago
- This project is a collection of fine-tuning scripts to help researchers fine-tune Qwen 2 VL on HuggingFace datasets.☆70Updated 8 months ago
- ☆61Updated 10 months ago
- ☆36Updated 2 years ago
- ☆92Updated 2 months ago
- Lightweight toolkit package to train and fine-tune 1.58bit Language models☆69Updated 2 weeks ago
- ☆20Updated last year
- minimal GRPO implementation from scratch☆90Updated 2 months ago
- My fork os allen AI's OLMo for educational purposes.☆30Updated 6 months ago
- Data preparation code for CrystalCoder 7B LLM☆44Updated last year
- Parameter-efficient finetuning script for Phi-3-vision, the strong multimodal language model by Microsoft.☆58Updated 11 months ago
- Easy to use, High Performant Knowledge Distillation for LLMs☆85Updated last month
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM☆54Updated last year
- ☆55Updated 6 months ago
- The code repository for the CURLoRA research paper. Stable LLM continual fine-tuning and catastrophic forgetting mitigation.☆44Updated 9 months ago
- Video-LlaVA fine-tune for CinePile evaluation☆51Updated 9 months ago
- Set of scripts to finetune LLMs☆37Updated last year
- A list of language models with permissive licenses such as MIT or Apache 2.0☆24Updated 3 months ago
- ☆47Updated 9 months ago
- OpenVLThinker: An Early Exploration to Vision-Language Reasoning via Iterative Self-Improvement☆88Updated 2 weeks ago
- From scratch implementation of a vision language model in pure PyTorch☆220Updated last year
- ☆63Updated 8 months ago
- (WACV 2025 - Oral) Vision-language conversation in 10 languages including English, Chinese, French, Spanish, Russian, Japanese, Arabic, H…☆84Updated 3 months ago
- ☆24Updated 8 months ago
- ☆101Updated 9 months ago