qnguyen3 / nanoLLaVA
World's Smallest Vision-Language Model
☆24Updated 9 months ago
Alternatives and similar repositories for nanoLLaVA:
Users that are interested in nanoLLaVA are comparing it to the libraries listed below
- ☆65Updated 6 months ago
- ☆35Updated last year
- Official repository for the paper "NeuZip: Memory-Efficient Training and Inference with Dynamic Compression of Neural Networks". This rep…☆46Updated 2 months ago
- XmodelLM☆37Updated last month
- ☆60Updated 3 months ago
- ☆50Updated last month
- ☆57Updated 6 months ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆33Updated 10 months ago
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆79Updated 7 months ago
- Repo hosting codes and materials related to speeding LLMs' inference using token merging.☆34Updated 8 months ago
- A tool to assist in the interpretation of learned features in sparse autoencoders (in particular the four SAE's trained by Joseph Bloom o…☆17Updated 3 months ago
- Train, tune, and infer Bamba model☆75Updated this week
- ☆62Updated 3 months ago
- Code for Paper: Harnessing Webpage Uis For Text Rich Visual Understanding☆44Updated last month
- OLA-VLM: Elevating Perception in Multimodal LLMs with Auxiliary Embedding Distillation, arXiv 2024☆45Updated last month
- ☆18Updated 7 months ago
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch☆81Updated 3 weeks ago
- Code for "Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free"☆38Updated 3 months ago
- The code repository for the CURLoRA research paper. Stable LLM continual fine-tuning and catastrophic forgetting mitigation.☆41Updated 4 months ago
- [WACV 2025] Official implementation of "Online-LoRA: Task-free Online Continual Learning via Low Rank Adaptation" by Xiwen Wei, Guihong L…☆29Updated 2 months ago
- From scratch implementation of a vision language model in pure PyTorch☆189Updated 8 months ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆21Updated last month
- Finetune any model on HF in less than 30 seconds☆56Updated 2 months ago
- Implementation of the Mamba SSM with hf_integration.☆56Updated 4 months ago
- a curated list of the role of small models in the LLM era☆89Updated 3 months ago
- implementation of https://arxiv.org/pdf/2312.09299☆20Updated 6 months ago
- A list of language models with permissive licenses such as MIT or Apache 2.0☆24Updated 2 months ago
- XVERSE-MoE-A36B: A multilingual large language model developed by XVERSE Technology Inc.☆37Updated 4 months ago
- Data preparation code for CrystalCoder 7B LLM☆43Updated 8 months ago
- ☆30Updated 3 months ago