The simplest, fastest repository for training/finetuning small-sized VLMs.
☆4,843Oct 27, 2025Updated 6 months ago
Alternatives and similar repositories for nanoVLM
Users that are interested in nanoVLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Minimalistic 4D-parallelism distributed training framework for education purpose☆2,171Aug 26, 2025Updated 8 months ago
- Implementing DeepSeek R1's GRPO algorithm from scratch☆1,843Apr 18, 2025Updated last year
- Everything about the SmolLM and SmolVLM family of models☆3,755Apr 2, 2026Updated last month
- A PyTorch native platform for training generative AI models☆5,286Updated this week
- Fully open reproduction of DeepSeek-R1☆26,013Apr 2, 2026Updated last month
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- NanoGPT (124M) in 90 seconds☆5,157Updated this week
- Minimal reproduction of DeepSeek R1-Zero☆13,079Feb 27, 2026Updated 2 months ago
- State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!☆2,260Apr 13, 2026Updated 3 weeks ago
- Minimalistic large language model 3D-parallelism training☆2,674Apr 7, 2026Updated 3 weeks ago
- Official repository for our work on micro-budget training of large-scale diffusion models.☆1,568Jan 12, 2025Updated last year
- Train transformer language models with reinforcement learning.☆18,193Updated this week
- This repository contains the official implementation of "FastVLM: Efficient Vision Encoding for Vision Language Models" - CVPR 2025☆7,327May 5, 2025Updated 11 months ago
- Web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.☆63,070Apr 27, 2026Updated last week
- Efficient Triton Kernels for LLM Training☆6,315Apr 27, 2026Updated last week
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Nano vLLM