lucasjinreal/Namo-R1

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/lucasjinreal/Namo-R1)

lucasjinreal / Namo-R1

A CPU Realtime VLM in 500M. Surpassed Moondream2 and SmolVLM. Training from scratch with ease.

☆256

Alternatives and similar repositories for Namo-R1

Users that are interested in Namo-R1 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

lucasjinreal / Namors
View on GitHub
Rust standalone inference of Namo-500M series models. Extremly tiny, runing VLM on CPU.
☆24Mar 12, 2025Updated last year
lucasjinreal / LLaVA-Magvit2
View on GitHub
LLaVA combines with Magvit Image tokenizer, training MLLM without an Vision Encoder. Unifying image understanding and generation.
☆38Jun 20, 2024Updated 2 years ago
lucasjinreal / ImageTokenizer
View on GitHub
imagetokenizer is a python package, helps you encoder visuals and generate visuals token ids from codebook, supports both image and video…
☆40Jun 22, 2024Updated 2 years ago
x-cls / superclass
View on GitHub
[NeurIPS 2024] Classification Done Right for Vision-Language Pre-Training
☆223Mar 20, 2025Updated last year
MonolithFoundation / Bumblebee
View on GitHub
A Simple MLLM Surpassed QwenVL-Max with OpenSource Data Only in 14B LLM.
☆38Sep 9, 2024Updated last year
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
xuliu-cyber / RSUniVLM
View on GitHub
☆47Apr 16, 2026Updated 3 months ago
antgroup / OmniBench
View on GitHub
[ICML 2025 Oral] This is the official repository of the paper "What Limits Virtual Agent Application? OmniBench: A Scalable Multi-Dimensi…
☆22Jun 12, 2025Updated last year
rvp-group / srrg2-hipe
View on GitHub
Hierarchical Initialization for Pose Graphs
☆31Nov 18, 2021Updated 4 years ago
yujunhuics / Reyes
View on GitHub
2025.01：从零到一实现了一个多模态大模型，并命名为Reyes（睿视），R：睿，eyes：眼。Reyes的参数量为8B，视觉编码器使用的是InternViT-300M-448px-V2_5,语言模型侧使用的是Qwen2.5-7B-Instruct，Reyes也通过一个两…
☆34Feb 10, 2026Updated 5 months ago
ouyanghaodong / DEYOv1.5
View on GitHub
DEYOv1.5
☆29Jul 22, 2024Updated 2 years ago
mbzuai-oryx / LlamaV-o1
View on GitHub
[ACL 2025 🔥] Rethinking Step-by-step Visual Reasoning in LLMs
☆307May 21, 2025Updated last year
symao / viola
View on GitHub
VIOLA(Vision-Imu-Odometry LibrAry) is a versatile C++ library for vision/robotics system. We try to build it as a common basic library fo…
☆85Jul 29, 2023Updated 3 years ago
om-ai-lab / VLM-R1
View on GitHub
Solve Visual Understanding with Reinforced VLMs
☆6,018Jul 7, 2026Updated 3 weeks ago
xushilin1 / RMP-SAM
View on GitHub
[ICLR 2025 oral] RMP-SAM: Towards Real-Time Multi-Purpose Segment Anything
☆271Apr 11, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ATH-MaaS / Ovis
View on GitHub
A novel Multimodal Large Language Model (MLLM) architecture, designed to structurally align visual and textual embeddings.
☆1,476Jul 15, 2026Updated 2 weeks ago
Meituan-AutoML / MobileVLM
View on GitHub
Strong and Open Vision Language Assistant for Mobile Devices
☆1,366Apr 15, 2024Updated 2 years ago
snakehaihai / MCVIO_CPU_only
View on GitHub
☆15Dec 31, 2024Updated last year
boschresearch / RelationField
View on GitHub
[CVPR 2025] RelationField: Relate Anything in Radiance Fields
☆88Mar 20, 2025Updated last year
NIneeeeeem / LangDC
View on GitHub
[EMNLP 2025 Oral] Official codebase for Seeing More, Saying More: Lightweight Language Experts are Dynamic Video Token Compressors.
☆18Sep 7, 2025Updated 10 months ago
EvolvingLMMs-Lab / open-r1-multimodal
View on GitHub
A fork to add multimodal model training to open-r1
☆1,594Feb 8, 2025Updated last year
MetabrainAGI / Awaker2.5-VL
View on GitHub
☆35Jan 21, 2025Updated last year
zai-org / GLM-Edge
View on GitHub
GLM Series Edge Models
☆163Jun 12, 2025Updated last year
lucasjinreal / MLLM_Factory
View on GitHub
A Dead Simple and Modularized Multi-Modal Training and Finetune Framework. Compatible to any LLaVA/Flamingo/QwenVL/MiniGemini etc series …
☆19Apr 24, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
FreedomIntelligence / FastLLM
View on GitHub
Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];
☆41Jan 4, 2024Updated 2 years ago
Beckschen / LLaVolta
View on GitHub
[NeurIPS 2024] Efficient Large Multi-modal Models via Visual Context Compression
☆66Feb 19, 2025Updated last year
LimHyungTae / graph_slam_tutorial
View on GitHub
Graph SLAM의 모든것 (Korean)
☆13Jan 22, 2020Updated 6 years ago
RifleZhang / LLaVA-Reasoner-DPO
View on GitHub
☆116Jan 8, 2025Updated last year
BAAI-DCAI / Bunny
View on GitHub
A family of lightweight multimodal models.
☆1,053Nov 18, 2024Updated last year
StarsfieldAI / R1-V
View on GitHub
Witness the aha moment of VLM with less than $3.
☆4,064May 19, 2025Updated last year
mucahitayhan / map_visualizer
View on GitHub
ROS2 package to visualize .pcd file and .osm file (openstreetmap) and to convert .osm file to occupancy map
☆16Aug 3, 2023Updated 2 years ago
Academic-Hammer / HammerLLM
View on GitHub
1.4B sLLM for Chinese and English - HammerLLM🔨
☆44Apr 7, 2024Updated 2 years ago
2404589803 / hf_downloader
View on GitHub
🤗 HF Downloader (Hugging Face Downloader) 📦 A user-friendly GUI tool for downloading Hugging Face resources with enhanced connectivity…
☆13Jan 5, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
ShuyiZhou495 / INF
View on GitHub
INF: Implicit Neural Fusion for LiDAR and Camera　(IROS2023)
☆47Aug 29, 2023Updated 2 years ago
Arhosseini77 / ADDNN_2023
View on GitHub
Analyse and Design Deep Neural Network, Dr.Kalhor, University of Tehran
☆11Feb 18, 2024Updated 2 years ago
kanghuazhao / slim
View on GitHub
Semantic base LiDAR-inertial Mapping
☆19Mar 2, 2023Updated 3 years ago
Victorwz / Open-Qwen2VL
View on GitHub
[COLM 2025] Open-Qwen2VL: Compute-Efficient Pre-Training of Fully-Open Multimodal LLMs on Academic Resources
☆314Aug 25, 2025Updated 11 months ago
turningpoint-ai / VisualThinker-R1-Zero
View on GitHub
Explore the Multimodal “Aha Moment” on 2B Model
☆624Mar 18, 2025Updated last year
andimarafioti / florence2-finetuning
View on GitHub
Quick exploration into fine tuning florence 2
☆340Sep 19, 2024Updated last year
idsia-robotics / Collaborative-Monte-Carlo-Localization
View on GitHub
Collaborative MCL
☆19Sep 3, 2025Updated 10 months ago