yakhyo / yolov1-resnetLinks

YOLOv1 re-implementation using PyTorch. Backbone is ResNet50.

☆5

Alternatives and similar repositories for yolov1-resnet

Users that are interested in yolov1-resnet are comparing it to the libraries listed below

Sorting:

OpenHelix-robot / awesome-dual-system-vla
A comprehensive list of papers about dual-system VLA models, including papers, codes, and related websites.
☆55Updated last month
RoyZry98 / MoLe-VLA-Pytorch
[Arxiv 2025: MoLe-VLA: Dynamic Layer-skipping Vision Language Action Model via Mixture-of-Layers for Efficient Robot Manipulation]
☆40Updated 3 months ago
intuitive-robots / mdt_policy
[RSS 2024] Code for "Multimodal Diffusion Transformer: Learning Versatile Behavior from Multimodal Goals" for CALVIN experiments with pre…
☆146Updated 9 months ago
m2diffuser / M2Diffuser
Official implementation of T-PAMI25 paper "M²Diffuser: Diffusion-based Trajectory Optimization for Mobile Manipulation in 3D Scenes"
☆63Updated last month
AoqunJin / Awesome-VLA-Post-Training
A collection of vision-language-action model post-training methods.
☆68Updated 2 weeks ago
OpenDriveLab / RoboDual
RoboDual: Dual-System for Robotic Manipulation
☆82Updated 2 weeks ago
liufanfanlff / RoboUniview
☆55Updated 5 months ago
yueyang130 / DeeR-VLA
Official code of paper "DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution"
☆99Updated 5 months ago
OpenHelix-robot / OpenHelix
OpenHelix: An Open-source Dual-System VLA Model for Robotic Manipulation
☆220Updated last month
Fanqi-Lin / OneTwoVLA
Official implementation of "OneTwoVLA: A Unified Vision-Language-Action Model with Adaptive Reasoning"
☆147Updated last month
lmzpai / roboMamba
The repo of paper `RoboMamba: Multimodal State Space Model for Efficient Robot Reasoning and Manipulation`
☆129Updated 6 months ago
RoboDita / Dita
ICCV2025
☆105Updated last week
Robot-VLAs / RoboVLMs
☆375Updated 5 months ago
alibaba-damo-academy / WorldVLA
WorldVLA: Towards Autoregressive Action World Model
☆268Updated 2 weeks ago
OpenHelix-Team / LLaVA-VLA
LLaVA-VLA: A Simple Yet Powerful Vision-Language-Action Model [Actively Maintained🔥]
☆93Updated this week
OpenDriveLab / MPI
[RSS 2024] Learning Manipulation by Predicting Interaction
☆112Updated 2 weeks ago
DelinQu / awesome-vision-language-action-model
Latest Advances on Vison-Language-Action Models.
☆84Updated 4 months ago
liruiw / HPT
Heterogeneous Pre-trained Transformer (HPT) as Scalable Policy Learner.
☆503Updated 7 months ago
2toinf / UniAct
[CVPR 2025] The offical Implementation of "Universal Actions for Enhanced Embodied Foundation Models"
☆185Updated 3 months ago
BAAI-DCAI / SpatialBot
The official repo for "SpatialBot: Precise Spatial Understanding with Vision Language Models.
☆282Updated last month
microsoft / CogACT
A Foundational Vision-Language-Action Model for Synergizing Cognition and Action in Robotic Manipulation
☆303Updated last month
PRIME-RL / SimpleVLA-RL
Online RL with Simple Reward Enables Training VLA Models with Only One Trajectory
☆290Updated last month
PKU-HMI-Lab / Hybrid-VLA
HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model
☆256Updated last month
TencentARC / Moto
[ICCV 2025] Latent Motion Token as the Bridging Language for Robot Manipulation
☆112Updated 2 months ago
JayceWen / tinyvla
☆64Updated 5 months ago
GuanxingLu / vlarl
Single-file implementation to advance vision-language-action (VLA) models with reinforcement learning.
☆168Updated 2 weeks ago
gen-robot / RL4VLA
☆78Updated last month
Zhangwenyao1 / DreamVLA
DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge
☆108Updated this week
OpenDriveLab / CLOVER
[NeurIPS 2024] CLOVER: Closed-Loop Visuomotor Control with Generative Expectation for Robotic Manipulation
☆120Updated 2 weeks ago
ShuangLI59 / unified_video_action
Official PyTorch Implementation of Unified Video Action Model (RSS 2025)
☆238Updated 3 weeks ago