thu-ml/vidar

Official repo for vidar and vidarc: video foundation model for robotics.

☆42

Alternatives and similar repositories for vidar

Users who are interested in vidar are comparing it to the repositories listed below.

yaofeng1998 / Vidar
☆31 · Dec 26, 2025 · Updated 6 months ago
thkkk / manibox
ManiBox: Enhancing Spatial Grasping Generalization via Scalable Simulation Data Generation
☆49 · Apr 14, 2025 · Updated last year
EmbodiedFoundation / AnyPos
AnyPos: Automated Task-Agnostic Actions for Bimanual Manipulation
☆38 · Jul 25, 2025 · Updated 11 months ago
thu-ml / embodied-data-toolkit
A toolkit for processing raw embodied data into standardized formats and converting between embodied dataset schemas.
☆20 · Mar 16, 2026 · Updated 4 months ago
thu-ml / Efficient-Diffusion-Alignment
Official Codebase for "Aligning Diffusion Behaviors with Q-functions for Efficient Continuous Control" (NeurIPS 2024)
☆15 · Oct 29, 2024 · Updated last year
thu-ml / CEURL
Official implementation for "PEAC: Unsupervised Pre-training for Cross-Embodiment Reinforcement Learning" (NeurIPS 2024)
☆19 · Oct 13, 2024 · Updated last year
thu-ml / Motus
Official code of Motus: A Unified Latent Action World Model
☆1,212 · Jan 5, 2026 · Updated 6 months ago
HongzheBi / H_RDT
H-RDT: Human Manipulation Enhanced Bimanual Robotic Manipulation
☆154 · Dec 21, 2025 · Updated 7 months ago
ChenDRAG / CEP-energy-guided-diffusion
Official codebase for Exact Energy-Guided Diffusion Sampling via Contrastive Energy Prediction (ICML 2023)
☆55 · Aug 26, 2023 · Updated 2 years ago
Robert-gyj / Ctrl-World
ICLR 2026 Paper: Ctrl-World
☆539 · Apr 8, 2026 · Updated 3 months ago
chandar-lab / semantic-wm
Repository for training action-conditioned latent diffusion world models for robot video generation
☆73 · May 29, 2026 · Updated last month
hjy-u / ETOG
[ICRA 2025] A Parameter-Efficient Tuning Framework for Language-guided Object Grounding and Robot Grasping
☆13 · Feb 7, 2025 · Updated last year
maxreciprocate / offline
Offline RL experiments
☆15 · Oct 1, 2022 · Updated 3 years ago
jiayueru / Video2Act
Public implementation of Video2Act: A Dual-System Video Diffusion Policy with Robotic Spatio-Motional Modeling
☆31 · Jun 24, 2026 · Updated 3 weeks ago
return-sleep / Diffusion_based_imaginative_Coordination
☆18 · Jul 21, 2025 · Updated last year
nvidia-cosmos / cosmos-predict2
Cosmos-Predict2 is a collection of general-purpose world foundation models for Physical AI that can be fine-tuned into customized world m…
☆793 · Oct 29, 2025 · Updated 8 months ago
NVlabs / cosmos-policy
Cosmos Policy
☆835 · Jan 23, 2026 · Updated 6 months ago
UMass-Embodied-AGI / TesserAct
ICCV 2025 | TesserAct: Learning 4D Embodied World Models
☆402 · Aug 4, 2025 · Updated 11 months ago
NVIDIA / GR00T-Dreams
DreamGen: Nvidia GEAR Lab's initiative to solve the robotics data problem using world models
☆591 · Oct 24, 2025 · Updated 8 months ago
wangst0181 / pi-StepNFT
☆58 · Mar 8, 2026 · Updated 4 months ago
AgibotTech / Genie-Envisioner-V1
☆564 · Jun 24, 2026 · Updated 3 weeks ago
nakamotoo / dsrl_pi0
Official implementation for pi0 steering via DSRL, Steering Your Diffusion Policy with Latent Space Reinforcement Learning (CoRL 2025)
☆280 · Apr 27, 2026 · Updated 2 months ago
ShuangLI59 / unified_video_action
Official PyTorch Implementation of Unified Video Action Model (RSS 2025)
☆400 · Jul 23, 2025 · Updated last year
thkkk / FCNet
Fourier Controller Networks (FCNet) for Real-Time Decision-Making in Embodied Learning, ICML 2024
☆32 · Jan 2, 2025 · Updated last year
InternRobotics / VLAC
VLAC: A Vision-Language-Action-Critic Model for Robotic Real-World Reinforcement Learning
☆319 · Jul 13, 2026 · Updated last week
ChengshuLi / MoMaGen
☆67 · Oct 25, 2025 · Updated 8 months ago
thu-ml / Adaptive-Sparse-Trainer
Official implementation for "Pruning Large Language Models with Semi-Structural Adaptive Sparse Training" (AAAI 2025)
☆19 · Jul 1, 2025 · Updated last year
sii-research / tau-0-wm
☆266 · Jul 2, 2026 · Updated 3 weeks ago
Robbyant / lingbot-va
[RSS 2026] Causal video-action world model for generalist robot control
☆1,652 · Jul 9, 2026 · Updated 2 weeks ago
microsoft / VITRA
[ICRA 2026] VITRA: Scalable Vision-Language-Action Model Pretraining for Robotic Manipulation with Real-Life Human Activity Videos
☆448 · Jun 12, 2026 · Updated last month
dreamzero0 / dreamzero
Code to pretrain, fine-tune, and evaluate DreamZero and run sim & real-world evals
☆2,477 · Apr 19, 2026 · Updated 3 months ago
Fsoft-AIC / Lightweight-Language-driven-Grasp-Detection
[IROS 2024] Lightweight Language-driven Grasp Detection using Conditional Consistency Model
☆31 · Aug 14, 2024 · Updated last year
buoyancy99 / large-video-planner
☆255 · Jan 31, 2026 · Updated 5 months ago
tsinghua-fib-lab / WorldArena
WorldArena: A Unified Benchmark for Evaluating Perception and Functional Utility of Embodied World Models
☆245 · Updated this week
thu-ml / ReMoE
[ICLR2025] Codebase for "ReMoE: Fully Differentiable Mixture-of-Experts with ReLU Routing", built on Megatron-LM.
☆118 · Dec 20, 2024 · Updated last year
SII-dannyXSC / Human2Robot
AAAI 2026 Oral
☆18 · Dec 23, 2025 · Updated 7 months ago
cvlab-columbia / videopolicy
☆64 · Mar 3, 2026 · Updated 4 months ago
xdofai / opensarm
☆96 · Jun 23, 2026 · Updated last month
HuiZhang0812 / WeEdit
A Dataset, Benchmark and Glyph-Guided Framework for Text-centric Image Editing
☆20 · Mar 13, 2026 · Updated 4 months ago