seervideodiffusion/SeerVideoLDM

[ICLR 2024] Seer: Language Instructed Video Prediction with Latent Diffusion Models

☆35

Alternatives and similar repositories for SeerVideoLDM

Users interested in SeerVideoLDM are comparing it to the libraries listed below.

ChenHsing / AID
☆19 · Jun 13, 2024 · Updated 2 years ago
hyunseoklee-ai / ReMoDetect
ReMoDetect: Reward Models Recognize Aligned LLM's Generations (NeurIPS 2024)
☆17 · Nov 15, 2024 · Updated last year
AaronFengZY / HumanCentricToVLA-Survey
From Human Videos to Robot Manipulation: A Survey on Scalable Vision-Language-Action Learning with Human-Centric Data
☆15 · Jun 2, 2026 · Updated last month
xiefan-guo / i4vgen
[arXiv 2024] I4VGen: Image as Free Stepping Stone for Text-to-Video Generation
☆24 · Oct 6, 2024 · Updated last year
jihoontack / SiMT
Meta-Learning with Self-Improving Momentum Target (NeurIPS 2022)
☆23 · Oct 12, 2022 · Updated 3 years ago
a3d-11011 / Adaptive-Affordance-Assembly-with-Dual-Arm-Manipulation
☆16 · Jan 21, 2026 · Updated 6 months ago
AlessioSam / LADiff
The official PyTorch implementation of "The 18th European Conference on Computer Vision" (ECCV 2024) paper Length-Aware Motion Synthesis …
☆19 · Dec 15, 2024 · Updated last year
RayYoh / Hammer
[CVPR 2026] Implementation of HAMMER: Harnessing MLLMs via Cross-Modal Integration for Intention-Driven 3D Affordance Grounding
☆20 · Apr 30, 2026 · Updated 2 months ago
DirtyHarryLYL / Sandwich
Bidirectional Mapping between Action Physical-Semantic Space
☆34 · Sep 7, 2025 · Updated 10 months ago
snap-research / diffusability
Source code for "Improving the Diffusability of Autoencoders" [ICML 2025]
☆21 · Jan 6, 2026 · Updated 6 months ago
cythu / PeBR-R1
☆15 · Apr 20, 2026 · Updated 3 months ago
facebookresearch / modemv2
MoDem-V2 combines the sample efficiency of the original MoDem with conservative exploration in order to quickly and safely learn manipula…
☆25 · Apr 1, 2024 · Updated 2 years ago
MCG-NJU / Video-DC
☆12 · Jul 30, 2025 · Updated 11 months ago
video-language-planning / vlp_code
☆82 · May 23, 2025 · Updated last year
alpargun / SlowFast
PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.
☆12 · Jul 26, 2024 · Updated 2 years ago
AIGeeksGroup / UniVid
UniVid: The Open-Source Unified Video Model
☆32 · Oct 13, 2025 · Updated 9 months ago
DAVEISHAN / TimeBalance
Placeholder
☆10 · Jul 17, 2023 · Updated 3 years ago
facebookarchive / NACS
Jump to better conclusions: SCAN both left and right
☆11 · Jan 24, 2019 · Updated 7 years ago
yijiangh / pybullet_planning_tutorials
Tutorials for using pybullet_planning
☆32 · Jun 8, 2020 · Updated 6 years ago
flow-diffusion / AVDC
Official repository of Learning to Act from Actionless Videos through Dense Correspondences.
☆262 · Apr 25, 2024 · Updated 2 years ago
world-action-verifier / wav_robot
☆17 · Apr 16, 2026 · Updated 3 months ago
DAG-Diff / dual-arm-grasp-diffusion
[ICRA 2026] Official codebase for DAGDiff: Guiding Dual-Arm Grasp Diffusion to Stable and Collision-Free Grasps
☆20 · Feb 1, 2026 · Updated 5 months ago
Stanford-TML / extrinsic_manipulation
☆22 · Apr 8, 2024 · Updated 2 years ago
ymxlzgy / DA2
[RA-L+IROS'22] Tools for DA2 dataset.
☆23 · Sep 21, 2022 · Updated 3 years ago
Dev-Mrha / DualPriorsCorrection
☆14 · Oct 17, 2024 · Updated last year
aim-uofa / MovieDreamer
[ICLR'25] MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequences
☆323 · Aug 10, 2024 · Updated last year
happyhappy-jun / writing-driven-autoresearch
Multi-agent harness + complete run record of the 1st-place entry at Ralphthon@ICML2026: three AI agents wrote a workshop paper in 3 hour…
☆17 · Jul 14, 2026 · Updated 2 weeks ago
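Mereithhh / TomotoClock
A Pomodoro timer built for e-ink devices. Ready to use out of the box, with features such as recording and visualizing past study time, planned countdowns, configurable durations, and local save/load.
☆15 · Apr 18, 2021 · Updated 5 years ago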
NVlabs / pointbridge
The code is meant to accompany our paper Point Bridge, which focuses on sim-to-real transfer using 3D key point based representations.
☆20 · Mar 6, 2026 · Updated 4 months ago
anishhdiwan / near
Imitation Learning from Observation Through Generative Modelling
☆27 · Feb 12, 2025 · Updated last year
bbuing9 / DND
Code for the paper "What Makes Better Augmentation Strategies? Augment Difficult but Not too Different" (ICLR 22)
☆12 · Aug 28, 2023 · Updated 2 years ago
sihyun-yu / PVDM
[CVPR'23] Video Probabilistic Diffusion Models in Projected Latent Space
☆322 · May 14, 2024 · Updated 2 years ago
m-muaz / Finetune-SVD
Fine tune stable video diffusion.
☆27 · Dec 29, 2023 · Updated 2 years ago
kvablack / susie
Code for subgoal synthesis via image editing
☆159 · Oct 23, 2023 · Updated 2 years ago
nilesh2797 / ELIAS
Official codebase for NeurIPS 2022 paper End-to-end Learning to Index and Search in Large Output Spaces
☆12 · Apr 19, 2023 · Updated 3 years ago
NVlabs / CMD
[ICLR'24] Efficient Video Diffusion Models via Content-Frame Motion-Latent Decomposition
☆54 · May 14, 2024 · Updated 2 years ago
cvlab-columbia / dreamitate
Dreamitate: Real-World Visuomotor Policy Learning via Video Generation (CoRL 2024)
☆59 · Jun 7, 2025 · Updated last year
engelen / vonmiseskde
Python Von Mises Kernel Density Estimator implementation
☆11 · Jun 15, 2017 · Updated 9 years ago
GaTech-RL2 / mimiclabs
MimicLabs: A Scalable Data Collection & Generation Pipeline for Table-top Manipulation
☆44 · Mar 13, 2026 · Updated 4 months ago