[ICLR 2024] Seer: Language Instructed Video Prediction with Latent Diffusion Models
☆35May 23, 2024Updated last year
Alternatives and similar repositories for SeerVideoLDM
Users that are interested in SeerVideoLDM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆19Jun 13, 2024Updated last year
- Codebase for HiP☆90Dec 15, 2023Updated 2 years ago
- [arXiv 2024] I4VGen: Image as Free Stepping Stone for Text-to-Video Generation☆24Oct 6, 2024Updated last year
- Adapting Self-Supervised Representations as a Latent Space for Efficient Generation☆40Oct 17, 2025Updated 5 months ago
- [NeurIPS 2025] VideoREPA: Learning Physics for Video Generation through Relational Alignment with Foundation Models☆175Mar 6, 2026Updated 2 weeks ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Meta-Learning with Self-Improving Momentum Target (NeurIPS 2022)☆23Oct 12, 2022Updated 3 years ago
- [CVPR2025] "AniMo: Species-Aware Model for Text-Driven Animal Motion Generation"☆45Oct 8, 2025Updated 5 months ago
- The official PyTorch implementation of "The 18th European Conference on Computer Vision" (ECCV 2024) paper Length-Aware Motion Synthesis …☆20Dec 15, 2024Updated last year
- ☆33Feb 9, 2026Updated last month
- Bidirectional Mapping between Action Physical-Semantic Space☆34Sep 7, 2025Updated 6 months ago
- Implementation of Model Tensor Planning in JAX, TMLR 2025 & ICLR 2026.☆26Jun 5, 2025Updated 9 months ago
- ☆30Apr 7, 2024Updated last year
- ChangeIt dataset with more than 2600 hours of video with state-changing actions published at CVPR 2022☆11Mar 23, 2022Updated 4 years ago
- ☆11Jul 30, 2025Updated 7 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Codebase for PRISE: Learning Temporal Action Abstractions as a Sequence Compression Problem☆24Jul 11, 2024Updated last year
- [ICLR 2025] This repo is the official implementation of "The Labyrinth of Links: Navigating the Associative Maze of Multi-modal LLMs".☆13Jan 25, 2025Updated last year
- ☆78May 23, 2025Updated 10 months ago
- PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.☆12Jul 26, 2024Updated last year
- Placeholder☆10Jul 17, 2023Updated 2 years ago
- ORES: Open-vocabulary Responsible Visual Synthesis☆14Dec 12, 2023Updated 2 years ago
- Jump to better conclusions: SCAN both left and right☆11Jan 24, 2019Updated 7 years ago
- 为墨水屏设备打造的番茄钟,开箱即用,具备记录并可视化展示历史学习时长、计划倒计时、设置时间长度、本地保存/读取等功能。☆15Apr 18, 2021Updated 4 years ago
- ☆21Apr 8, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- MoDem-V2 combines the sample efficiency of the original MoDem with conservative exploration in order to quickly and safely learn manipula…☆23Apr 1, 2024Updated last year
- Official repository of Learning to Act from Actionless Videos through Dense Correspondences.☆251Apr 25, 2024Updated last year
- ☆13Oct 17, 2024Updated last year
- ☆267Nov 8, 2025Updated 4 months ago
- [ICLR'25] MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequences☆322Aug 10, 2024Updated last year
- Improvement for Modular Camera based Tactile Sensor, with integrated circuit, optimized illumination, and biomimetic markers.☆16Feb 14, 2024Updated 2 years ago
- ☆58Apr 18, 2025Updated 11 months ago
- Code for the paper "What Makes Better Augmentation Strategies? Augment Difficult but Not too Different" (ICLR 22)☆12Aug 28, 2023Updated 2 years ago
- Fine tune stable video diffusion.☆27Dec 29, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Imitation Learning from Observation Through Generative Modelling☆25Feb 12, 2025Updated last year
- Code for subgoal synthesis via image editing☆148Oct 23, 2023Updated 2 years ago
- Python Von Mises Kernel Density Estimator implementation☆11Jun 15, 2017Updated 8 years ago
- ☆20Mar 16, 2026Updated last week
- Dreamitate: Real-World Visuomotor Policy Learning via Video Generation (CoRL 2024)☆58Jun 7, 2025Updated 9 months ago
- 基于qwenvl微调一个多模态Xray识别的大模型☆21Oct 22, 2024Updated last year
- Responsible Visual Editing☆15Jul 10, 2024Updated last year