Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos
☆1,663 · Sep 3, 2025 · Updated 6 months ago
Alternatives and similar repositories for Video-Pre-Training
Users who are interested in Video-Pre-Training are comparing it to the libraries listed below
- MineRL Competition for Sample Efficient Reinforcement Learning - Python Package ☆926 · Jan 22, 2025 · Updated last year
- Building Open-Ended Embodied Agents with Internet-Scale Knowledge ☆2,166 · Mar 18, 2024 · Updated 2 years ago
- Foundation Model for MineDojo ☆297 · Apr 2, 2023 · Updated 2 years ago
- Simple behavioural cloning baseline solution for BASALT 2022 ☆33 · Nov 3, 2022 · Updated 3 years ago
- STEVE-1: A Generative Model for Text-to-Behavior in Minecraft ☆204 · Jun 4, 2024 · Updated last year
- An Open-Ended Embodied Agent with Large Language Models ☆6,744 · Apr 3, 2024 · Updated last year
- Transformers are Sample-Efficient World Models. ICLR 2023, notable top 5%. ☆871 · Oct 14, 2024 · Updated last year
- Mastering Diverse Domains through World Models ☆2,928 · Sep 23, 2025 · Updated 5 months ago
- BASALT Benchmark datasets, evaluation code and agent training example. ☆22 · Nov 29, 2023 · Updated 2 years ago
- Official implementation of the DECKARD Agent from the paper "Do Embodied Agents Dream of Pixelated Sheep?" ☆94 · May 23, 2023 · Updated 2 years ago
- GROOT: Learning to Follow Instructions by Watching Gameplay Videos (ICLR'24, Spotlight) ☆67 · Dec 18, 2023 · Updated 2 years ago
- Benchmarking the Spectrum of Agent Capabilities ☆528 · Jan 23, 2024 · Updated 2 years ago
- Implementation of "Describe, Explain, Plan and Select: Interactive Planning with Large Language Models Enables Open-World Multi-Task Agen… ☆293 · Aug 3, 2023 · Updated 2 years ago
- Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling. ☆2,777 · Apr 29, 2024 · Updated last year
- Evaluating long-term memory of reinforcement learning algorithms ☆165 · Jun 23, 2023 · Updated 2 years ago
- Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021. ☆926 · Dec 20, 2023 · Updated 2 years ago
- [IROS'25 Oral & NeurIPSw'24] Official implementation of "MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simula… ☆104 · Jun 16, 2025 · Updated 9 months ago
- An open-source framework for training large multimodal models. ☆4,076 · Aug 31, 2024 · Updated last year
- [NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds. ☆867 · Aug 12, 2024 · Updated last year
- ☆1,592 · Jun 28, 2022 · Updated 3 years ago
- Code for the paper "Batch size invariance for policy optimization" ☆59 · Apr 2, 2023 · Updated 2 years ago
- Collections of robotics environments geared towards benchmarking multi-task and meta reinforcement learning ☆1,771 · Jan 20, 2026 · Updated 2 months ago
- JARVIS-1: Open-world Multi-task Agents with Memory-Augmented Multimodal Language Models ☆390 · Apr 8, 2024 · Updated last year
- General Modules for JAX ☆72 · Feb 21, 2026 · Updated last month
- Code for "Learning to Model the World with Language." ICML 2024 Oral. ☆414 · Jan 7, 2026 · Updated 2 months ago
- Codebase for the solution that won first place and was awarded the most human-like agent in the 2021 NeurIPS Competition MineRL BASALT Ch… ☆51 · Nov 21, 2025 · Updated 4 months ago
- IEEE CoG & NeurIPS workshop paper 'Counter-Strike Deathmatch with Large-Scale Behavioural Cloning' ☆483 · Oct 29, 2024 · Updated last year
- Docker containers of baseline agents for the Crafter environment ☆30 · Dec 14, 2021 · Updated 4 years ago
- A collection of baselines for the MineRL environment/datasets & the NeurIPS 2021 MineRL competitions ☆149 · Jul 21, 2021 · Updated 4 years ago
- Ghost in the Minecraft: Generally Capable Agents for Open-World Environments via Large Language Models with Text-based Knowledge and Memo… ☆638 · Jun 5, 2023 · Updated 2 years ago
- LAVIS - A One-stop Library for Language-Vision Intelligence ☆11,189 · Nov 18, 2024 · Updated last year
- Code for the paper Fine-Tuning Language Models from Human Preferences ☆1,381 · Jul 25, 2023 · Updated 2 years ago
- code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion" ☆1,193 · Nov 9, 2025 · Updated 4 months ago
- Standardized Minecraft Diamond Environment for Reinforcement Learning ☆37 · May 19, 2023 · Updated 2 years ago
- GLIDE: a diffusion-based text-conditional image synthesis model ☆3,689 · Mar 8, 2024 · Updated 2 years ago
- A collection of reference environments for offline reinforcement learning ☆1,662 · Nov 18, 2024 · Updated last year
- C++-based high-performance parallel environment execution engine (vectorized env) for general RL environments. ☆1,279 · Aug 12, 2024 · Updated last year
- Using advances in generative modeling to learn reward functions from unlabeled videos. ☆142 · Feb 12, 2024 · Updated 2 years ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL ☆122 · Aug 22, 2024 · Updated last year