Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos
☆1,644Sep 3, 2025Updated 5 months ago
Alternatives and similar repositories for Video-Pre-Training
Users that are interested in Video-Pre-Training are comparing it to the libraries listed below
Sorting:
- MineRL Competition for Sample Efficient Reinforcement Learning - Python Package☆920Jan 22, 2025Updated last year
- Building Open-Ended Embodied Agents with Internet-Scale Knowledge☆2,152Mar 18, 2024Updated last year
- Foundation Model for MineDojo☆293Apr 2, 2023Updated 2 years ago
- Simple behavioural cloning baseline solution for BASALT 2022☆32Nov 3, 2022Updated 3 years ago
- STEVE-1: A Generative Model for Text-to-Behavior in Minecraft☆204Jun 4, 2024Updated last year
- Transformers are Sample-Efficient World Models. ICLR 2023, notable top 5%.☆869Oct 14, 2024Updated last year
- An Open-Ended Embodied Agent with Large Language Models☆6,688Apr 3, 2024Updated last year
- Mastering Diverse Domains through World Models☆2,830Sep 23, 2025Updated 5 months ago
- GROOT: Learning to Follow Instructions by Watching Gameplay Videos (ICLR'24, Spotlight)☆67Dec 18, 2023Updated 2 years ago
- Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.☆2,770Apr 29, 2024Updated last year
- Benchmarking the Spectrum of Agent Capabilities☆522Jan 23, 2024Updated 2 years ago
- Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021.☆925Dec 20, 2023Updated 2 years ago
- Official implementation of the DECKARD Agent from the paper "Do Embodied Agents Dream of Pixelated Sheep?"☆94May 23, 2023Updated 2 years ago
- An open-source framework for training large multimodal models.☆4,068Aug 31, 2024Updated last year
- Implementation of "Describe, Explain, Plan and Select: Interactive Planning with Large Language Models Enables Open-World Multi-Task Agen…☆290Aug 3, 2023Updated 2 years ago
- Evaluating long-term memory of reinforcement learning algorithms☆164Jun 23, 2023Updated 2 years ago
- BASALT Benchmark datasets, evaluation code and agent training example.☆22Nov 29, 2023Updated 2 years ago
- GLIDE: a diffusion-based text-conditional image synthesis model☆3,685Mar 8, 2024Updated last year
- Code for the paper "Batch size invariance for policy optimization"☆56Apr 2, 2023Updated 2 years ago
- LAVIS - A One-stop Library for Language-Vision Intelligence☆11,167Nov 18, 2024Updated last year
- An open source implementation of CLIP.☆13,430Updated this week
- Collections of robotics environments geared towards benchmarking multi-task and meta reinforcement learning☆1,749Jan 20, 2026Updated last month
- official code repo for paper "CogView2: Faster and Better Text-to-Image Generation via Hierarchical Transformers"☆955Aug 3, 2022Updated 3 years ago
- Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities☆22,033Jan 23, 2026Updated last month
- Code for the paper Fine-Tuning Language Models from Human Preferences☆1,378Jul 25, 2023Updated 2 years ago
- Code for "Learning to Model the World with Language." ICML 2024 Oral.☆413Jan 7, 2026Updated last month
- Repo for external large-scale work☆6,543Apr 27, 2024Updated last year
- 🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.☆32,873Updated this week
- A collection of reference environments for offline reinforcement learning☆1,649Nov 18, 2024Updated last year
- Taming Transformers for High-Resolution Image Synthesis☆6,434Jul 30, 2024Updated last year
- CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image☆32,642Feb 18, 2026Updated last week
- code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"☆1,169Nov 9, 2025Updated 3 months ago
- [NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds.☆866Aug 12, 2024Updated last year
- [IROS'25 Oral & NeurIPSw'24] Official implementation of "MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simula…☆104Jun 16, 2025Updated 8 months ago
- Docker containers of baseline agents for the Crafter environment☆30Dec 14, 2021Updated 4 years ago
- Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"☆8,382May 31, 2024Updated last year
- ☆1,591Jun 28, 2022Updated 3 years ago
- General Modules for JAX☆72Feb 21, 2026Updated last week
- A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)☆4,741Jan 8, 2024Updated 2 years ago