buoyancy99 / research-template
An ML research template with good documentation by Boyuan Chen, an MIT PhD student
☆62Updated last week
Alternatives and similar repositories for research-template:
Users that are interested in research-template are comparing it to the libraries listed below
- ☆117Updated 2 months ago
- Official repository for "iVideoGPT: Interactive VideoGPTs are Scalable World Models" (NeurIPS 2024), https://arxiv.org/abs/2405.15223☆121Updated last week
- Code for paper "Grounding Video Models to Actions through Goal Conditioned Exploration".☆42Updated 2 months ago
- Official implementation of "Self-Improving Video Generation"☆60Updated last week
- ☆64Updated 6 months ago
- ☆93Updated 6 months ago
- [ICML 2024] Compositional Image Decomposition with Diffusion Models☆49Updated 8 months ago
- ☆73Updated 6 months ago
- Pytorch implementation of "Genie: Generative Interactive Environments", Bruce et al. (2024).☆138Updated 6 months ago
- Code release for NeurIPS 2023 paper SlotDiffusion: Object-centric Learning with Diffusion Models☆82Updated last year
- ElasticTok: Adaptive Tokenization for Image and Video☆60Updated 4 months ago
- PhysGame Benchmark for Physical Commonsense Evaluation in Gameplay Videos☆40Updated last month
- [ICLR 2025] LAPA: Latent Action Pretraining from Videos☆177Updated last month
- Benchmarking physical understanding in generative video models☆124Updated 2 weeks ago
- Code release for "Pre-training Contextualized World Models with In-the-wild Videos for Reinforcement Learning" (NeurIPS 2023), https://ar…☆58Updated 5 months ago
- Code for FLIP: Flow-Centric Generative Planning for General-Purpose Manipulation Tasks☆45Updated 3 months ago
- List of papers on video-centric robot learning☆14Updated 3 months ago
- EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation☆93Updated 4 months ago
- GRAPE: Guided-Reinforced Vision-Language-Action Preference Optimization☆94Updated last month
- An open-source lightweight game generation paradigm. It includes everything from data processing to model architecture design and playabi…☆77Updated 2 months ago
- HandsOnVLM: Vision-Language Models for Hand-Object Interaction Prediction☆25Updated 2 months ago
- This is the official implementation of Video Generation part of This&That: Language-Gesture Controlled Video Generation for Robot Plannin…☆34Updated last month
- ☆43Updated last month
- This repository is a collection of research papers on World Models.☆37Updated last year
- The code and data of Paper: Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation☆90Updated 4 months ago