AlmondGod / tinyworldsLinks
A minimal implementation of DeepMind's Genie world model
☆1,013Updated this week
Alternatives and similar repositories for tinyworlds
Users that are interested in tinyworlds are comparing it to the libraries listed below
Sorting:
- A Curated List of Awesome Works in World Modeling, Aiming to Serve as a One-stop Resource for Researchers, Practitioners, and Enthusiasts…☆339Updated last week
- This repo contains the code for the paper "Intuitive physics understanding emerges fromself-supervised pretraining on natural videos"☆193Updated 8 months ago
- Cosmos-Reason1 models understand the physical common sense and generate appropriate embodied decisions in natural language through long c…☆772Updated last week
- PyTorch Code for Energy-Based Transformers paper -- generalizable reasoning and scalable learning☆548Updated last week
- Benchmarking physical understanding in generative video models☆214Updated last week
- Native Multimodal Models are World Learners☆1,178Updated this week
- MineWorld: A Real-time interactive world model on Minecraft☆403Updated 3 months ago
- ☆1,073Updated last week
- [ICLR 2025 Oral] Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models☆877Updated 3 months ago
- codes for R-Zero: Self-Evolving Reasoning LLM from Zero Data (https://www.arxiv.org/pdf/2508.05004)☆665Updated last week
- Build your own visual reasoning model☆414Updated last month
- Cosmos-Predict2 is a collection of general-purpose world foundation models for Physical AI that can be fine-tuned into customized world m…☆655Updated last week
- Generate large-scale explorable 3D scenes with high-quality panorama videos from a single image or text prompt.☆546Updated last month
- Automating the Search for Artificial Life with Foundation Models!☆437Updated 2 weeks ago
- code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"☆1,063Updated 7 months ago
- Voyager is an interactive RGBD video generation model conditioned on camera input, and supports real-time 3D reconstruction.☆1,320Updated 2 weeks ago
- Official implementation of Inductive Moment Matching☆561Updated 3 months ago
- [NeurIPS 2025] MMaDA - Open-Sourced Multimodal Large Diffusion Language Models☆1,473Updated 3 weeks ago
- Cosmos-Predict1 is a collection of general-purpose world foundation models for Physical AI that can be fine-tuned into customized world m…☆374Updated 2 months ago
- Pytorch implementation of "Genie: Generative Interactive Environments", Bruce et al. (2024).☆219Updated last year
- NEO Series: Native Vision-Language Models from First Principles☆215Updated 2 weeks ago
- RLP: Reinforcement as a Pretraining Objective☆198Updated last month
- Cosmos-Transfer1 is a world-to-world transfer model designed to bridge the perceptual divide between simulated and real-world environment…☆717Updated last week
- Research code artifacts for Code World Model (CWM) including inference tools, reproducibility, and documentation.☆704Updated last month
- Dream 7B, a large diffusion language model☆1,040Updated last month
- Interactive visualizations of the geometric intuition behind diffusion models.☆837Updated 4 months ago
- documentation for content creation☆227Updated last month
- ☆269Updated last month
- Cosmos-Transfer1-DiffusionRenderer: High-quality video de-lighting and re-lighting based on Cosmos video diffusion framework☆743Updated last month
- Code for the Molmo Vision-Language Model☆793Updated 10 months ago