Official Repo of From Masks to Worlds: A Hitchhiker’s Guide to World Models.
☆96Oct 26, 2025Updated 7 months ago
Alternatives and similar repositories for Awesome-World-Models
Users that are interested in Awesome-World-Models are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- CoMA: Compositional Human Motion Generation with Multi-modal Agents☆16Jul 31, 2025Updated 10 months ago
- [arxiv: 2512.19673] Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies☆60Feb 6, 2026Updated 4 months ago
- PyTorch Implementation of Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model☆28Oct 10, 2024Updated last year
- One Discrete Word for Visual Reasoning Overtakes Agentic and Latent Methods☆126Updated this week
- The official implementation of the paper "Large Scale Knowledge Washing"☆10Jun 12, 2024Updated 2 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Resurrect Mask AutoRegressive Modeling for Efficient and Scalable Image Generation.☆15Jul 21, 2025Updated 10 months ago
- ☆35Feb 24, 2026Updated 3 months ago
- Lightweight and Robust Point-Line Monocular Visual Inertial Wheel Odometry (IROS2025)☆37Jun 16, 2025Updated 11 months ago
- Rui Qian, Xin Yin, Chuanhang Deng, et al.: UGround: Towards Unified Visual Grounding with Unrolled Transformers (ICML 2026)☆26Jun 5, 2026Updated last week
- [arXiv 26] FRoM-W1: Towards General Humanoid Whole-Body Control with Language Instructions☆169Jun 5, 2026Updated last week
- The official implement of CTRNet++.☆15Dec 30, 2024Updated last year
- ☆44Jul 9, 2025Updated 11 months ago
- [CVPR'2025] EntitySAM: Segment Everything in Video☆64Jul 13, 2025Updated 11 months ago
- [ICLR 2026] Official implementation for What matters for Representation Alignment: Global Information or Spatial Structure?☆252Dec 15, 2025Updated 5 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- PyTorch implementation of "HERO: Human Reaction Generation from Videos (ICCV 2025)"☆34Mar 27, 2026Updated 2 months ago
- ☆55Jul 16, 2025Updated 10 months ago
- 📝The official repository of "Rethinking Cross-Generator Image Forgery Detection through DINOv3"☆25Dec 2, 2025Updated 6 months ago
- [NeurIPS 2025] Official Implementation for "Enhancing Vision-Language Model Reliability with Uncertainty-Guided Dropout Decoding"☆22Dec 8, 2024Updated last year
- Official repository for “Reasoning in the Dark: Interleaved Vision-Text Reasoning in Latent Space”☆18Jan 27, 2026Updated 4 months ago
- Implementation of "Dual Mixup Regularized Learning for Adversarial Domain Adaptation" in Pytorch☆12Apr 8, 2021Updated 5 years ago
- ☆80Feb 27, 2026Updated 3 months ago
- RealisMotion: Decomposed Human Motion Control and Video Generation in the World Space (ICML2026)☆40May 12, 2026Updated last month
- ☆68Dec 3, 2025Updated 6 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- The repository for papaer "Distance between Relevant Information Pieces Causes Bias in Long-Context LLMs"☆14Dec 16, 2024Updated last year
- ☆13Sep 2, 2023Updated 2 years ago
- ☆99Mar 13, 2026Updated 3 months ago
- [SIGGRAPH Asia 2025] FreeArt3D: Training-Free Articulated Object Generation using 3D Diffusion☆100Apr 8, 2026Updated 2 months ago
- Reproducing the Past: A Dataset for Benchmarking Inscription Restoration (ACM MM'24)☆14Oct 15, 2025Updated 7 months ago
- [ICML 2025] LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models☆18Nov 4, 2025Updated 7 months ago
- Human-centered Delivery Benchmark☆20Jul 24, 2024Updated last year
- [CVPR 2026] STAMP: Better, Stronger, Faster: Tackling the Trilemma in MLLM-based Segmentation with Simultaneous Textual Mask Prediction☆39Feb 21, 2026Updated 3 months ago
- officical code for ECCV 2024 paper "Global-Local Collaborative Inference with LLM for Lidar-Based Open-Vocabulary Detection"☆14Jul 4, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- (CVPR 2025) A Data-Centric Revisit of Pre-Trained Vision Models for Robot Learning☆24Mar 11, 2025Updated last year
- ☆513Sep 2, 2025Updated 9 months ago
- FlowBotHD: History-Aware Diffuser Handling Ambiguities in Articulated Objects Manipulation☆13Dec 13, 2024Updated last year
- Flux training codes (lora) for UniTEX☆24Jun 8, 2025Updated last year
- List of papers on video-centric robot learning☆23Nov 16, 2024Updated last year
- Breed procedurally generated plants based on their DNA☆12Jan 6, 2022Updated 4 years ago
- Companion repository which facilitates the creation of Gradio endpoints which are accessible from within Digital Audio Workstations (DAWs…☆28Updated this week