yilundu / ired_code_release
☆56Updated 8 months ago
Alternatives and similar repositories for ired_code_release:
Users that are interested in ired_code_release are comparing it to the libraries listed below
- Official PyTorch Implementation of the Longhorn Deep State Space Model☆48Updated 3 months ago
- ☆73Updated 6 months ago
- Efficient World Models with Context-Aware Tokenization. ICML 2024☆92Updated 5 months ago
- Codebase for HiP☆88Updated last year
- ICML 2022: Learning Iterative Reasoning through Energy Minimization☆45Updated 2 years ago
- This code accompanies the paper "Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration."☆27Updated 4 months ago
- Code for the paper "Inference via Interpolation: Contrastive Representations Provably Enable Planning and Inference"☆39Updated 7 months ago
- ☆28Updated 2 months ago
- JAX reimplementation of the DeepMind paper "Genie: Generative Interactive Environments"☆55Updated last month
- Code for the paper "VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment"☆126Updated 3 months ago
- ☆26Updated 8 months ago
- [ICLR2025] DiffuGPT and DiffuLLaMA: Scaling Diffusion Language Models via Adaptation from Autoregressive Models☆102Updated last week
- Code release for "Pre-training Contextualized World Models with In-the-wild Videos for Reinforcement Learning" (NeurIPS 2023), https://ar…☆58Updated 5 months ago
- ☆18Updated last week
- JAX implementation of VQVAE/VQGAN autoencoders (+FSQ)☆24Updated 8 months ago
- Stick-breaking attention☆44Updated last month
- Code for LaMPP: Language Models as Probabilistic Priors for Perception and Action☆35Updated last year
- Source codes for the paper "COMBO: Compositional World Models for Embodied Multi-Agent Cooperation"☆28Updated 10 months ago
- Latent Program Network (from the "Searching Latent Program Spaces" paper)☆61Updated 3 months ago
- LogiCity@NeurIPS'24, D&B track. A multi-agent inductive learning environment for "abstractions".☆20Updated 3 months ago
- ☆17Updated 3 months ago
- [ICLR 2025] Code for the paper "Beyond Autoregression: Discrete Diffusion for Complex Reasoning and Planning"☆34Updated 2 weeks ago
- A general framework for inference-time scaling and steering of diffusion models with arbitrary rewards.☆92Updated 2 weeks ago
- ☆37Updated 6 months ago
- ☆51Updated 5 months ago
- Code for the paper "Learning Temporal Distances: Contrastive Successor Features Can Provide a Metric Structure for Decision-Making"☆27Updated 7 months ago
- NF-Layers for constructing neural functionals.☆82Updated last year