yilundu / ired_code_release
☆60Updated 10 months ago
Alternatives and similar repositories for ired_code_release:
Users that are interested in ired_code_release are comparing it to the libraries listed below
- Efficient World Models with Context-Aware Tokenization. ICML 2024☆98Updated 7 months ago
- Official PyTorch Implementation of the Longhorn Deep State Space Model☆50Updated 4 months ago
- ICML 2022: Learning Iterative Reasoning through Energy Minimization☆46Updated 2 years ago
- Implementation of the new SOTA for model based RL, from the paper "Improving Transformer World Models for Data-Efficient RL", in Pytorch☆112Updated this week
- ☆19Updated last month
- This code accompanies the paper "Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration."☆27Updated 6 months ago
- Code for the paper "Learning Temporal Distances: Contrastive Successor Features Can Provide a Metric Structure for Decision-Making"☆27Updated 9 months ago
- The official implementation of flow Q-learning (FQL)☆136Updated last month
- Code for the paper "Inference via Interpolation: Contrastive Representations Provably Enable Planning and Inference"☆42Updated 9 months ago
- [ICLR 2025] Implementation of "FACTS: A Factored State-Space Framework For World Modelling"☆23Updated last month
- ☆49Updated last year
- Official Implementation for the paper "d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning"☆77Updated last week
- Recall to Imagine, a model-based RL algorithm with superhuman memory. Oral (1.2%) @ ICLR 2024☆65Updated 10 months ago
- [ICLR 2025] Code for the paper "Beyond Autoregression: Discrete Diffusion for Complex Reasoning and Planning"☆47Updated 2 months ago
- JAX reimplementation of the DeepMind paper "Genie: Generative Interactive Environments"☆61Updated 3 months ago
- ☆90Updated 9 months ago
- ☆31Updated 6 months ago
- ☆76Updated 8 months ago
- Codebase for HiP☆89Updated last year
- ☆28Updated 3 weeks ago
- JAX implementation of VQVAE/VQGAN autoencoders (+FSQ)☆28Updated 10 months ago
- Official code for "World Models via Policy-Guided Trajectory Diffusion", TMLR 2024☆62Updated last year
- [ICLR 2025] Official PyTorch implementation of "Forgetting Transformer: Softmax Attention with a Forget Gate"☆95Updated 2 weeks ago
- Code for the paper: "Fine-Tuning Discrete Diffusion Models with Policy Gradient Methods"☆19Updated last month
- ☆27Updated 10 months ago
- ☆143Updated 2 weeks ago
- [ICLR2025] DiffuGPT and DiffuLLaMA: Scaling Diffusion Language Models via Adaptation from Autoregressive Models☆154Updated last month
- Remasking Discrete Diffusion Models with Inference-Time Scaling☆18Updated last month
- Public code for "Reinforcement Learning from Passive Data via Latent Intentions"☆89Updated last year
- ☆31Updated 4 months ago