Official implementation for ICLR 2025 paper "Distilling Reinforcement Learning Algorithms for In-Context Model-Based Planning"
☆20Mar 5, 2025Updated last year
Alternatives and similar repositories for dicp
Users who are interested in dicp are comparing it to the libraries listed below.
- ☆15May 17, 2024Updated last year
- ☆27Aug 6, 2025Updated 9 months ago
- Official Repository for "Scaling Multi-Agent Reinforcement Learning with Selective Parameter Sharing" (ICML2021)☆10Oct 26, 2021Updated 4 years ago
- Implementations of Multi-Task and Meta-Learning baselines for the Metaworld benchmark☆35Aug 20, 2025Updated 8 months ago
- Next-gen Foundation Model for Embodied AI☆28Apr 7, 2026Updated last month
- ☆14Sep 29, 2025Updated 7 months ago
- Official code for ICML 2024 paper "Learning to Continually Learn with the Bayesian Principle"☆20May 27, 2024Updated last year
- 🔥 Datasets and env wrappers for offline safe reinforcement learning☆132Nov 12, 2025Updated 5 months ago
- The source code of ExFunTube☆10Aug 8, 2025Updated 9 months ago
- [ICLR 2025] Robust Gymnasium: A Unified Modular Benchmark for Robust Reinforcement Learning.☆95Mar 20, 2026Updated last month
- This repository contains the official code for our NeurIPS 2021 publication "Robust Deep Reinforcement Learning through Adversarial Loss…☆33Jan 21, 2022Updated 4 years ago
- Official Repository for our CVPR2024 paper: ESR-NeRF: Emissive Source Reconstruction Using LDR Multi-view Images☆15Jun 13, 2024Updated last year
- The interface between probabilistic model checking and data-driven policy learning.☆19Apr 21, 2026Updated 2 weeks ago
- Go (board game) data structures implemented in various languages, including rules and state-evaluation logic☆19Sep 21, 2023Updated 2 years ago
- Code and Data for ACL 2023 paper I Spy a Metaphor: Large Language Models and Diffusion Models Co-Create Visual Metaphors☆16Jun 7, 2023Updated 2 years ago
- Decision Mamba: Reinforcement Learning via Sequence Modeling with Selective State Spaces☆50Apr 1, 2024Updated 2 years ago
- Learning Safety Constraints for Large Language Models (ICML2025)☆34Aug 4, 2025Updated 9 months ago
- 🧙🏻 Code and benchmark for our Findings of ACL 2024 paper - "TimeChara: Evaluating Point-in-Time Character Hallucination of Role-Playing…☆21Dec 20, 2024Updated last year
- Non-official implementation of paper "In-context Reinforcement Learning with Algorithm Distillation"☆12Aug 15, 2024Updated last year
- 📸 Code and Dataset for our ACL 2023 paper: "MPCHAT: Towards Multimodal Persona-Grounded Conversation"☆22Sep 5, 2023Updated 2 years ago
- The 4th-ranked system in SemEval 2021 Task 4.☆10May 7, 2022Updated 4 years ago
- Official PyTorch implementation of "Text2Chart31: Instruction Tuning for Chart Generation with Automatic Feedback" (EMNLP 2024 Main Oral)☆26Oct 15, 2024Updated last year
- Dynamic Simulation Environments for Reinforcement Learning☆13Apr 17, 2021Updated 5 years ago
- Official repo for arxiv paper "Stem-OB: Generalizable Visual Imitation Learning with Stem-Like Convergent Observation through Diffusion I…☆17Nov 8, 2024Updated last year
- [AAAI 2025 Oral] Official code for "RAT: Adversarial Attacks on Deep Reinforcement Agents for Targeted Behaviors"☆34Feb 15, 2025Updated last year
- [AAAI-25] Latent Reward: LLM-Empowered Credit Assignment in Episodic Reinforcement Learning.☆32May 29, 2025Updated 11 months ago
- Flow RL is a high-performance RL library with flow and diffusion models.☆36Updated this week
- ESPER☆24Mar 29, 2024Updated 2 years ago
- Benchmarking Tool for Model Predictive Control based stable walking for humanoid robot☆21Nov 6, 2024Updated last year
- Source code for my homepage.☆15Apr 24, 2026Updated 2 weeks ago
- [ICML 2022] Robust Deep Reinforcement Learning through Bootstrapped Opportunistic Curriculum☆11Jul 15, 2022Updated 3 years ago
- Code for AAAI 2023 paper "Hypernetworks for Zero-shot Transfer in Reinforcement Learning"☆23Apr 26, 2023Updated 3 years ago
- 🔥🔥🔥 Object State Description & Change Detection☆10Apr 6, 2026Updated last month
- This is for EMNLP 2024 Paper: AppBench: Planning of Multiple APIs from Various APPs for Complex User Instruction☆15Nov 4, 2024Updated last year
- ☆25Sep 23, 2024Updated last year
- ☆10Jul 26, 2024Updated last year
- A GitHub repository associated with paper "Learn to Earn: Enabling Coordination Within a Ride-Hailing Fleet"☆10Jun 22, 2020Updated 5 years ago
- Learning Fair Policies in Decentralized Cooperative Multi-Agent Reinforcement Learning☆10Nov 14, 2021Updated 4 years ago
- This repository hosts the codebase corresponding to our paper, published at Expert Systems With Applications, titled 'Class-Incremental L…☆14Jun 11, 2024Updated last year