Official implementation for ICLR 2025 paper "Distilling Reinforcement Learning Algorithms for In-Context Model-Based Planning"
☆20Mar 5, 2025Updated last year
Alternatives and similar repositories for dicp
Users that are interested in dicp are comparing it to the libraries listed below.
- ☆16May 17, 2024Updated 2 years ago
- Official Repository for "Scaling Multi-Agent Reinforcement Learning with Selective Parameter Sharing" (ICML2021)☆10Oct 26, 2021Updated 4 years ago
- Implementations of Multi-Task and Meta-Learning baselines for the Metaworld benchmark☆38May 20, 2026Updated 3 weeks ago
- ☆15Apr 5, 2023Updated 3 years ago
- ☆14Sep 29, 2025Updated 8 months ago
- 🔥 Datasets and env wrappers for offline safe reinforcement learning☆132Nov 12, 2025Updated 7 months ago
- [ICLR 2025] Robust Gymnasium: A Unified Modular Benchmark for Robust Reinforcement Learning.☆98Mar 20, 2026Updated 2 months ago
- The interface between probabilistic model checking and data-driven policy learning.☆20May 14, 2026Updated last month
- Code and Data for ACL 2023 paper I Spy a Metaphor: Large Language Models and Diffusion Models Co-Create Visual Metaphors☆17Jun 7, 2023Updated 3 years ago
- Automated Design of Agentic Systems☆10Sep 7, 2024Updated last year
- Decision Mamba: Reinforcement Learning via Sequence Modeling with Selective State Spaces☆53Apr 1, 2024Updated 2 years ago
- Onboard code for Project Instinct☆21Jan 16, 2026Updated 5 months ago
- Learning Safety Constraints for Large Language Models (ICML2025)☆35May 25, 2026Updated 3 weeks ago
- 🧙🏻 Code and benchmark for our Findings of ACL 2024 paper - "TimeChara: Evaluating Point-in-Time Character Hallucination of Role-Playing…☆21Dec 20, 2024Updated last year
- Non-official implementation of paper "In-context Reinforcement Learning with Algorithm Distillation"☆12Aug 15, 2024Updated last year
- 📸 Code and Dataset for our ACL 2023 paper: "MPCHAT: Towards Multimodal Persona-Grounded Conversation"☆22Sep 5, 2023Updated 2 years ago
- Transformers are Meta-Reinforcement Learners - International Conference on Machine Learning (ICML) 2022☆68May 8, 2023Updated 3 years ago
- Dynamic Simulation Environments for Reinforcement Learning☆13Apr 17, 2021Updated 5 years ago
- [AAAI 2025 Oral] Official code for "RAT: Adversarial Attacks on Deep Reinforcement Agents for Targeted Behaviors"☆34Feb 15, 2025Updated last year
- Flow RL is a high-performance RL library with flow and diffusion models.☆38Jun 10, 2026Updated last week
- [AAAI-25] Latent Reward: LLM-Empowered Credit Assignment in Episodic Reinforcement Learning.☆34May 29, 2025Updated last year
- ☆10Mar 1, 2025Updated last year
- Official code for Cross-Domain Policy Adaptation by Capturing Representation Mismatch (ICML 2024)☆15Aug 15, 2025Updated 10 months ago
- Benchmarking Tool for Model Predictive Control based stable walking for humanoid robot☆22Nov 6, 2024Updated last year
- Source code for my homepage.☆15Apr 24, 2026Updated last month
- [ICML 2022] Robust Deep Reinforcement Learning through Bootstrapped Opportunistic Curriculum☆12Jul 15, 2022Updated 3 years ago
- The current notebook implements a simple disaggregator for deep-nilmtk models compatible with NILMtk.☆11Jan 14, 2023Updated 3 years ago
- Code for AAAI 2023 paper "Hypernetworks for Zero-shot Transfer in Reinforcement Learning"☆23Apr 26, 2023Updated 3 years ago
- 🔥🔥🔥 Object State Description & Change Detection☆10Apr 6, 2026Updated 2 months ago
- This is for EMNLP 2024 Paper: AppBench: Planning of Multiple APIs from Various APPs for Complex User Instruction☆16Nov 4, 2024Updated last year
- A GitHub repository associated with paper "Learn to Earn: Enabling Coordination Within a Ride-Hailing Fleet"☆10Jun 22, 2020Updated 5 years ago
- ☆10Jul 26, 2024Updated last year
- Learning Fair Policies in Decentralized Cooperative Multi-Agent Reinforcement Learning☆10Nov 14, 2021Updated 4 years ago
- (ACL 2025) Divide-Then-Aggregate: An Efficient Tool Learning Method via Parallel Tool Invocation☆12May 21, 2025Updated last year
- Github repo for NeurIPS 2024 paper "Safe LoRA: the Silver Lining of Reducing Safety Risks when Fine-tuning Large Language Models"☆29Dec 21, 2025Updated 5 months ago
- Meta-RL Model-Based Algorithm☆46Apr 30, 2025Updated last year
- This repository hosts the codebase corresponding to our paper, published at Expert Systems With Applications, titled 'Class-Incremental L…☆14Jun 11, 2024Updated 2 years ago
- This repository contains the pytorch attempts to replicate the results from the recent DeepMind Paper, "On the Effectiveness of Interval …☆10May 27, 2019Updated 7 years ago
- Codes for the paper "HAVEN: Hierarchical Cooperative Multi-Agent Reinforcement Learning with Dual Coordination Mechanism"☆27Oct 22, 2022Updated 3 years ago