PyTorch implementation of AREL
☆16Dec 20, 2021Updated 4 years ago
Alternatives and similar repositories for AREL
Users that are interested in AREL are comparing it to the libraries listed below.
- The code for paper 'STAS: Spatial-Temporal Return Decomposition for Multi-agent Reinforcement Learning'☆17Oct 6, 2024Updated last year
- ☆22Jul 15, 2020Updated 5 years ago
- This is the code for Q-value Path Decomposition for Deep Multiagent Reinforcement Learning (NeurIPS 2019).☆12May 20, 2019Updated 6 years ago
- ☆14Sep 27, 2019Updated 6 years ago
- ☆11Oct 26, 2022Updated 3 years ago
- ☆25Feb 21, 2022Updated 4 years ago
- Author's PyTorch implementation of ICML'23 paper "Policy Regularization with Dataset Constraint for Offline Reinforcement Learning" for D…☆18Nov 8, 2024Updated last year
- This repo contains PPO implementation in PyTorch for LunarLander-v2☆11Jun 26, 2020Updated 5 years ago
- ☆12Jan 4, 2024Updated 2 years ago
- [AAAI-25] Latent Reward: LLM-Empowered Credit Assignment in Episodic Reinforcement Learning.☆32May 29, 2025Updated 11 months ago
- This repository is the implementation of the paper "Beating Atari with Natural Language Guided Reinforcement Learning"☆11Nov 25, 2018Updated 7 years ago
- ☆11Mar 14, 2023Updated 3 years ago
- DreamSmooth: Improving Model-Based RL with Reward Smoothing (ICLR 2024)☆12May 6, 2024Updated 2 years ago
- ☆30Aug 20, 2021Updated 4 years ago
- An improved version of EOI on Starcraft II task so_many_baneling. (The Emergence of Individuality)☆17Oct 28, 2021Updated 4 years ago
- Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning (NeurIPS 2020)☆39Oct 27, 2020Updated 5 years ago
- ☆12Jun 17, 2022Updated 3 years ago
- Implementation code for GraphMIX: Graph Convolutional Value Decomposition in Multi-Agent Reinforcement Learning☆36Feb 13, 2021Updated 5 years ago
- Code for Sibling Rivalry and experiments presented in associated paper☆18May 1, 2025Updated last year
- ☆18Jul 14, 2023Updated 2 years ago
- picmeup☆12Nov 23, 2020Updated 5 years ago
- PyTorch code for "Learning Guidance Rewards with Trajectory-space Smoothing" (NeurIPS 2020)☆12Jul 7, 2021Updated 4 years ago
- Model-Based Uncertainty in Value Functions (AISTATS2023)☆16Feb 28, 2023Updated 3 years ago
- Light Seeking and Obstacle Avoiding Robot☆10Feb 7, 2017Updated 9 years ago
- [NeurIPS 2024] Code for Federated Ensemble-Directed Offline Reinforcement Learning☆27Sep 25, 2024Updated last year
- Safe Reinforcement Learning with Natural Language Constraints☆15Oct 24, 2021Updated 4 years ago
- Code for ICLR 2024 paper "When should we prefer Decision Transformers for Offline Reinforcement Learning?"☆17Jan 31, 2024Updated 2 years ago
- The Emergence of Individuality☆13Oct 16, 2021Updated 4 years ago
- Learning Task Embeddings for Teamwork Adaptation in Multi-Agent Reinforcement Learning☆15Apr 25, 2024Updated 2 years ago
- Instant messaging (IM) with support for sending text, voice, images, short videos, locations, red envelopes, contact cards...☆18Feb 27, 2016Updated 10 years ago
- Courbariaux, Matthieu, Yoshua Bengio, and Jean-Pierre David. "Binaryconnect: Training deep neural networks with binary weights during pro…☆12Aug 31, 2020Updated 5 years ago
- pytorch☆14Dec 11, 2020Updated 5 years ago
- A lightweight JSON Parser for Arduino devices☆33Mar 28, 2015Updated 11 years ago
- Collective training of neural networks on distributed datasets.☆20Feb 6, 2023Updated 3 years ago
- ☆17Nov 29, 2022Updated 3 years ago
- PyTorch implementation of the Q-Learning Algorithm Normalized Advantage Function for continuous control problems + PER and N-step Method☆28Feb 16, 2021Updated 5 years ago
- ☆21Jan 17, 2022Updated 4 years ago
- Bulk and single-cell Multi-Omics ground truth Simulator in R☆12Feb 10, 2026Updated 2 months ago
- Official pytorch implementation for our ICLR 2023 paper "Latent State Marginalization as a Low-cost Approach for Improving Exploration".☆24Feb 9, 2023Updated 3 years ago