Code for running RL experiments on continuing (non-episodic) problems.
☆21Feb 13, 2026Updated 3 months ago
Alternatives and similar repositories for continuing-rl-exps
Users that are interested in continuing-rl-exps are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code of the paper "Universal Morphology Control via Contextual Modulation" at ICML 2023☆14Aug 3, 2023Updated 2 years ago
- The implementation of Scalable Safe Multi-Agent Reinforcement Learning for Multi-Agent System.☆11Sep 8, 2025Updated 8 months ago
- Deep Learning - Visual Representation Learning by solving Jigsaw puzzles using Deep Reinforcement Learning☆10Dec 8, 2016Updated 9 years ago
- ☆11Nov 2, 2021Updated 4 years ago
- Matlab code for: 1. reconstructing CT image by applying back projection, filtered back projection and convolution back projection; 2. max…☆15Sep 26, 2017Updated 8 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A package for pedestrian detection, tracking, and re-identification.☆13Feb 28, 2021Updated 5 years ago
- Learning to Incentivize Other Learning Agents☆36Jun 13, 2022Updated 3 years ago
- ☆14Nov 4, 2022Updated 3 years ago
- Push-to-See: Learning Non-Prehensile Manipulation to Enhance Instance Segmentation via Deep Q-Learning☆13Sep 2, 2022Updated 3 years ago
- Code for "Positional Diffusion: Ordering Unordered Sets with Diffusion Probabilistic Models"☆18Mar 21, 2023Updated 3 years ago
- PyTorch implementation of MATD3☆13Apr 3, 2020Updated 6 years ago
- ☆40May 19, 2025Updated last year
- suPER is a collaborative multi-agent RL algorithm☆14Jun 11, 2024Updated last year
- Policy learning of in-hand manipulation. Proximal policy optimization trains the Allegro hand to learn a stabilizing grasp☆14Feb 5, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Code for "Traffic Signal Cycle Control with Centralized Critic and Decentralized Actors under Varying Intervention Frequencies"☆11Jun 27, 2025Updated 10 months ago
- ☆11May 29, 2025Updated 11 months ago
- High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, T…☆14Feb 2, 2025Updated last year
- Code for the results of the Paper:☆21May 17, 2018Updated 8 years ago
- ☆19Mar 12, 2025Updated last year
- ☆15Dec 13, 2022Updated 3 years ago
- Code exploring the use of reward machines in the context of cooperative multi-agent reinforcement learning.☆14Apr 29, 2023Updated 3 years ago
- ☆17Aug 19, 2024Updated last year
- Official implementation of VLMLight☆32Mar 31, 2026Updated last month
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆16Jan 19, 2024Updated 2 years ago
- Multi-task gradient boosting decision tree☆13Apr 14, 2023Updated 3 years ago
- ByteTrack + ROS 2☆29Apr 17, 2024Updated 2 years ago
- Traffic Signal Control Using Lightweight Transformers: An Offline-to-Online RL Approach☆14May 10, 2024Updated 2 years ago
- ☆17Sep 23, 2022Updated 3 years ago
- Pytorch based BERT, mBART and NMT training☆15Jul 30, 2025Updated 9 months ago
- ☆11Aug 10, 2020Updated 5 years ago
- Update PDEKoopman code to Tensorflow 2☆24Apr 27, 2021Updated 5 years ago
- Context-Aware, Recommender-Powered Visualization Authoring☆22Jul 22, 2020Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆19Aug 13, 2018Updated 7 years ago
- Cooperative Spectrum Sensing based on Deep Recurrent Q-Network☆19Jun 8, 2019Updated 6 years ago
- The code for the article "(\tau,\epsilon)-GREEDY REINFORCEMENT LEARNING FOR ANTI-JAMMING WIRELESS COMMUNICATIONS"☆28Aug 23, 2020Updated 5 years ago
- ☆19Dec 30, 2023Updated 2 years ago
- To verify/test the performance of rectangle-represented grasp detection algorithms, this project builts a joint simulation environment ba…☆19Mar 25, 2025Updated last year
- 基于Transformer的机器翻译系统☆12Jun 28, 2022Updated 3 years ago
- ☆15May 27, 2024Updated last year