abhisheknaik96/continuing-rl-exps

Code for running RL experiments on continuing (non-episodic) problems.

☆22

Alternatives and similar repositories for continuing-rl-exps

Users interested in continuing-rl-exps are comparing it to the repositories listed below.

Qeneb / SS-MARL
The implementation of Scalable Safe Multi-Agent Reinforcement Learning for Multi-Agent System.
☆11 · Sep 8, 2025 · Updated 10 months ago
MasterXiong / ModuMorph
Code of the paper "Universal Morphology Control via Contextual Modulation" at ICML 2023
☆15 · Aug 3, 2023 · Updated 2 years ago
harpribot / harpreif
Deep Learning - Visual Representation Learning by solving Jigsaw puzzles using Deep Reinforcement Learning
☆10 · Dec 8, 2016 · Updated 9 years ago
WhiteGrayxp / MARL-based-Dec-Spectrum-Access
☆11 · Nov 2, 2021 · Updated 4 years ago
OscarHuangWind / Preference-Guided-DQN-Atari
[TNNLS] PGDQN: A generalized and efficient preference-guided epsilon-greedy policy equipped DQN for Atari and Autonomous Driving
☆11 · Oct 9, 2023 · Updated 2 years ago
HoEmpire / pedestrian_tracking_and_localizaiton
A package for pedestrian detection, tracking, and re-identification.
☆13 · Feb 28, 2021 · Updated 5 years ago
SiavashBarqiJaniar / DRL_PBHWN
Source code for the papers "Deep-reinforcement learning for fair distributed dynamic spectrum access in wireless networks" and "Deep‐rein…
☆14 · Oct 12, 2022 · Updated 3 years ago
DafaRen / Learning_Bifunctional_Push-grasping_Synergistic_Strategy_for_Goal-agnostic_and_Goal-oriented_Tasks
☆14 · Nov 4, 2022 · Updated 3 years ago
tristan-ka / IBOAT_RL
Deep Q-Network (DQN) and DDPG to address the problem of stall around the wing sail of an autonomous sailing robot
☆11 · Sep 18, 2018 · Updated 7 years ago
ALRhub / push-to-see
Push-to-See: Learning Non-Prehensile Manipulation to Enhance Instance Segmentation via Deep Q-Learning
☆13 · Sep 2, 2022 · Updated 3 years ago
IIT-PAVIS / Positional_Diffusion
Code for "Positional Diffusion: Ordering Unordered Sets with Diffusion Probabilistic Models"
☆18 · Mar 21, 2023 · Updated 3 years ago
MJ10 / matd3-pytorch
PyTorch implementation of MATD3
☆13 · Apr 3, 2020 · Updated 6 years ago
prasoongoyal / PixL2R
☆17 · Dec 21, 2020 · Updated 5 years ago
mgerstgrasser / super
suPER is a collaborative multi-agent RL algorithm
☆14 · Jun 11, 2024 · Updated 2 years ago
AIDefender / TSCAC
[WWW 2023] The official code for the paper "Two-Stage Constrained Actor-Critic for Short Video Recommendation"
☆15 · Jul 21, 2023 · Updated 3 years ago
Traffic-Alpha / CCDA-Light
Code for "Traffic Signal Cycle Control with Centralized Critic and Decentralized Actors under Varying Intervention Frequencies"
☆14 · Jun 27, 2025 · Updated last year
DRL-CASIA / EpMineEnv
☆11 · May 29, 2025 · Updated last year
ognjenkundacina / Reinforcement_Learning_Spectrum_Sensing
☆17 · Dec 3, 2019 · Updated 6 years ago
nnnyt / MIND
News classification & recommendation in Keras
☆13 · Jun 15, 2020 · Updated 6 years ago
Pdbz199 / koopman-rl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, T…
☆14 · Feb 2, 2025 · Updated last year
SafeRL-Lab / CMORL
[IEEE TPAMI] A Framework for Constrained Multi-Objective Reinforcement Learning
☆19 · Apr 18, 2025 · Updated last year
AnonymousIDforSubmission / GESA
☆15 · Dec 13, 2022 · Updated 3 years ago
felixwzh / MT-GBDT
Multi-task gradient boosting decision tree
☆13 · Apr 14, 2023 · Updated 3 years ago
victorkich / Kolmogorov-PPO
☆17 · Aug 19, 2024 · Updated last year
amardt / DeepGenMSM
Code for the results of the Paper:
☆21 · May 17, 2018 · Updated 8 years ago
xjtushujun / Auto-6ML
Auto^6ML is a jittor library allowing users to achieve machine learning automation.
☆26 · Sep 28, 2024 · Updated last year
jiang-haoyuan / X-Light
☆17 · Jan 19, 2024 · Updated 2 years ago
RuiqiZhang99 / ProxFly
The official codebase of the manuscript "ProxFly: Robust Control for Close Proximity Quadcopter Flight via Residual Reinforcement Learning"
☆25 · Mar 4, 2025 · Updated last year
NickHan-cs / TRACK
☆20 · Mar 12, 2025 · Updated last year
lunarwhite / tiny-zhihu-web
Community QA forum. A Zhihu-style Q&A community forum.
☆11 · Jul 10, 2026 · Updated last week
GZHU-DVL / tau-epsilon-greedy-RL
The code for the article "(\tau,\epsilon)-GREEDY REINFORCEMENT LEARNING FOR ANTI-JAMMING WIRELESS COMMUNICATIONS"
☆30 · Aug 23, 2020 · Updated 5 years ago
CraigGin / PDEKoopman2
Update PDEKoopman code to Tensorflow 2
☆24 · Apr 27, 2021 · Updated 5 years ago
rycolab / info-theoretic-probing
This code accompanies the paper "Information-Theoretic Probing for Linguistic Structure" published in ACL 2020.
☆21 · Apr 27, 2020 · Updated 6 years ago
SafaMessaoud / S2AC-Energy-Based-RL-with-Stein-Soft-Actor-Critic
☆14 · May 27, 2024 · Updated 2 years ago
zhenpengguo / Machine-translation-based-on-Transformer
A Transformer-based machine translation system
☆12 · Jun 28, 2022 · Updated 4 years ago
uwdata / dziban
Context-Aware, Recommender-Powered Visualization Authoring
☆22 · Jul 22, 2020 · Updated 5 years ago
Xiaoyinggit / ConUCB
☆11 · Aug 10, 2020 · Updated 5 years ago
AdvancedAI-ComplexSystem / SmartCity
☆15 · Dec 14, 2025 · Updated 7 months ago
RealMarco / RoboticGraspingSimulation
To verify/test the performance of rectangle-represented grasp detection algorithms, this project builds a joint simulation environment ba…
☆18 · Mar 25, 2025 · Updated last year