My own implementation of Reinforcement Learning algorithms using Tensorflow 2.0
☆30Jan 22, 2022Updated 4 years ago
Alternatives and similar repositories for rl-tf2
Users that are interested in rl-tf2 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)☆91Nov 21, 2023Updated 2 years ago
- using recurrent networks(LSTM) to solve POMDPs☆35Oct 10, 2018Updated 7 years ago
- This repository is the source code of a paper "Integrated Control of Steering and Braking for Effective Collision Avoidance with Autonomo…☆16Dec 5, 2022Updated 3 years ago
- Implementation of the Discrete Soft Actor-Critic algorithm with RNN policy in PyTorch☆26Jan 7, 2023Updated 3 years ago
- Training Agents in a cooperative multi-agent deep reinforcement learning setting to transport objects across a space☆14Jul 5, 2021Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Multi objective optimization-based routing algorithm for SDN networks☆20Aug 31, 2020Updated 5 years ago
- This is the the partial code for my thesis. A COLREGS-compliant multiship collision avoidance based on deep reinforcement learning☆16Dec 4, 2025Updated 4 months ago
- Main repository of the BeFaaS project☆15Jun 29, 2023Updated 2 years ago
- TD Advantage Actor-Critic RL algorithm☆15Mar 19, 2019Updated 7 years ago
- ROS Low-Level PID Feedback Control for Unmanned Surface Vessel☆20Aug 12, 2022Updated 3 years ago
- Application of an LSTM-based policy gradient on an RL agent☆15Aug 24, 2022Updated 3 years ago
- End to End Mobile Robot Navigation using DDPG (Continuous Control with Deep Reinforcement Learning) based on Tensorflow + Gazebo☆58Nov 5, 2019Updated 6 years ago
- Python code for implementation of the paper 'Reinforcement Learning-Based Adaptive PID Controller for DPS☆16Aug 28, 2020Updated 5 years ago
- NSMC Satellite Product Data Reader (AWX)☆20Aug 16, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- 基于深度强化学习不同算法的移动机器人导航避障☆19Jul 6, 2021Updated 4 years ago
- Trading Robot based on LSTM-PPO☆28Dec 27, 2019Updated 6 years ago
- Long-distance maritime polar route planning, taking into account complex changing environmental conditions.☆20Apr 1, 2026Updated last week
- ☆20Sep 14, 2019Updated 6 years ago
- This is a DQN-based recommendation system for item-list recommendation and it finally achieved second place in the competition of RL-base…☆11Oct 8, 2021Updated 4 years ago
- Implementation of Population-Guided Parallel Policy Search for Reinforcement Learning☆22Jan 9, 2020Updated 6 years ago
- Collecting useful literatures, database, code for ship motion simulation, especially for maneuvering research.☆13Jul 3, 2019Updated 6 years ago
- Track 1: Driving with Language☆26Aug 23, 2025Updated 7 months ago
- Actor-Critic and openAI clipped PPO in gym cartpole-v0 and pendulum-v0 environment☆27Aug 2, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Setting up DDPG based reinforcement learning in ROS Gazebo environment☆14Jul 29, 2019Updated 6 years ago
- Keras Implementation of TD3(Twin Delayed DDPG) with PER(Prioritized Experience Replay) option on OpenAI gym framework☆11May 29, 2021Updated 4 years ago
- later☆10Jul 9, 2022Updated 3 years ago
- DRL-based collision avoidance for turtlebot3☆19Feb 6, 2023Updated 3 years ago
- [MICCAI 2024] MoRA: LoRA Guided Multi-Modal Disease Diagnosis with Missing Modality☆12Sep 26, 2025Updated 6 months ago
- Literature reviews of (Unsupervised/self-supervised) pretraining on medical datasets☆18Jan 16, 2024Updated 2 years ago
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)☆54Feb 25, 2025Updated last year
- CNN-LSTM-attention☆10Jan 6, 2021Updated 5 years ago
- ☆28Oct 14, 2022Updated 3 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Official Code Framework of the paper "Deep Language-based Critiquing for Recommender System"☆19Jul 24, 2019Updated 6 years ago
- ☆10Sep 21, 2020Updated 5 years ago
- UAV Obstacle Avoidance using Deep Recurrent Reinforcement Learning with Temporal Attention☆112Oct 23, 2018Updated 7 years ago
- ☆16Feb 22, 2024Updated 2 years ago
- Reinforcement Learning in continuous state and action spaces. DDPG: Deep Deterministic Policy Gradient and A3C: Asynchronous Actor-Critic…☆14May 14, 2018Updated 7 years ago
- ☆19May 12, 2021Updated 4 years ago
- 基于 Dify 构建的高级搜索工具☆32Aug 22, 2024Updated last year