High-quality implementations of deep reinforcement learning algorithms for experiments
☆51Aug 30, 2024Updated last year
Alternatives and similar repositories for RL-Experiments
Users that are interested in RL-Experiments are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Re-produce DQN, REINFORCE, REINFORCE with baseline, one-step AC, QAC, QAC with shared network, PPO2, DDPG, TD3, SAC, SAC discrete,A2C,A3C☆21Jul 27, 2020Updated 5 years ago
- ☆17Sep 15, 2017Updated 8 years ago
- An implementation of TRPO with GAE in PyTorch☆16Jul 22, 2023Updated 2 years ago
- RLgraph: Modular computation graphs for deep reinforcement learning☆323Nov 5, 2019Updated 6 years ago
- Implementation of the "Sim-to-Real Transfer of Robotic Control with Dynamics Randomization" paper☆13Sep 8, 2021Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Implementation of "Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update", NeurIPS 2019.☆16Sep 24, 2019Updated 6 years ago
- Code associated with our paper "Robust Domain Randomization for Reinforcement Learning"☆12Nov 22, 2022Updated 3 years ago
- Interacting with Latent Space of AutoEncoder☆21Nov 22, 2022Updated 3 years ago
- ☆13May 29, 2018Updated 7 years ago
- Velocity in deep-learning research☆280Dec 8, 2022Updated 3 years ago
- We propose an evolution-based approach to meta-learn synthetic neural environments and reward neural networks for reinforcement learning.☆21Feb 23, 2023Updated 3 years ago
- ☆14Jun 26, 2019Updated 6 years ago
- Contrast between ShuffleNet V2 and MnasNet.(Non-official implement In PyTorch)☆12Oct 25, 2018Updated 7 years ago
- lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.☆378Nov 19, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Cellular Traces Collected in New York City for different scenarios☆13Jul 19, 2020Updated 5 years ago
- Representation Learning in RL☆13Jun 1, 2022Updated 3 years ago
- Ape-X DQN & DDPG with pytorch & tensorboard☆102Jun 18, 2019Updated 6 years ago
- [ICLR 22] Value Gradient weighted Model-Based Reinforcement Learning.☆25Apr 15, 2023Updated 3 years ago
- This is a TensorFlow implementation of DeepMind's A Distributional Perspective on Reinforcement Learning.(C51-DDPG)☆11Sep 14, 2017Updated 8 years ago
- Implementation prototype of the Deep Deterministic Off-Policy Gradient (DD-OPG) method.☆11Jun 12, 2019Updated 6 years ago
- Gymnasium environment for reinforcement learning with multicopters☆32Jun 4, 2024Updated last year
- DHER: Hindsight Experience Replay for Dynamic Goals (ICLR-2019)☆65Nov 8, 2019Updated 6 years ago
- Bayesian Backprop RNN implementation pytorch https://arxiv.org/abs/1704.02798☆25Jan 23, 2018Updated 8 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Code for "Dream and Search to Control: Latent Space Planning for Continuous Control"☆12Jul 12, 2021Updated 4 years ago
- PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments☆337Nov 24, 2021Updated 4 years ago
- An application of stacked denoising autoencoders to multi-modal (images and audio) abstract feature discovery☆12Oct 23, 2013Updated 12 years ago
- ☆13Jul 2, 2021Updated 4 years ago
- Hindsight Experience Replay - Bit flipping experiment in Tensorflow☆58Oct 7, 2018Updated 7 years ago
- ☆11Feb 5, 2024Updated 2 years ago
- PyTorch implementation of Trust Region Policy Optimization☆448Sep 13, 2018Updated 7 years ago
- Simulation system for path planning evaluation☆12Dec 13, 2025Updated 5 months ago
- Open Flight Data analysis tools☆13May 18, 2021Updated 5 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Snake using RL☆21Nov 27, 2023Updated 2 years ago
- DrQ: Data regularized Q☆422Jan 13, 2023Updated 3 years ago
- An unofficial Torch implementation of InfoGAN☆19Sep 5, 2017Updated 8 years ago
- Collection of tutorials, exercises and papers on RL☆17Oct 16, 2017Updated 8 years ago
- ☆11Feb 11, 2024Updated 2 years ago
- TMARKER is a software for cancer cell nucleus detection, segmentation, counting, and classification.☆14Feb 22, 2017Updated 9 years ago
- Autoregressive policies for continuous control reinforcement learning☆33May 15, 2019Updated 7 years ago