PyTorch implementation of Randomized Ensembled Double Q-learning (REDQ)
☆21 · Mar 12, 2021 · Updated 5 years ago
Alternatives and similar repositories for Randomized-Ensembled-Double-Q-learning-REDQ-
Users who are interested in Randomized-Ensembled-Double-Q-learning-REDQ- are comparing it to the libraries listed below.
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm. ☆184 · Nov 14, 2024 · Updated last year
- Implementation of the Self Paced Reinforcement Learning Experiments ☆19 · Sep 27, 2023 · Updated 2 years ago
- ☆10 · Mar 22, 2021 · Updated 5 years ago
- Code for the paper "D2RL: Deep Dense Architectures for Reinforcement Learning" ☆40 · Jan 22, 2021 · Updated 5 years ago
- SIR, SEIR, and beyond ☆10 · Jul 6, 2023 · Updated 2 years ago
- ur5 robot with robotiq parallel grippers for testing parallel grasping algorithms ☆11 · Apr 10, 2016 · Updated 10 years ago
- ☆10 · Nov 4, 2019 · Updated 6 years ago
- On the model-based stochastic value gradient for continuous reinforcement learning ☆57 · Mar 6, 2026 · Updated 3 months ago
- ☆12 · Aug 15, 2020 · Updated 5 years ago
- ☆12 · Sep 30, 2017 · Updated 8 years ago
- Multi-objective reinforcement learning for covid-19 control ☆12 · Aug 12, 2021 · Updated 4 years ago
- ☆12 · Jun 8, 2020 · Updated 6 years ago
- Official code for "Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning". ☆53 · Apr 11, 2024 · Updated 2 years ago
- Implement IMPALA architecture from Distributed Deep-RL Paper. ☆15 · Oct 18, 2018 · Updated 7 years ago
- 🔍 Codebase for the ICML '20 paper "Ready Policy One: World Building Through Active Learning" (arxiv: 2002.02693) ☆18 · Jul 6, 2023 · Updated 2 years ago
- A meta-population model for COVID19 in China ☆11 · Jun 10, 2020 · Updated 6 years ago
- ☆13 · Jan 5, 2021 · Updated 5 years ago
- Common Objects Day and Night image dataset. ☆15 · Nov 16, 2022 · Updated 3 years ago
- PGQ is an approach to combine Policy Gradient and Q-Learning. This repository will contain an implementation of PGQ. ☆15 · Mar 9, 2017 · Updated 9 years ago
- Machine learning to predict future number Covid19 Daily Cases (7-day moving average). Long Short Term Memory (LSTM) Predictor and Reinfor… ☆14 · Feb 21, 2021 · Updated 5 years ago
- ☆78 · Mar 15, 2021 · Updated 5 years ago
- DIP & NLP final project (course design) ☆19 · Dec 11, 2022 · Updated 3 years ago
- [NeurIPS 2024] Offline Reinforcement Learning with OOD State Correction and OOD Action Suppression ☆16 · Oct 29, 2025 · Updated 7 months ago
- Deep Reinforcement Learning Agent to control traffic light providing emergency facilitation using real-time traffic data. ☆10 · Feb 11, 2021 · Updated 5 years ago
- Co-Adaptation of Algorithmic and Implementational Innovations in Inference-based Deep Reinforcement Learning (NeurIPS2021) ☆20 · Oct 25, 2021 · Updated 4 years ago
- Code for RRL (https://sites.google.com/view/abstractions4rl) ☆28 · Jan 21, 2022 · Updated 4 years ago
- ☆14 · Jun 21, 2024 · Updated last year
- Code for our paper "Active Perception using Light Curtains for Autonomous Driving", ECCV 2020 ☆10 · Dec 7, 2021 · Updated 4 years ago
- Paper Collection of Reinforcement Learning Exploration covers Exploration of Multi-Arm-Bandit, Reinforcement Learning and Multi-agent Rein… ☆37 · Nov 8, 2019 · Updated 6 years ago
- ☆118 · Apr 28, 2023 · Updated 3 years ago
- PyTorch implementation of MATD3 ☆13 · Apr 3, 2020 · Updated 6 years ago
- Knock your images before you get stressed. ☆11 · Jan 9, 2022 · Updated 4 years ago
- PyTorch implementation of Episodic Meta Reinforcement Learning on variants of the "Two-Step" task. Reproduces the results found in three … ☆39 · Dec 12, 2020 · Updated 5 years ago
- DIGIX2021: video recommendation based on multi-objective optimization, runner-up solution ☆26 · Oct 7, 2021 · Updated 4 years ago
- Models built with TensorFlow ☆26 · Dec 5, 2018 · Updated 7 years ago
- Code for the paper EpidemiOptim: A Toolbox for the Optimization of Control Policies in Epidemiological Models. ☆16 · Mar 1, 2022 · Updated 4 years ago
- ☆19 · Oct 27, 2025 · Updated 7 months ago
- Code for Visual Dexterity: In-Hand Reorientation of Novel and Complex Object Shapes (Science Robotics) ☆153 · Nov 3, 2025 · Updated 7 months ago
- Simulation environments for Multi-Objective Reinforcement Learning (MORL) ☆17 · Aug 2, 2022 · Updated 3 years ago