Pytorch implementation of Randomized Ensembled Double Q-learning (REDQ)
☆21Mar 12, 2021Updated 5 years ago
Alternatives and similar repositories for Randomized-Ensembled-Double-Q-learning-REDQ-
Users that are interested in Randomized-Ensembled-Double-Q-learning-REDQ- are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.☆181Nov 14, 2024Updated last year
- Implementation of the Self Paced Reinforcement Learning Experiments☆19Sep 27, 2023Updated 2 years ago
- ☆10Mar 22, 2021Updated 5 years ago
- Code for the paper "D2RL: Deep Dense Architectures for Reinforcement Learning"☆39Jan 22, 2021Updated 5 years ago
- C# 7 and .NET Core 2.0 High Performance, published by Packt☆13Jan 15, 2021Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- 这是win下visual studio项目CTPtest-master(地址http://blog.csdn.net/u012234115/article/details/70195889) 的Linux实现;无GUI的量化交易平台;☆14May 1, 2017Updated 9 years ago
- SIR, SEIR, and beyond☆10Jul 6, 2023Updated 2 years ago
- ur5 robot with robotiq parallel grippers for testing parallel grasping algorithms☆11Apr 10, 2016Updated 10 years ago
- A simple ring (circular) buffer implementation for the .NET framework, written in C#☆17Aug 28, 2019Updated 6 years ago
- Liquibook Implementation of Order Book with the CMake build system☆16Jul 2, 2018Updated 7 years ago
- ☆10Nov 4, 2019Updated 6 years ago
- Bot for `Kore 2022 - Beta` https://www.kaggle.com/competitions/kore-2022-beta/overview☆10May 28, 2022Updated 3 years ago
- 2022 WSDM 爱奇艺用户留存预测赛 第三名方案☆14Jan 29, 2022Updated 4 years ago
- Basic set of utilities for streaming real time trade and limit order book event data☆14May 20, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- On the model-based stochastic value gradient for continuous reinforcement learning☆57Mar 6, 2026Updated 2 months ago
- ☆12Aug 15, 2020Updated 5 years ago
- ☆12Sep 30, 2017Updated 8 years ago
- Multi-objective reinforcement learning for covid-19 control☆12Aug 12, 2021Updated 4 years ago
- 高频交易系统结构☆17May 21, 2020Updated 5 years ago
- 爱奇艺用户存留预测竞赛项目,赛题任务是:给定一个时间点,预测未来七天用户会登陆几天。选手需根据给定的数据构建标签和用户行为序列特征来训练模型。☆10Jun 30, 2022Updated 3 years ago
- Official code for "Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning".☆53Apr 11, 2024Updated 2 years ago
- ☆12Jun 8, 2020Updated 5 years ago
- Cooperation and Fairness in Multi-Agent Reinforcement Learning☆16Aug 6, 2025Updated 9 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- This is the accompannying code for the paper "SLAM-Safe Planner: Preventing Monocular SLAM Failure using Reinforcement Learning" and "Dat…☆18Sep 15, 2017Updated 8 years ago
- ☆13Jun 23, 2017Updated 8 years ago
- Implement IMPALA architecture from Distributed Deep-RL Paper.☆15Oct 18, 2018Updated 7 years ago
- 🔍 Codebase for the ICML '20 paper "Ready Policy One: World Building Through Active Learning" (arxiv: 2002.02693)☆18Jul 6, 2023Updated 2 years ago
- Code for researching and backtesting pairs trading☆24Mar 14, 2010Updated 16 years ago
- A meta-population model for COVID19 in China☆11Jun 10, 2020Updated 5 years ago
- ☆13Jan 5, 2021Updated 5 years ago
- Implementation of an online learning algorithm to do classification under concept drift☆23Nov 20, 2017Updated 8 years ago
- eForest: Reversible mapping between high-dimensional data and path rule identifiers using trees embedding☆24Sep 7, 2020Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- PGQ is an approach to combine Policy Gradient and Q-Learning. This repository will contain an implementation of PGQ.☆15Mar 9, 2017Updated 9 years ago
- Machine learning to predict future number Covid19 Daily Cases (7-day moving average). Long Short Term Memory (LSTM) Predictor and Reinfor…☆14Feb 21, 2021Updated 5 years ago
- ☆77Mar 15, 2021Updated 5 years ago
- [NeurIPS 2024] Offline Reinforcement Learning with OOD State Correction and OOD Action Suppression☆14Oct 29, 2025Updated 6 months ago
- Deep Reinforcement Learning Agent to control traffic light providing emergency facilitation using real-time traffic data.☆10Feb 11, 2021Updated 5 years ago
- Multi Agent Reinforcement Learning Environment For Aerial Unmanned Vehicles☆13Apr 13, 2023Updated 3 years ago
- Co-Adaptation of Algorithmic and Implementational Innovations in Inference-based Deep Reinforcement Learning (NeurIPS2021)☆20Oct 25, 2021Updated 4 years ago