PyTorch implementation of Randomized Ensembled Double Q-learning (REDQ)
☆21Mar 12, 2021Updated 5 years ago
Alternatives and similar repositories for Randomized-Ensembled-Double-Q-learning-REDQ-
Users who are interested in Randomized-Ensembled-Double-Q-learning-REDQ- are comparing it to the libraries listed below.
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.☆179Nov 14, 2024Updated last year
- Implementation of the Self Paced Reinforcement Learning Experiments☆19Sep 27, 2023Updated 2 years ago
- Code for the paper "D2RL: Deep Dense Architectures for Reinforcement Learning"☆39Jan 22, 2021Updated 5 years ago
- Backtesting tool on tick data☆11Jan 30, 2017Updated 9 years ago
- C# 7 and .NET Core 2.0 High Performance, published by Packt☆13Jan 15, 2021Updated 5 years ago
- SIR, SEIR, and beyond☆10Jul 6, 2023Updated 2 years ago
- Parser for Nasdaq TotalView ITCH 5.0☆12Jul 31, 2014Updated 11 years ago
- Efficiency/Effectiveness Trade-offs in Learning to Rank☆12Sep 11, 2018Updated 7 years ago
- ur5 robot with robotiq parallel grippers for testing parallel grasping algorithms☆11Apr 10, 2016Updated 9 years ago
- A simple ring (circular) buffer implementation for the .NET framework, written in C#☆17Aug 28, 2019Updated 6 years ago
- Testing the effectiveness of gplearn for feature engineering in the Kaggle competition Home Credit Default Risk☆10Jul 18, 2018Updated 7 years ago
- Python codes for Lasso feature selection☆14Oct 10, 2019Updated 6 years ago
- Liquibook Implementation of Order Book with the CMake build system☆16Jul 2, 2018Updated 7 years ago
- ☆10Nov 4, 2019Updated 6 years ago
- Bot for `Kore 2022 - Beta` https://www.kaggle.com/competitions/kore-2022-beta/overview☆10May 28, 2022Updated 3 years ago
- Third-place solution for the 2022 WSDM iQIYI user retention prediction challenge☆14Jan 29, 2022Updated 4 years ago
- C++-based backtesting platform for futures strategies☆10Oct 3, 2019Updated 6 years ago
- Basic set of utilities for streaming real time trade and limit order book event data☆14May 20, 2022Updated 3 years ago
- On the model-based stochastic value gradient for continuous reinforcement learning☆57Mar 6, 2026Updated 3 weeks ago
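- iQIYI user retention prediction competition project. The task: given a point in time, predict how many of the following seven days a user will log in. Participants must build labels and user behavior sequence features from the provided data to train a model.☆10Jun 30, 2022Updated 3 years ago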
- Multi-objective reinforcement learning for covid-19 control☆12Aug 12, 2021Updated 4 years ago
- High-frequency trading system architecture☆17May 21, 2020Updated 5 years ago
- ☆15Jul 9, 2018Updated 7 years ago
- ☆11Jun 8, 2020Updated 5 years ago
- Official code for "Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning".☆53Apr 11, 2024Updated last year
- Cooperation and Fairness in Multi-Agent Reinforcement Learning☆16Aug 6, 2025Updated 7 months ago
- ☆13Jun 23, 2017Updated 8 years ago
- Implement IMPALA architecture from Distributed Deep-RL Paper.☆15Oct 18, 2018Updated 7 years ago
- Advancing in Financial Machine Learning☆16Feb 27, 2020Updated 6 years ago
- 🔍 Codebase for the ICML '20 paper "Ready Policy One: World Building Through Active Learning" (arxiv: 2002.02693)☆18Jul 6, 2023Updated 2 years ago
- My Submissions in LeetCode Using Ruby and Python☆10Aug 29, 2019Updated 6 years ago
- A meta-population model for COVID19 in China☆11Jun 10, 2020Updated 5 years ago
- ☆13Jan 5, 2021Updated 5 years ago
- Implementation of an online learning algorithm to do classification under concept drift☆23Nov 20, 2017Updated 8 years ago
- Fast backtesting module for limit-order strategies on a limit order book☆11Sep 24, 2025Updated 6 months ago
- PGQ is an approach to combine Policy Gradient and Q-Learning. This repository will contain an implementation of PGQ.☆15Mar 9, 2017Updated 9 years ago
- ☆76Mar 15, 2021Updated 5 years ago
- eForest: Reversible mapping between high-dimensional data and path rule identifiers using trees embedding☆24Sep 7, 2020Updated 5 years ago
- Machine learning to predict future number Covid19 Daily Cases (7-day moving average). Long Short Term Memory (LSTM) Predictor and Reinfor…☆14Feb 21, 2021Updated 5 years ago