mjanschek / pytorch_seed_rlView external linksLinks
A PyTorch implementation of SEED, originally created by Google Research for TensorFlow 2.
☆15Dec 8, 2020Updated 5 years ago
Alternatives and similar repositories for pytorch_seed_rl
Users that are interested in pytorch_seed_rl are comparing it to the libraries listed below
Sorting:
- My Submission for the OpenAI/NeurIPS ProcGen Competition☆11Nov 12, 2020Updated 5 years ago
- An extensible, dynamic and blazing fast derivatives trading engine☆12Feb 27, 2023Updated 2 years ago
- 算法工程师技术栈学习笔记☆15Aug 22, 2022Updated 3 years ago
- time-dependent Hamilton-Jacobi PDEs (http://www.cs.columbia.edu/~cxz/TimeDepHJB/)☆14Feb 5, 2017Updated 9 years ago
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…☆43Sep 19, 2022Updated 3 years ago
- Implementation for "Statistical arbitrage in the US equities market" by Marco Avellaneda and Jeong-hyun Lee☆26Dec 10, 2018Updated 7 years ago
- D ratio is a performance metric to analyse the efficiency of algorithms that predict asset return or asset prices☆25Feb 22, 2024Updated last year
- Understanding RL vision Distill article☆25Mar 3, 2023Updated 2 years ago
- Paper: https://arxiv.org/pdf/2008.12275.pdf☆27Aug 29, 2020Updated 5 years ago
- This is a repository for enabling collaborative and proper practices for financial machine learning.☆28Updated this week
- Pytorch implementation of "Maximum a Posteriori Policy Optimization" with Retrace for Discrete gym environments☆29Sep 10, 2020Updated 5 years ago
- Reinforcement Learning | Multi-Agent RL | Self-Play | Proximal Policy Optimization Algorithm (PPO) agent | Unity Tennis environment☆20Dec 2, 2025Updated 2 months ago
- PyTorch implementation of D4PG with the SOTA IQN Critic instead of C51. Implementation includes also the extensions Munchausen RL and D2R…☆24Apr 7, 2021Updated 4 years ago
- Official repo for our AAAI'21 paper, https://arxiv.org/abs/2007.12354☆27Jul 14, 2021Updated 4 years ago
- Distributed RL Implementation using Pytorch and Ray (ApeX(Ape-X), A3C, Distributed-PPO(DPPO), Impala)☆27Jun 8, 2022Updated 3 years ago
- Tutorial: Writing R and Python Packages with Multithreaded C++ Code using BLAS, AVX2/AVX512, OpenMP, C++11 Threads and Cuda GPU accelerat…☆13Nov 27, 2022Updated 3 years ago
- This MLOps project productionizes a Deep Reinforcement Learning agent with a scalable, distributed data streaming infrastructure using Ka…☆31Apr 24, 2021Updated 4 years ago
- Implementation of "OPTIMAL MARKET MAKING BY REINFORCEMENT LEARNING"☆29Apr 5, 2021Updated 4 years ago
- A collection of different PyTorch wrappers for training neural networks and reinforcement algorithms☆13Dec 15, 2022Updated 3 years ago
- TD3, SAC, IQN, Rainbow, PPO, Ape-X and etc. in TF1.x☆62Apr 5, 2021Updated 4 years ago
- Reinforcement Learning in FX Trading☆28Aug 17, 2019Updated 6 years ago
- Mean-Variance Portfolio Optimisation and Algorithmic Trading Strategies in MATLAB☆36Apr 4, 2021Updated 4 years ago
- Implementation of the model from "Faster sorting algorithms discovered using deep reinforcement learning" that discovered an all-new ult…☆11Aug 29, 2023Updated 2 years ago
- RL algorithm for stock trading with multiple reward functions☆11Apr 21, 2024Updated last year
- ☆35Dec 7, 2017Updated 8 years ago
- Some microbenchmarks and design docs before commencement☆12Feb 1, 2021Updated 5 years ago
- Custom Loss functions for asset return prediction with deep learning regression☆36Oct 17, 2022Updated 3 years ago
- Pytorch implementation of distributed deep reinforcement learning☆76Jul 4, 2022Updated 3 years ago
- ☆35Sep 5, 2020Updated 5 years ago
- Keras 1D Depthwise Convolutional layer☆10May 22, 2020Updated 5 years ago
- In this project, we give python and C++ codes for the Ring Polymer Molecular Dynamics (RMPD) to calculate the time correlation function(…☆12Dec 31, 2017Updated 8 years ago
- Assignments for the cryptography engineering course☆12Dec 17, 2013Updated 12 years ago
- FinanceGPT-B☆10Mar 26, 2024Updated last year
- ☆11Jun 15, 2019Updated 6 years ago
- ☆10Jul 21, 2019Updated 6 years ago
- Source code for paper "Trajectory of Alternating Direction Method of Multipliers and Adaptive Acceleration" of NeurIPS 2019☆10Jan 25, 2024Updated 2 years ago
- Tabula Rasa Tic-Tac-Toe☆10Jan 3, 2019Updated 7 years ago
- A Texas Holdem poker framework written in C++ 20.☆11Apr 23, 2023Updated 2 years ago
- A QA system based on k8s-specific knowledge build on ChatGLM2-6B, serving by Ray.☆10Sep 14, 2023Updated 2 years ago