A PyTorch implementation of SEED, originally created by Google Research for TensorFlow 2.
☆14Dec 8, 2020Updated 5 years ago
Alternatives and similar repositories for pytorch_seed_rl
Users that are interested in pytorch_seed_rl are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- My Submission for the OpenAI/NeurIPS ProcGen Competition☆11Nov 12, 2020Updated 5 years ago
- Tabula Rasa Tic-Tac-Toe☆10Jan 3, 2019Updated 7 years ago
- Understanding RL vision Distill article☆25Mar 3, 2023Updated 3 years ago
- Source code for paper: An Efficient Transfer Learning Framework for Multiagent Reinforcement Learning☆24Sep 2, 2022Updated 3 years ago
- 算法工程师技术栈学习笔记☆15Aug 22, 2022Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Code for the paper Alpha Zero in Continuous Action Space (A0C) (https://arxiv.org/pdf/1805.09613.pdf)☆15Jan 19, 2021Updated 5 years ago
- Pytorch implementation of "Maximum a Posteriori Policy Optimization" with Retrace for Discrete gym environments☆29Sep 10, 2020Updated 5 years ago
- Prioritized Sequence Experience Replay☆10Aug 16, 2021Updated 4 years ago
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…☆43Sep 19, 2022Updated 3 years ago
- Repository for Iterated Relearning: The Impact of Non-stationarity on Generalisation in Deep Reinforcement Learning☆11Jun 8, 2020Updated 5 years ago
- Vpin caculation and backtesting☆14Aug 16, 2019Updated 6 years ago
- Reinforcement Learning | Multi-Agent RL | Self-Play | Proximal Policy Optimization Algorithm (PPO) agent | Unity Tennis environment☆20Dec 2, 2025Updated 4 months ago
- Official codebase for Improving Computational Efficiency in Visual Reinforcement Learning via Stored Embeddings.☆21Mar 5, 2021Updated 5 years ago
- My Homepage☆10Mar 30, 2026Updated 2 weeks ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Code repository for the research project "You Play Ball, I Play Ball: Bayesian Multi-Agent Reinforcement Learning for Slime Volleyball", …☆17Nov 15, 2020Updated 5 years ago
- (ICML 2023) Feature learning in deep classifiers through Intermediate Neural Collapse: Accompanying code☆16Jul 27, 2023Updated 2 years ago
- An extensible, dynamic and blazing fast derivatives trading engine☆12Feb 27, 2023Updated 3 years ago
- Reinforcement Learning from Hierarchical Critics☆14Jul 30, 2020Updated 5 years ago
- pytorch实现的一些MARL算法☆66May 4, 2021Updated 4 years ago
- PyTorch implementation of D4PG with the SOTA IQN Critic instead of C51. Implementation includes also the extensions Munchausen RL and D2R…☆24Apr 7, 2021Updated 5 years ago
- TD3, SAC, IQN, Rainbow, PPO, Ape-X and etc. in TF1.x☆62Apr 5, 2021Updated 5 years ago
- SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference. Implements IMPALA and R2D2 algorithms in TF2 with SEED's arch…☆836Nov 29, 2022Updated 3 years ago
- ☆12Jan 3, 2022Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆35Sep 5, 2020Updated 5 years ago
- A code implementation for our arXiv paper "Multi-agent Adhoc Team Play using Decompositional Q function"☆132Aug 14, 2023Updated 2 years ago
- PyTorch implementation of R2D2 (Recurrent Reply Distributed DQN)☆12Nov 14, 2019Updated 6 years ago
- Deep reinforcement learning in autonomous driving☆12Aug 25, 2021Updated 4 years ago
- Distributed RL Implementation using Pytorch and Ray (ApeX(Ape-X), A3C, Distributed-PPO(DPPO), Impala)☆27Jun 8, 2022Updated 3 years ago
- 学习强化学习过程中的笔记和代码☆12Jul 27, 2020Updated 5 years ago
- A networking protocol for agent-environment communication☆109Feb 20, 2026Updated last month
- ☆14Aug 26, 2018Updated 7 years ago
- ☆32Jan 17, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆18Jan 4, 2021Updated 5 years ago
- Code for the Population-Based Bandits Algorithm, presented at NeurIPS 2020.☆20Apr 13, 2021Updated 5 years ago
- Hierarchical Self-Play☆21Dec 5, 2018Updated 7 years ago
- Reinforcement Leanring Algorithms Trained with Unity☆13Apr 26, 2019Updated 6 years ago
- Some microbenchmarks and design docs before commencement☆11Feb 1, 2021Updated 5 years ago
- SIA - C++/Python library for model-based stochastic estimation and optimal control☆23Apr 3, 2024Updated 2 years ago
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)☆147Jan 12, 2019Updated 7 years ago