A PyTorch implementation of SEED, originally created by Google Research for TensorFlow 2.
☆14Dec 8, 2020Updated 5 years ago
Alternatives and similar repositories for pytorch_seed_rl
Users that are interested in pytorch_seed_rl are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- My Submission for the OpenAI/NeurIPS ProcGen Competition☆10Nov 12, 2020Updated 5 years ago
- Tabula Rasa Tic-Tac-Toe☆10Jan 3, 2019Updated 7 years ago
- Understanding RL vision Distill article☆25Mar 3, 2023Updated 3 years ago
- Explore and Control with Adversarial Surprise☆10Jul 20, 2021Updated 4 years ago
- Source code for paper: An Efficient Transfer Learning Framework for Multiagent Reinforcement Learning☆24Sep 2, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Code for the paper Alpha Zero in Continuous Action Space (A0C) (https://arxiv.org/pdf/1805.09613.pdf)☆15Jan 19, 2021Updated 5 years ago
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…☆43Sep 19, 2022Updated 3 years ago
- Vpin caculation and backtesting☆14Aug 16, 2019Updated 6 years ago
- Neural Fictitious Self-Play in Leduc Holdem☆11Jul 4, 2018Updated 7 years ago
- Reinforcement Learning | Multi-Agent RL | Self-Play | Proximal Policy Optimization Algorithm (PPO) agent | Unity Tennis environment☆20Dec 2, 2025Updated 6 months ago
- Official codebase for Improving Computational Efficiency in Visual Reinforcement Learning via Stored Embeddings.☆21Mar 5, 2021Updated 5 years ago
- My Homepage☆10Jun 5, 2026Updated last week
- This is our Final Year Project in Bachelors. We try to avoid congestion on two levels i.e Intersection level and Infrastructure to Vehicl…☆10Oct 12, 2020Updated 5 years ago
- (ICML 2023) Feature learning in deep classifiers through Intermediate Neural Collapse: Accompanying code☆16Jul 27, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- An extensible, dynamic and blazing fast derivatives trading engine☆12Feb 27, 2023Updated 3 years ago
- Reinforcement Learning from Hierarchical Critics☆14Jul 30, 2020Updated 5 years ago
- Repository for ML Reproducibility Challenge 2020 for the Neurips paper, "The Value Equivalence Principle for Model-Based Reinforcement Le…☆18Apr 13, 2021Updated 5 years ago
- TD3, SAC, IQN, Rainbow, PPO, Ape-X and etc. in TF1.x☆63Apr 5, 2021Updated 5 years ago
- SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference. Implements IMPALA and R2D2 algorithms in TF2 with SEED's arch…☆835Nov 29, 2022Updated 3 years ago
- A code implementation for our arXiv paper "Multi-agent Adhoc Team Play using Decompositional Q function"☆133Aug 14, 2023Updated 2 years ago
- PyTorch implementation of R2D2 (Recurrent Reply Distributed DQN)☆13Nov 14, 2019Updated 6 years ago
- Deep reinforcement learning in autonomous driving☆12Aug 25, 2021Updated 4 years ago
- Distributed RL Implementation using Pytorch and Ray (ApeX(Ape-X), A3C, Distributed-PPO(DPPO), Impala)☆27Jun 8, 2022Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- 学习强化学习过程中的笔记和代码☆12Jul 27, 2020Updated 5 years ago
- LLM-Enabled Robot Swarms☆22Mar 25, 2026Updated 2 months ago
- ☆14Aug 26, 2018Updated 7 years ago
- ☆34Jan 17, 2025Updated last year
- Code for the Population-Based Bandits Algorithm, presented at NeurIPS 2020.☆20Apr 13, 2021Updated 5 years ago
- Hierarchical Self-Play☆21Dec 5, 2018Updated 7 years ago
- Reinforcement Leanring Algorithms Trained with Unity☆13Apr 26, 2019Updated 7 years ago
- SIA - C++/Python library for model-based stochastic estimation and optimal control☆22Apr 3, 2024Updated 2 years ago
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)☆148Jan 12, 2019Updated 7 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆11Jun 15, 2019Updated 7 years ago
- ☆15Jun 5, 2019Updated 7 years ago
- 利用impress.js这款基于css3转 换和过渡、工作于现代浏览器(Google Chrome或Safari (或 Firefox 10 或 IE10))、并受prezi.com的理念启发的演示工具,制作在线PPT,为演示带来便捷和不一般的显示效果☆12Jul 27, 2012Updated 13 years ago
- DQN-Atari-Agents: Modularized & Parallel PyTorch implementation of several DQN Agents, i.a. DDQN, Dueling DQN, Noisy DQN, C51, Rainbow,…☆122Dec 18, 2020Updated 5 years ago
- MSc Informatics dissertation project - University of Edinburgh: Curiosity in Multi-Agent Reinforcement Learning☆13Aug 16, 2019Updated 6 years ago
- Python API for the SUMO environment of Plymouth Rd.☆14Feb 1, 2021Updated 5 years ago
- Pytorch implementation of distributed deep reinforcement learning☆76Jul 4, 2022Updated 3 years ago