A PyTorch implementation of SEED, originally created by Google Research for TensorFlow 2.
☆14Dec 8, 2020Updated 5 years ago
Alternatives and similar repositories for pytorch_seed_rl
Users that are interested in pytorch_seed_rl are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- practice☆10Jun 30, 2020Updated 5 years ago
- My Submission for the OpenAI/NeurIPS ProcGen Competition☆11Nov 12, 2020Updated 5 years ago
- Tabula Rasa Tic-Tac-Toe☆10Jan 3, 2019Updated 7 years ago
- Explore and Control with Adversarial Surprise☆10Jul 20, 2021Updated 4 years ago
- Source code for paper: An Efficient Transfer Learning Framework for Multiagent Reinforcement Learning☆24Sep 2, 2022Updated 3 years ago
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- 算法工程师技术栈学习笔记☆15Aug 22, 2022Updated 3 years ago
- Code for the paper Alpha Zero in Continuous Action Space (A0C) (https://arxiv.org/pdf/1805.09613.pdf)☆15Jan 19, 2021Updated 5 years ago
- Pytorch implementation of "Maximum a Posteriori Policy Optimization" with Retrace for Discrete gym environments☆29Sep 10, 2020Updated 5 years ago
- Prioritized Sequence Experience Replay☆10Aug 16, 2021Updated 4 years ago
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…☆43Sep 19, 2022Updated 3 years ago
- Repository for Iterated Relearning: The Impact of Non-stationarity on Generalisation in Deep Reinforcement Learning☆11Jun 8, 2020Updated 5 years ago
- Neural Fictitious Self-Play in Leduc Holdem☆11Jul 4, 2018Updated 7 years ago
- Reinforcement Learning | Multi-Agent RL | Self-Play | Proximal Policy Optimization Algorithm (PPO) agent | Unity Tennis environment☆20Dec 2, 2025Updated 3 months ago
- Official codebase for Improving Computational Efficiency in Visual Reinforcement Learning via Stored Embeddings.☆21Mar 5, 2021Updated 5 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- My Homepage☆10Updated this week
- Code repository for the research project "You Play Ball, I Play Ball: Bayesian Multi-Agent Reinforcement Learning for Slime Volleyball", …☆17Nov 15, 2020Updated 5 years ago
- This is our Final Year Project in Bachelors. We try to avoid congestion on two levels i.e Intersection level and Infrastructure to Vehicl…☆10Oct 12, 2020Updated 5 years ago
- (ICML 2023) Feature learning in deep classifiers through Intermediate Neural Collapse: Accompanying code☆16Jul 27, 2023Updated 2 years ago
- Reinforcement Learning from Hierarchical Critics☆14Jul 30, 2020Updated 5 years ago
- pytorch实现的一些MARL算法☆66May 4, 2021Updated 4 years ago
- PyTorch implementation of D4PG with the SOTA IQN Critic instead of C51. Implementation includes also the extensions Munchausen RL and D2R…☆24Apr 7, 2021Updated 4 years ago
- Repository for ML Reproducibility Challenge 2020 for the Neurips paper, "The Value Equivalence Principle for Model-Based Reinforcement Le…☆18Apr 13, 2021Updated 4 years ago
- TD3, SAC, IQN, Rainbow, PPO, Ape-X and etc. in TF1.x☆62Apr 5, 2021Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference. Implements IMPALA and R2D2 algorithms in TF2 with SEED's arch…☆837Nov 29, 2022Updated 3 years ago
- LLM-enabled Robot Swarms☆20May 20, 2025Updated 10 months ago
- A code implementation for our arXiv paper "Multi-agent Adhoc Team Play using Decompositional Q function"☆132Aug 14, 2023Updated 2 years ago
- PyTorch implementation of R2D2 (Recurrent Reply Distributed DQN)☆12Nov 14, 2019Updated 6 years ago
- Deep reinforcement learning in autonomous driving☆12Aug 25, 2021Updated 4 years ago
- Distributed RL Implementation using Pytorch and Ray (ApeX(Ape-X), A3C, Distributed-PPO(DPPO), Impala)☆27Jun 8, 2022Updated 3 years ago
- A networking protocol for agent-environment communication☆108Feb 20, 2026Updated last month
- ☆14Aug 26, 2018Updated 7 years ago
- ☆32Jan 17, 2025Updated last year
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Code for the Population-Based Bandits Algorithm, presented at NeurIPS 2020.☆20Apr 13, 2021Updated 4 years ago
- Hierarchical Self-Play☆21Dec 5, 2018Updated 7 years ago
- ☆19Nov 13, 2022Updated 3 years ago
- REDSearch: A scalable, cost-efficient framework for long-horizon search agents. Features complex task synthesis, optimized mid-training, …☆70Feb 26, 2026Updated last month
- Some microbenchmarks and design docs before commencement☆11Feb 1, 2021Updated 5 years ago
- Reinforcement Leanring Algorithms Trained with Unity☆13Apr 26, 2019Updated 6 years ago
- SIA - C++/Python library for model-based stochastic estimation and optimal control☆23Apr 3, 2024Updated last year