Multi-Agent Deep Reinforcement Learning by using Asynchronous & Impala Proximal Policy Optimization in Pytorch with some explanation
☆37Nov 17, 2020Updated 5 years ago
Alternatives and similar repositories for asynchronous_impala_PPO
Users that are interested in asynchronous_impala_PPO are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Codebase for BRDiv: Diverse teammate generation for ad hoc teamwork☆13May 2, 2024Updated 2 years ago
- ☆15Sep 21, 2020Updated 5 years ago
- ☆16May 4, 2021Updated 5 years ago
- meta-MADDPG (Python implementation)☆19Sep 16, 2018Updated 7 years ago
- [INFOCOM 2021] Mobile Crowdsensing for Data Freshness: A Deep Reinforcement Learning Approach☆16May 16, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- [KDD 2021] Energy-Efficient 3D Vehicular Crowdsourcing for Disaster Response by Distributed Deep Reinforcement Learning☆19May 18, 2022Updated 3 years ago
- [INFOCOM 2020] Multi-Task-Oriented Vehicular Crowdsensing: A Deep Learning Approach☆15May 16, 2022Updated 3 years ago
- Official URDF and SDF models of the R1 humanoid robot.☆16Dec 6, 2023Updated 2 years ago
- ☆13Aug 23, 2023Updated 2 years ago
- Learning Matchable Image Transformations☆13Sep 10, 2019Updated 6 years ago
- POMDP wrappers for OpenAI Gym☆15Nov 4, 2019Updated 6 years ago
- A set of competitive environments for Reinforcement Learning research.☆31Dec 1, 2022Updated 3 years ago
- Codebase for the Graph-based Policy Learning algorithm, which is designed for learning policies to solve the open ad hoc teamwork problem…☆35Mar 31, 2021Updated 5 years ago
- ☆10Oct 31, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- [AAAI 2022] CADRE: A Cascade Deep Reinforcement Learning Framework for Vision-based Autonomous Urban Driving☆30Mar 25, 2026Updated last month
- ☆15Mar 14, 2020Updated 6 years ago
- Deep Reinforcement Learning by using Phasic Policy Gradient in Pytorch & Tensorflow☆20Oct 5, 2021Updated 4 years ago
- SQLCipher PDO extension for Windows PHP (DLL)☆10Oct 8, 2018Updated 7 years ago
- [TMC 2021] Distributed and Energy-Efficient Mobile Crowdsensing with Charging Stations by Deep Reinforcement Learning☆29May 16, 2022Updated 3 years ago
- Implementation of Attentive Multi Task Deep Reinforcement Learning Architecture in Tensorflow☆15Apr 5, 2019Updated 7 years ago
- ☆10Mar 22, 2021Updated 5 years ago
- ☆35Dec 7, 2017Updated 8 years ago
- 3rd placed submission to the NeurIPS MineRL competition 2019☆10Mar 24, 2023Updated 3 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Efficient seed-parallel implementation of "Breaking the Replay Ratio Barrier"☆28May 22, 2023Updated 2 years ago
- An overview of articles related to deep reinforcement learning in fluid mechanics☆11Nov 21, 2023Updated 2 years ago
- ImageNet-12k subset of ImageNet-21k (fall11)☆22Jun 13, 2023Updated 2 years ago
- 3DPhysNet in Tensorflow (IJCAI 2018) https://arxiv.org/abs/1805.00328☆15Jul 2, 2018Updated 7 years ago
- Official Repository for "Agent Modelling under Partial Observability for Deep Reinforcement Learning"☆43Oct 5, 2022Updated 3 years ago
- [RAL 2023] transformer + reinforcement learning for navigation + POMPD☆15Jul 19, 2023Updated 2 years ago
- Course slides☆12Jan 14, 2025Updated last year
- Code for the paper "Deep FTRL-ORW: An Efficient Deep Reinforcement Learning Algorithm for Solving Imperfect Information Extensive-Form Ga…☆11Dec 1, 2022Updated 3 years ago
- ☆108Feb 10, 2021Updated 5 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆14Jun 28, 2022Updated 3 years ago
- A JAX Implementation of the Twin Delayed DDPG Algorithm☆35Mar 12, 2020Updated 6 years ago
- Code for the paper titled "Look Around 👀 and Learn 🎓: Self-Training Object Detection by Exploration" accepted at ECCV2024☆15Oct 4, 2024Updated last year
- Integrates Imbue's Cost Aware pareto-Region Bayesian Search (CARBS) with Weights and Biases (WanDB)☆12Mar 17, 2025Updated last year
- Implementation of papers in 101 lines of code.☆18Nov 12, 2023Updated 2 years ago
- Code and data for article "Learning nonlocal constitutive models with neural networks", CMAME☆12Sep 7, 2021Updated 4 years ago
- MADDPG in Ray/RLlib☆54Jan 14, 2020Updated 6 years ago