Reinforcement learning - Batched Impala - PyTorch - Mario Kart
☆13 · Jul 21, 2020 · Updated 5 years ago
Alternatives and similar repositories for Batched-Impala-PyTorch
Users that are interested in Batched-Impala-PyTorch are comparing it to the libraries listed below.
- Pytorch implementation of DreamerV2: Mastering Atari with Discrete World Models, based on the original implementation ☆22 · Jul 25, 2022 · Updated 3 years ago
- Structured Object-Aware Physics Prediction for Video Modeling and Planning ☆32 · May 9, 2020 · Updated 5 years ago
- ☆10 · May 13, 2025 · Updated 10 months ago
- The implementation of "The Kanerva Machine" with Pytorch and Pyro ☆12 · Jun 14, 2018 · Updated 7 years ago
- ☆13 · Oct 26, 2019 · Updated 6 years ago
- Gym implementation of connector to Deepmind lab ☆12 · Mar 26, 2019 · Updated 6 years ago
- Essay on Hamiltonian Monte Carlo in PyMC3 ☆15 · Apr 6, 2023 · Updated 2 years ago
- Code for "Possibility Before Utility: Learning And Using Hierarchical Affordances" (ICLR 2022) ☆14 · Mar 14, 2022 · Updated 4 years ago
- Reinforcement learning algorithm implementation ☆10 · Oct 31, 2021 · Updated 4 years ago
- Multi-task Multi-agent Soft Actor Critic for SMAC ☆15 · Jan 18, 2022 · Updated 4 years ago
- Implementation of the Prioritized Option-Critic on the Four-Rooms Environment ☆17 · Dec 24, 2017 · Updated 8 years ago
- Represented Value Function Approach for Large Scale Multi Agent Reinforcement Learning ☆16 · Mar 11, 2020 · Updated 6 years ago
- Original tensorflow implementation of SILOT (Spatially Invariant, Label-free Object Tracking). ☆13 · Mar 24, 2023 · Updated 2 years ago
- A cog model for the all-mpnet-base-v2 sentence-transformers embedding model. ☆15 · Jan 3, 2024 · Updated 2 years ago
- It's the pytorch implementation of google research football. ☆43 · Jun 14, 2019 · Updated 6 years ago
- Lux AI environment interface for RLlib multi-agents ☆12 · Sep 23, 2021 · Updated 4 years ago
- This is MPE-pytorch, with some bugs fixed. ☆10 · Apr 26, 2020 · Updated 5 years ago
- Adaptable Agent Populations via a Generative Model of Policies ☆12 · Oct 14, 2021 · Updated 4 years ago
- (ICLR 2021) Learning to Represent Action Values as a Hypergraph on the Action Vertices ☆23 · Jun 22, 2021 · Updated 4 years ago
- Simple change of a3c to a2c ☆15 · Jun 18, 2017 · Updated 8 years ago
- A preliminary platform for up to 1 million reinforcement learning agents ☆11 · Aug 27, 2017 · Updated 8 years ago
- ☆18 · Dec 30, 2023 · Updated 2 years ago
- Implementing the Learning with Opponent Learning Awareness paper (https://blog.openai.com/learning-to-model-other-minds/) ☆19 · Jul 20, 2018 · Updated 7 years ago
- drift protocol arbitrage bot using python sdk (driftpy) ☆13 · Nov 7, 2022 · Updated 3 years ago
- A PyTorch implementation of "Importance Weighted Actor-Learner Architectures" https://arxiv.org/abs/1802.01561 ☆12 · Jan 6, 2021 · Updated 5 years ago
- ☆16 · Jan 22, 2018 · Updated 8 years ago
- S. M. Ali Eslami et al. Attend, Infer, Repeat: Fast Scene Understanding with Generative Models ICML16 ☆14 · Sep 27, 2018 · Updated 7 years ago
- TensorFlow implementation of Deep RL (Reinforcement Learning) papers based on deep Q-learning (DQN) ☆10 · Mar 1, 2018 · Updated 8 years ago
- ☆27 · Dec 20, 2021 · Updated 4 years ago
- Library for Auto-Encoding Sequential Monte Carlo ☆18 · Jan 29, 2024 · Updated 2 years ago
- Simple Interactive Machine Learning system for recognizing hand gestures in Processing with OpenCV ☆31 · Oct 11, 2013 · Updated 12 years ago
- ConvDRAW: “Towards Conceptual Compression” (NIPS 2016) with TensorFlow. ☆19 · Dec 9, 2017 · Updated 8 years ago
- (Pytorch ver) Code for "Fully Neural Network based Model for General Temporal Point Process" ☆21 · Sep 15, 2020 · Updated 5 years ago
- Implementation of Direct Preference Optimization ☆17 · Jul 17, 2023 · Updated 2 years ago
- Reinforcement Learning Algorithms with Unity 3D Environments ☆18 · Jul 15, 2019 · Updated 6 years ago
- RLA is a tool for managing your RL experiments automatically ☆72 · Feb 7, 2023 · Updated 3 years ago
- Pytorch Implementation of the Distributed SAC. Test environment is LunarLanderContinuous-v2 and Metaworld MT1, MT10 ☆12 · Apr 6, 2022 · Updated 3 years ago
- Testing different RL algorithms for multi-agent environments. From SARSA, QLearning to Independent Q-Learning, Joint Action Learning and … ☆12 · Mar 29, 2019 · Updated 6 years ago
- Docker containers of baseline agents for the Crafter environment ☆30 · Dec 14, 2021 · Updated 4 years ago