General implementation of Advantage Actor Critic using Pytorch
☆28Dec 7, 2021Updated 4 years ago
Alternatives and similar repositories for PyTorch-A2C
Users that are interested in PyTorch-A2C are comparing it to the libraries listed below
Sorting:
- Code repository accompanying the Heuristic Guided RL NeurIPS'21 paper☆17Jan 3, 2022Updated 4 years ago
- CFR implementation of a poker bot.☆12Feb 17, 2023Updated 3 years ago
- A2C is a special case of PPO!☆22May 20, 2022Updated 3 years ago
- A well-documented A2C written in PyTorch☆52Jun 3, 2019Updated 6 years ago
- Alpha Zero equipped with Transformer with various novel techniques for speedup in tree search☆28Nov 15, 2018Updated 7 years ago
- ☆15Dec 3, 2022Updated 3 years ago
- Course Homepage☆11Aug 29, 2016Updated 9 years ago
- A python package to design and debug RL agents.☆33Jan 15, 2026Updated last month
- Docker containers of baseline agents for the Crafter environment☆30Dec 14, 2021Updated 4 years ago
- A collection of different PyTorch wrappers for training neural networks and reinforcement algorithms☆13Dec 15, 2022Updated 3 years ago
- (AAAI'2019) The codes, models, logs, and data for an extended paper of the original paper "On Reinforcement Learning for Full-length Game…☆31Oct 5, 2022Updated 3 years ago
- TensorFlow Lab for the BMVA Summer School☆13Jul 8, 2025Updated 7 months ago
- Official code repository for the MICCAI 2025 paper "UltraRay: Introducing Full-Path Ray Tracing in Physics-Based Ultrasound Simulation"☆17Aug 13, 2025Updated 6 months ago
- ☆10Sep 24, 2021Updated 4 years ago
- ☆15May 20, 2025Updated 9 months ago
- A selection of neural network models ported from torchvision for JAX & Flax.☆45Jul 19, 2025Updated 7 months ago
- A repository for code of reinforcement learning algorithms with PyTorch☆30Sep 20, 2021Updated 4 years ago
- Stripped Python images based on alpine variant of library's Python☆10Jan 20, 2022Updated 4 years ago
- Hyper-graph based Multi-task Feature Selection for Multi-modal Classification of Alzheimer's Disease☆12Jun 10, 2019Updated 6 years ago
- 🎈 Easy-to-use video player for Vue 3.x☆12Aug 22, 2023Updated 2 years ago
- 🌿快速生成文件夹目录结构,支持定义目录层级,支持生成到 markdown 文件。☆13Oct 19, 2022Updated 3 years ago
- A first bare bones paralleled implementation of Go Explore as described by the Uber Engineering blog post☆46Jan 25, 2019Updated 7 years ago
- Accelerated replay buffers in JAX☆46Sep 17, 2022Updated 3 years ago
- A general model-free off-policy actor-critic implementation. Continuous and Discrete Soft Actor-Critic with multimodal observations, data…☆41Mar 15, 2024Updated last year
- ☆19Mar 14, 2020Updated 5 years ago
- Centralized cooperative reinforcement learning☆13Jan 8, 2023Updated 3 years ago
- paper <<Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation>> python implementation☆10Mar 27, 2018Updated 7 years ago
- nd009-cn-advanced-p5,针对Udacity CN MLND P5项目☆14Jun 27, 2022Updated 3 years ago
- An experiment with the methodology using Python for analysis and d3js for visualization☆12Aug 14, 2021Updated 4 years ago
- Advanced_Data_Integration_Project☆11Jul 31, 2018Updated 7 years ago
- Hierarchical reinforcement learning framework which uses a directed graph to define the hierarchy.☆14Aug 5, 2022Updated 3 years ago
- Code companion of Multi-task Learning for Aggregated Data using Gaussian Processes paper☆10Apr 6, 2020Updated 5 years ago
- multiarch cross compiling environment for opencv☆11Jun 27, 2024Updated last year
- An implementation of the AlphaZero algorithm for adversarial games to be used with the machine learning framework of your choice☆12Aug 30, 2020Updated 5 years ago
- Homework questions from the Coursera/Stanford course Mining Massibve Datasets. Question, no answers.☆11Nov 22, 2014Updated 11 years ago
- A Light CNN Framework!☆16Apr 8, 2019Updated 6 years ago
- CFR-based Texas Hold'em AI☆11Jan 30, 2021Updated 5 years ago
- Differential Evolution Algorithm which uses Non-dominated Sorting for Multi-Objective Optimization☆10Mar 11, 2020Updated 5 years ago
- Minimalist Operating System designed to implement as much functionality as possible with a budget of 1000 Lines of Code☆12Sep 28, 2016Updated 9 years ago