Deep Reinforcement Learning using Phasic Policy Gradient in PyTorch and TensorFlow
☆20 · Oct 5, 2021 · Updated 4 years ago
Alternatives and similar repositories for reinforcement_learning_phasic_policy_gradient
Users that are interested in reinforcement_learning_phasic_policy_gradient are comparing it to the libraries listed below
- ☆30 · Nov 21, 2022 · Updated 3 years ago
- Deep Reinforcement Learning by using Truly Proximal Policy Optimization in Tensorflow 2 and Pytorch ☆22 · Nov 9, 2025 · Updated 3 months ago
- This project explores deep reinforcement learning, hybrid actor-critic approach with A3C/PPO combined with curiosity for playing Super M… ☆81 · Jan 19, 2019 · Updated 7 years ago
- PyTorch implementation of the state-of-the-art distributional reinforcement learning algorithm Fully Parameterized Quantile Function (FQF… ☆34 · Oct 10, 2020 · Updated 5 years ago
- Analysing result obtained using quite different RL algorithm ☆13 · Sep 5, 2019 · Updated 6 years ago
- a recommendation list of math courses for people with no math background. ☆11 · Mar 2, 2021 · Updated 5 years ago
- ☆12 · May 26, 2022 · Updated 3 years ago
- PyTorch implementation of the Munchausen Reinforcement Learning Algorithms M-DQN and M-IQN ☆44 · Oct 4, 2020 · Updated 5 years ago
- 異常発音 ☆10 · Feb 11, 2026 · Updated 3 weeks ago
- This is MPE-pytorch, fix some bugs. ☆10 · Apr 26, 2020 · Updated 5 years ago
- A2C, ACKTR and A2T implementations for ViZDoom ☆10 · Dec 18, 2017 · Updated 8 years ago
- ☆10 · Dec 29, 2019 · Updated 6 years ago
- PyTorch implementation of DreamerV3, Mastering Diverse Domains through World Models. ☆10 · Feb 16, 2024 · Updated 2 years ago
- retrobob is a retro gaming emulator that runs directly on your browser. Super Nintendo, NES/Famicom, Gameboy and Gameboy Color are curren… ☆11 · Mar 25, 2024 · Updated last year
- ☆13 · Apr 3, 2024 · Updated last year
- Forward the UDP packages (like what NAT does) and do a simple Xor operation bytes by bytes. ☆11 · Feb 18, 2020 · Updated 6 years ago
- Code accompanying paper, Forward Prediction for Physical Reasoning ☆11 · Oct 12, 2021 · Updated 4 years ago
- Rust interface to the Tor Control Protocol (TorCP) ☆13 · Nov 1, 2021 · Updated 4 years ago
- Reinforcement learning with Rust ☆14 · Jul 31, 2022 · Updated 3 years ago
- This code monitors (or sniff) the radiosignals sent by Uponor KNX RF thermostats and sent to OpenHAB using the REST interface. A CC1101 c… ☆11 · Dec 2, 2022 · Updated 3 years ago
- Developing, training, and assessing the performance of a Proximal Policy Optimization (PPO) Stock Trading Agent. ☆14 · Aug 20, 2025 · Updated 6 months ago
- A PyTorch implementation of Proxy Anchor Loss based on CVPR 2020 paper "Proxy Anchor Loss for Deep Metric Learning" ☆11 · Jan 16, 2021 · Updated 5 years ago
- Lane segmentation model trained with tensorflow implementation MobileNetV2 based U-Net ☆11 · Mar 24, 2023 · Updated 2 years ago
- Julia implementations of temporal difference Reinforcement Learning algorithms like Q-Learning and SARSA ☆13 · Nov 16, 2025 · Updated 3 months ago
- Pusher Beams Java Server SDK ☆10 · Feb 12, 2019 · Updated 7 years ago
- Emulator of the soviet ternary computer "Setun-70" (Сетунь-70) ☆18 · Dec 9, 2024 · Updated last year
- Q&A dataset for many-shot jailbreaking ☆14 · Jul 19, 2024 · Updated last year
- Wolfram LibraryLink interface for Rust [Deprecated] ☆10 · Mar 8, 2024 · Updated last year
- Curated LLM (ICML 2024) ☆14 · Oct 23, 2024 · Updated last year
- Code for the paper "Phasic Policy Gradient" ☆267 · Apr 2, 2023 · Updated 2 years ago
- A pure Julia wrapper for TD Ameritrade APIs ☆11 · Apr 2, 2023 · Updated 2 years ago
- Official code for SA-Solver: Stochastic Adams Solver for Fast Sampling of Diffusion Models (NeurIPS 2023) ☆13 · Mar 4, 2024 · Updated 2 years ago
- Mahjong4RL is a project that recreates the game of Japanese Mahjong and use deep reinforcement learning to play it. ☆12 · Feb 17, 2022 · Updated 4 years ago
- AGL/Golang Standard Library Ed25519 including extra25519 code. ☆16 · Jan 4, 2021 · Updated 5 years ago
- Weighted-Boxes-Fusion method implementation with YOLOv4 and YOLOv5 ☆11 · Jul 14, 2022 · Updated 3 years ago
- Hardware-side component of Hastlayer for Microsoft Project Catapult FPGAs. See https://hastlayer.com for details. ☆13 · Mar 28, 2020 · Updated 5 years ago
- Pytorch ImageNet1k Loader with Bounding Boxes. ☆13 · Jan 23, 2022 · Updated 4 years ago
- .NET client for Hadoop ☆14 · Jun 13, 2014 · Updated 11 years ago
- Blind RSA signatures for OpenSSL/BoringSSL. ☆17 · Jan 31, 2026 · Updated last month