Deep Reinforcement Learning by using Truly Proximal Policy Optimization in Tensorflow 2 and Pytorch
☆22Nov 9, 2025Updated 6 months ago
Alternatives and similar repositories for reinforcement_learning_truly_ppo
Users that are interested in reinforcement_learning_truly_ppo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆16Oct 3, 2023Updated 2 years ago
- This repository contains the source code for the implementation of two deep learning models concerning the audio super resolution task.☆14Mar 14, 2023Updated 3 years ago
- (WSDM2022 Best Paper Award Runner-Up) "Doubly Robust Off-Policy Evaluation for Ranking Policies under the Cascade Behavior Model"☆13Jul 16, 2023Updated 2 years ago
- Geometric Rectification of Document Images using Adversarial Gated Unwarping Network☆25Apr 30, 2020Updated 6 years ago
- A Pytorch implementation of "Deep Learning with Logged Bandit Feedback"☆10Aug 22, 2018Updated 7 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Deep Q-Network (DQN) with Prioritized Experience Replay (PER)☆17Jan 1, 2020Updated 6 years ago
- Code for the paper "A Stable Variational Autoencoder for Text Modelling"☆26Mar 27, 2020Updated 6 years ago
- Starter kit for getting started in the NIPS 2017 Criteo Ad Placement Challenge☆18Nov 10, 2017Updated 8 years ago
- F1Tenth Gym with PPO☆17Jul 2, 2021Updated 4 years ago
- ☆12Jul 4, 2022Updated 3 years ago
- A multi agent multi arena car simulator oriented towards Reinforcement Learning with simultaneous multi instance spawning capability☆20Apr 28, 2019Updated 7 years ago
- (ICTIR2020) "Unbiased Pairwise Learning from Biased Implicit Feedback"☆19Nov 21, 2022Updated 3 years ago
- ☆18Apr 25, 2023Updated 3 years ago
- A LLM-powered agent for NetHack☆23Nov 4, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆12Jun 26, 2020Updated 5 years ago
- This is a deterministic Tensorflow 2.0 (keras) implementation of a Open Ai's proximal policy optimization actor critic algorithm PPO.☆12Sep 3, 2020Updated 5 years ago
- Docker image for Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation☆11Apr 14, 2024Updated 2 years ago
- VITS-based zero-shot TTS system varying with diverse style/speaker conditioning methods.☆36Sep 21, 2022Updated 3 years ago
- Incorporating AutoVocoder to MB-iSTFT-VITS☆48Dec 1, 2022Updated 3 years ago
- Your one stop CLI for ONNX model analysis.☆47Nov 13, 2022Updated 3 years ago
- Transport code for plasma simulations☆12Mar 27, 2026Updated 2 months ago
- SineKAN: Kolmogorov-Arnold Networks Using Sinusoidal Activation Functions☆16Dec 19, 2024Updated last year
- Curated LLM (ICML 2024)☆14Oct 23, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Website for Alloytools☆13Nov 3, 2025Updated 6 months ago
- A Dual-RL method DVL: Dual-V Learning for offline and online reinforcement learning☆16Oct 22, 2023Updated 2 years ago
- Genetic Algorithm for integer constrained optimization and its applications☆11Oct 5, 2023Updated 2 years ago
- Proximal Policy Optimization with TensorFlow and OpenAI Gym☆19Mar 31, 2018Updated 8 years ago
- Landing a Spaceship using Upside-Down Reinforcement Learning (a.k.a ⅂ꓤ)☆13Oct 25, 2023Updated 2 years ago
- ☆23Dec 8, 2020Updated 5 years ago
- Variational Autoencoder (VAE)-like neural network to solve ideal MHD equilibrium in a tokamak☆11May 20, 2022Updated 4 years ago
- ☆11Oct 13, 2023Updated 2 years ago
- Deep Implicit Coordination Graphs☆45May 29, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Multi-agent Monte Carlo Tree Search implementation in C++☆15Feb 10, 2022Updated 4 years ago
- ☆42Mar 19, 2021Updated 5 years ago
- ☆10Feb 22, 2023Updated 3 years ago
- A Python multigrid solver implementation for education.☆20May 11, 2015Updated 11 years ago
- ☆11Jun 15, 2019Updated 6 years ago
- Bayesian Soft Actor Critic☆16Jan 6, 2023Updated 3 years ago
- ☆39Jan 3, 2025Updated last year