☆30Nov 21, 2022Updated 3 years ago
Alternatives and similar repositories for TrulyPPO
Users that are interested in TrulyPPO are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆33Nov 21, 2022Updated 3 years ago
- Deep Reinforcement Learning by using Phasic Policy Gradient in Pytorch & Tensorflow☆20Oct 5, 2021Updated 4 years ago
- Deep Reinforcement Learning by using Truly Proximal Policy Optimization in Tensorflow 2 and Pytorch☆22Nov 9, 2025Updated 4 months ago
- Code for the paper "Deep FTRL-ORW: An Efficient Deep Reinforcement Learning Algorithm for Solving Imperfect Information Extensive-Form Ga…☆11Dec 1, 2022Updated 3 years ago
- Implementation prototype of the Deep Deterministic Off-Policy Gradient (DD-OPG) method.☆11Jun 12, 2019Updated 6 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Ranking Policy Gradient☆23Nov 27, 2019Updated 6 years ago
- Mirror Descent Policy Optimization☆42Oct 31, 2020Updated 5 years ago
- Proximal Policy Optimization with Stein Control Variates:☆33Feb 12, 2018Updated 8 years ago
- Repository for SIGIR'18 paper: "Ranking for Relevance and Display Preferences in Complex Presentation Layouts"☆16Aug 28, 2018Updated 7 years ago
- ☆12May 29, 2022Updated 3 years ago
- Tools to rebuild a VOXEL-enabled server and client.☆14Nov 11, 2021Updated 4 years ago
- A PyTorch implementation of REINFORCE Learning To Rank on OSHUMED, MQ, etc. dataset. Basic idea also appears in SIGIR'17 Reinforcement Le…☆18Dec 8, 2017Updated 8 years ago
- Sample-Efficient Reinforcement Learning with Bootstrapped Dual Policy Iteration☆25Sep 9, 2019Updated 6 years ago
- Build Your Own Bundle-A Neural Combinatorial Optimization Method (BYOB)☆13Apr 27, 2022Updated 3 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Safe Policy Improvement with Baseline Bootstrapping☆26May 5, 2020Updated 5 years ago
- [IEEE Transactions on Intelligent Transportation Systems] Curricular Subgoal for Inverse Reinforcement Learning☆16Jul 31, 2023Updated 2 years ago
- [NeurIPS'20] Code for the paper "Offline Imitation Learning with a Misspecified Simulator"☆12Nov 24, 2021Updated 4 years ago
- PyTorch implementation of "Sample-efficient Imitation Learning via Generative Adversarial Nets"☆10Nov 22, 2019Updated 6 years ago
- Official implementation of the paper `Augmenting GAIL with BC for sample efficient imitation learning` in PyTorch☆35Jan 3, 2021Updated 5 years ago
- Code of Truly Batch Model-Free Inverse Reinforcement Learning about Multiple Intentions☆13May 22, 2023Updated 2 years ago
- Contains an implementation of "Imitation Learning via Kernel Mean Embedding (2018, AAAI)"☆11Oct 2, 2018Updated 7 years ago
- A multi-task deep reinforcement learning model for trading futures contracts using the Interactive Brokers API and TensorFlow☆15Feb 8, 2023Updated 3 years ago
- 用RLHF可选LoRA对LLaMA和MOSS进行训练|Training LLaMA or MOSS with RLHF [LoRA]☆21May 16, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆10Apr 24, 2021Updated 4 years ago
- ☆16Jul 28, 2022Updated 3 years ago
- wePoker is a multi-player poker game for Android☆12Mar 20, 2013Updated 13 years ago
- ☆10Apr 18, 2017Updated 8 years ago
- ☆10Jan 21, 2021Updated 5 years ago
- Model-based reinforcement learning (generative simulator models and planning agents)☆16Mar 13, 2026Updated 2 weeks ago
- AAC decoder for MPEG-4 and AAC files, with rodio support☆19Feb 15, 2024Updated 2 years ago
- ☆13Nov 5, 2024Updated last year
- Implementation of "Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update", NeurIPS 2019.☆16Sep 24, 2019Updated 6 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆14Oct 11, 2023Updated 2 years ago
- Source code to the AAAI21 publication Augmenting Policy Learning with Routines Discovered from a Single Demonstration☆17Jan 7, 2021Updated 5 years ago
- Trajectory prediction using CARLA simulated data☆18May 21, 2024Updated last year
- Cloud client for douzero training☆11Dec 26, 2021Updated 4 years ago
- Deep RL Code for XDO: A Double Oracle Algorithm for Extensive-Form Games☆40Aug 27, 2021Updated 4 years ago
- CADRE: Contextual Attention-based Drug REsponse☆12Nov 23, 2020Updated 5 years ago
- Dataset for ICSE 2020 paper "Repairing Deep Neural Networks: Fix Patterns and Challenges"☆10Feb 10, 2020Updated 6 years ago