lns/dapo

lns / dapo

Source code for the paper "Divergence-Augmented Policy Optimization"

☆37

Alternatives and similar repositories for dapo

Users who are interested in dapo are comparing it to the repositories listed below.

lns / memoire
☆18 · Apr 17, 2019 · Updated 7 years ago
jinxinglim / Game-Theoretical-Approaches-in-Multi-Agent-Reinforcement-Learning-Policy-Space-Response-Oracles
☆15 · Oct 6, 2019 · Updated 6 years ago
diversepsro / diverse_psro
☆22 · May 20, 2021 · Updated 5 years ago
behaviorguidedRL / BGRL
Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization
☆24 · Jun 24, 2020 · Updated 6 years ago
MahanFathi / TRPO-TensorFlow
Trust Region Policy Optimization (TRPO) in pure TensorFlow
☆18 · Jun 7, 2018 · Updated 8 years ago
fanshiliang / Hierarchical-Deep-Reinforcement-Learning
Python implementation of the paper "Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation"
☆10 · Mar 27, 2018 · Updated 8 years ago
tencent-ailab / tleague_projpage
☆151 · Dec 9, 2024 · Updated last year
DavidJanz / successor_uncertainties_atari
Code for paper "Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning" by David Janz*, Jiri Hron*, Przemys…
☆21 · Feb 24, 2023 · Updated 3 years ago
chscheller / sc2_imitation_learning
StarCraft 2 Imitation Learning
☆29 · Jul 2, 2021 · Updated 5 years ago
YuhangSong / Arena-Baselines
Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020.
☆103 · Mar 6, 2025 · Updated last year
mihahauke / deep_rl_vizdoom
Deep reinforcement learning in ViZDoom (using TensorFlow)
☆19 · Jan 25, 2018 · Updated 8 years ago
tgangwani / GuidanceRewards
PyTorch code for "Learning Guidance Rewards with Trajectory-space Smoothing" (NeurIPS 2020)
☆12 · Jul 7, 2021 · Updated 5 years ago
LinZichuan / AdMRL
Code for paper "Model-based Adversarial Meta-Reinforcement Learning" (https://arxiv.org/abs/2006.08875)
☆35 · Mar 6, 2021 · Updated 5 years ago
illidanlab / rpg
Ranking Policy Gradient
☆23 · Nov 27, 2019 · Updated 6 years ago
sii-yingwen / rommeo
IJCAI 2019 - Regularized Opponent Model with Maximum Entropy Objective (ROMMEO)
☆23 · Dec 8, 2022 · Updated 3 years ago
MadryLab / implementation-matters
☆136 · Jul 25, 2024 · Updated last year
tesslerc / GAC
Code accompanying NeurIPS 2019 paper: "Distributional Policy Optimization - An Alternative Approach for Continuous Control"
☆22 · Dec 17, 2019 · Updated 6 years ago
manantomar / Mirror-Descent-Policy-Optimization
Mirror Descent Policy Optimization
☆43 · Oct 31, 2020 · Updated 5 years ago
younggyoseo / lasertag-v0
Implementation of DeepMind's LaserTag-v0 game in A Unified Game-Theoretic Approach to Multiagent Reinforcement Learning (2017)
☆20 · Nov 30, 2018 · Updated 7 years ago
zhougroup / IDAC
Implicit Distributional Actor Critic
☆11 · Dec 8, 2021 · Updated 4 years ago
ChunyuanLI / RAS
AISTATS 2019: Reference-based Adversarial Sampling & Its applications to Soft Q-learning
☆15 · Jan 21, 2019 · Updated 7 years ago
microsoft / strategically_efficient_rl
More efficient exploration for reinforcement learning in two-player, zero-sum games
☆21 · Jul 30, 2024 · Updated last year
robintyh1 / onpolicybaselines
On-policy optimization baselines for deep reinforcement learning
☆32 · Apr 3, 2020 · Updated 6 years ago
waterhorse1 / NAC
(NeurIPS 2021) Neural Auto-Curricula in Two-Player Zero-Sum Games.
☆28 · Nov 19, 2021 · Updated 4 years ago
voot-t / guide-actor-critic
Keras implementation of guide actor-critic for continuous control
☆11 · Mar 12, 2018 · Updated 8 years ago
flowersteam / curious
Implementation of CURIOUS: Intrinsically Motivated Modular Multi-Goal Reinforcement Learning
☆27 · May 15, 2020 · Updated 6 years ago
cjlovering / Towards-Interpretable-Reinforcement-Learning-Using-Attention-Augmented-Agents-Replication
☆22 · Oct 4, 2019 · Updated 6 years ago
YyzHarry / SV-RL
[ICLR 2020, Oral] Harnessing Structures for Value-Based Planning and Reinforcement Learning
☆34 · Feb 1, 2020 · Updated 6 years ago
rlseminar / rlseminar.github.io
Reinforcement Learning Seminar at the Chinese University of Hong Kong, Shenzhen, China.
☆21 · Nov 17, 2023 · Updated 2 years ago
jparkerholder / PB2
Code for the Population-Based Bandits Algorithm, presented at NeurIPS 2020.
☆20 · Apr 13, 2021 · Updated 5 years ago
MishaLaskin / curl
CURL: Contrastive Unsupervised Representation Learning for Sample-Efficient Reinforcement Learning
☆605 · Oct 28, 2020 · Updated 5 years ago
vub-ai-lab / bdpi
Sample-Efficient Reinforcement Learning with Bootstrapped Dual Policy Iteration
☆25 · Sep 9, 2019 · Updated 6 years ago
jesbu1 / carl
GitHub repo for CARL: Cautious Adaptation for RL in Safety Critical Settings
☆14 · Nov 22, 2022 · Updated 3 years ago
google-research / seed_rl
SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference. Implements IMPALA and R2D2 algorithms in TF2 with SEED's arch…
☆836 · Nov 29, 2022 · Updated 3 years ago
DartML / PPO-Stein-Control-Variate
Proximal Policy Optimization with Stein Control Variates
☆33 · Feb 12, 2018 · Updated 8 years ago
sjtu-marl / malib
A parallel framework for population-based multi-agent reinforcement learning.
☆553 · Dec 14, 2023 · Updated 2 years ago
RLAgent / state-marginal-matching
Efficient Exploration via State Marginal Matching (2019)
☆70 · Jun 30, 2019 · Updated 7 years ago
xiaoyandong08 / maddpg-mpe
Port of an implementation of MADDPG to the environment provided by OpenAI (multiagent-particle-envs).
☆21 · Mar 19, 2018 · Updated 8 years ago
google-research / policy-learning-landscape
Explore the optimization landscape for direct policy learning in reinforcement learning.
☆51 · Jan 16, 2019 · Updated 7 years ago