xtma/apo

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/xtma/apo)

xtma / apo

Average-Reward Reinforcement Learning with Trust Region Methods

☆11

Alternatives and similar repositories for apo

Users that are interested in apo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

CLEANit / heatenginegym
View on GitHub
A collection of heat engines, based on the OpenAI Gym environment framework for use with reinforcement learning applications.
☆15Dec 20, 2021Updated 4 years ago
hellpig / SU-tools
View on GitHub
many powerful tools for studying irreducible representations of SU(n), including making animations of hadron flavor-state multiplets
☆13Jun 17, 2026Updated last month
sail-sg / optim4rl
View on GitHub
Optim4RL is a Jax framework of learning to optimize for reinforcement learning.
☆28Nov 27, 2024Updated last year
tayalmanan28 / Stride_bot
View on GitHub
This is a quadruped simulated on pybullet physics engine, walking using trot and bound mechanisms
☆16Feb 24, 2024Updated 2 years ago
1QB-Information-Technologies / COOL
View on GitHub
Controlled Online Optimization Learning (COOL): Finding the Ground State of Spin Hamiltonians with Reinforcement Learning (arXiv:2003.000…
☆13Jun 18, 2020Updated 6 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
Adeel-Abdullah / mpc-vsi
View on GitHub
A model predictive control based voltage source inverter
☆11Jan 11, 2020Updated 6 years ago
zsylvester / curvaturepy
View on GitHub
Code for analyzing the relationship between curvature and migration rate in meandering rivers
☆14Dec 8, 2022Updated 3 years ago
GreatDrake / non-acyclic-gfn
View on GitHub
Repository for "Revisiting Non-Acyclic GFlowNets in Discrete Environments" (ICML 2025)
☆14Oct 8, 2025Updated 9 months ago
edofazza / GameBoyLearningEnvironment
View on GitHub
☆15Oct 20, 2025Updated 9 months ago
wonderren / public_pymomapf
View on GitHub
Python implementation of algorithms for multi-objective multi-agent path finding.
☆13May 17, 2022Updated 4 years ago
Lemon-cmd / diffusion-jax
View on GitHub
Diffusion Probabilistic Model in Jax
☆13Apr 20, 2024Updated 2 years ago
mitrefireline / simharness
View on GitHub
An open-source Reinforcement Learning (RL) harness written in Python to work with SimFire for training agents to fight wildfires on real …
☆18Oct 8, 2024Updated last year
DeriZSY / hybrid_mopso
View on GitHub
Versions of hybrid pso algorithms for engineering optimization
☆10Dec 21, 2017Updated 8 years ago
SAIC-MONTREAL / hyperzero
View on GitHub
Code for AAAI 2023 paper "Hypernetworks for Zero-shot Transfer in Reinforcement Learning"
☆24Apr 26, 2023Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
mathworks / Two-Zone-MVDC-Electric-Ship
View on GitHub
This submission demonstrates modeling and simulation of a Two-Zone MVDC electric ship in Simscape Electrical, and considers modeling con…
☆10Mar 23, 2026Updated 3 months ago
remilepriol / dualityviz
View on GitHub
An interactive visualization of convex duality (Fenchel conjugate)
☆22Dec 2, 2020Updated 5 years ago
xtma / dsac
View on GitHub
Distributional Soft Actor Critic
☆63Jun 6, 2020Updated 6 years ago
BorgeRokseth / ship_in_transit_simulator
View on GitHub
☆10Jan 22, 2023Updated 3 years ago
Hoang-Trung-Le / ShipSimulation
View on GitHub
☆11Jan 20, 2023Updated 3 years ago
fanshiliang / Hierarchical-Deep-Reinforcement-Learning
View on GitHub
paper <<Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation>> python implementation
☆10Mar 27, 2018Updated 8 years ago
oysteinvolden / vision-based-navigation
View on GitHub
Vision-Based Navigation for Auto-Docking
☆13Apr 21, 2021Updated 5 years ago
eliwchen / CVRPTW_geatpy
View on GitHub
Solving the CVRPTW with geatpy2
☆11Mar 24, 2020Updated 6 years ago
kylestach / bigvision-palivla
View on GitHub
☆15Sep 4, 2025Updated 10 months ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
huanzhang12 / SA_PPO
View on GitHub
[NeurIPS 2020 Spotlight] State-adversarial PPO for robust deep reinforcement learning
☆32Nov 18, 2021Updated 4 years ago
DreamMr / PCL
View on GitHub
Pose-disentangled Contrastive Learning
☆14Jan 27, 2024Updated 2 years ago
acumos / documentation
View on GitHub
☆18Jun 10, 2022Updated 4 years ago
Jiawei888 / FPGA-CNN-Accelerator
View on GitHub
The goal of this design is to use the PYNQ-Z2 development board to design a general convolution neural network accelerator. And through r…
☆11Sep 30, 2020Updated 5 years ago
HeyuanMingong / llirl
View on GitHub
Code for "LifeLong Incremental Reinforcement Learning (LLIRL)"
☆21Jan 28, 2021Updated 5 years ago
HawkTom / VESAEA
View on GitHub
source code for VESAEA, paper in CEC2019
☆10Mar 27, 2019Updated 7 years ago
spirosrap / Deep-Reinforcement-Learning
View on GitHub
Deep Reinforcement Learning - Implementations and Theory: A path to mastery
☆13Nov 21, 2021Updated 4 years ago
quixsi / lets-party
View on GitHub
☆15Nov 20, 2025Updated 8 months ago
annieyan / Bandits-using-UCB-algorithm
View on GitHub
Thompson Sampling for Bandits using UCB policy
☆10Jul 29, 2017Updated 8 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
lakshayvirmani / learning-assisted-mstar
View on GitHub
☆17Oct 2, 2021Updated 4 years ago
KeshengZhang / fault_diagnosis_code_collection
View on GitHub
A collection of Fault Diagnosis python codes
☆10Mar 13, 2022Updated 4 years ago
comrob / gdip
View on GitHub
Optimal solution of the Generalized Dubins Interval Problem (GDIP)
☆23Oct 24, 2022Updated 3 years ago
silvery107 / auto-docking-vessels
View on GitHub
Webots simulation environment and a vision-based autonomous docking algorithm for robotic vessels with a novel latching system.
☆16Oct 8, 2024Updated last year
mingzhangPHD / transferlearning
View on GitHub
Everything about Transfer Learning and Domain Adaptation--迁移学习
☆10Jun 5, 2019Updated 7 years ago
robin-shaun / multicopter-vibration-attenuation
View on GitHub
A solution for multicopter vibration measurement, vibration isolator design and digital filter design
☆20Jan 23, 2021Updated 5 years ago
jack13163 / MyPlatEMO
View on GitHub
【个人源码】Matlab环境下的改进PlatEMO
☆12Apr 1, 2019Updated 7 years ago