abbyvansoest/maxent

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/abbyvansoest/maxent)

abbyvansoest / maxent

☆14

Alternatives and similar repositories for maxent

Users that are interested in maxent are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

liziniu / HyperDQN
View on GitHub
Code for ICLR 2022 Paper (HyperDQN: A Randomized Exploration Method for Deep Reinforcement Learning)
☆12Nov 28, 2023Updated 2 years ago
jinnaiyuu / Optimal-Options-ICML-2019
View on GitHub
Code for generating options for planning and reinforcement learning
☆12Feb 18, 2021Updated 5 years ago
oxwhirl / opiq
View on GitHub
Code for Optimistic Exploration even with a Pessimistic Initialisation
☆14Aug 4, 2020Updated 5 years ago
jj-zhu / jadagger
View on GitHub
Just another DAgger algorithm implementation
☆14Apr 10, 2017Updated 9 years ago
robintyh1 / icml2021-pengqlambda
View on GitHub
Revisiting Peng's Q(lambda) for Modern Reinforcement Learning
☆15Jul 23, 2021Updated 5 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
DavidJanz / successor_uncertainties_atari
View on GitHub
Code for paper "Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning" by David Janz*, Jiri Hron*, Przemys…
☆21Feb 24, 2023Updated 3 years ago
Shallow-Updates-for-Deep-RL / Shallow_Updates_for_Deep_RL
View on GitHub
Official implementation for the paper: "Shallow Updates for Deep Reinforcement Learning"
☆18Nov 2, 2017Updated 8 years ago
microsoft / StateDecoding
View on GitHub
Reinforcement Learning via Latent State Decoding
☆29Jun 12, 2023Updated 3 years ago
MIT-REALM / dcrl
View on GitHub
Density Constrained Reinforcement Learning
☆12Mar 24, 2023Updated 3 years ago
ermongroup / best-arm-delayed
View on GitHub
Code for "Best arm identification in multi-armed bandits with delayed feedback", AISTATS 2018.
☆20Apr 3, 2018Updated 8 years ago
YingzhenLi / Dropout_BBalpha
View on GitHub
Implementations of the ICML 2017 paper (with Yarin Gal)
☆39Dec 15, 2017Updated 8 years ago
baofff / BiSM
View on GitHub
☆12Oct 20, 2020Updated 5 years ago
duguyue100 / awsdlgpu
View on GitHub
A description for setting up an Amazon EC2 GPU instance for Deep Learning.
☆16Jul 21, 2016Updated 10 years ago
jbuckman / dmdp-donutworld
View on GitHub
☆13Jul 25, 2019Updated 6 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
laurabreiman / science-of-cooking
View on GitHub
☆15Apr 5, 2017Updated 9 years ago
LAVA-LAB / safe-slac
View on GitHub
Safe SLAC, an algorithm for safe cost-constrained reinforcement learning in high-dimensional POMDPs.
☆11Mar 1, 2023Updated 3 years ago
AdityaMate / collapsing_bandits
View on GitHub
Code repo for "Collapsing Bandits and Their Applications to Public Health Interventions", (NeurIPS'20)
☆11Dec 3, 2025Updated 7 months ago
Victorwz / LaViA
View on GitHub
☆10Jul 13, 2024Updated 2 years ago
zackchase / intrinsic-fear-dqn
View on GitHub
Avoiding catastrophic failures in reinforcement learning by learning to shape rewards.
☆10Nov 13, 2017Updated 8 years ago
rasoolfa / P3O
View on GitHub
P3O paper code
☆30Aug 7, 2019Updated 6 years ago
nv-tlabs / DP-Sinkhorn_code
View on GitHub
☆12Jun 17, 2022Updated 4 years ago
liziniu / policy_optimization
View on GitHub
Code for Paper (Policy Optimization in RLHF: The Impact of Out-of-preference Data)
☆29Dec 19, 2023Updated 2 years ago
mcmachado / count_based_exploration_sr
View on GitHub
☆31Jul 1, 2019Updated 7 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
blayes / WASP
View on GitHub
Scalable Bayes via Barycenter in Wasserstein Space
☆10Sep 7, 2017Updated 8 years ago
bonniesjli / DQN_SR
View on GitHub
Count based exploration with the successor representation for Unity ML's Pyramid
☆12Jun 19, 2019Updated 7 years ago
mingen-pan / Reinforcement-Learning-Q-learning-Gridworld-Pytorch
View on GitHub
This is a project using Pytorch to fulfill reinforcement learning on a simple game - Gridworld
☆14Jul 13, 2020Updated 6 years ago
rjagerman / wsdm2019-nonstationary
View on GitHub
Non-stationary Off-policy Evaluation
☆13Nov 8, 2018Updated 7 years ago
thiagopbueno / tf-mdp
View on GitHub
Probabilistic planning in continuous state-action MDPs in TensorFlow.
☆13Jun 21, 2022Updated 4 years ago
DBaudry / Information_Directed_Sampling
View on GitHub
Implementation of Russo and Van Roy work on Information Directed Sampling (2017)
☆21Jan 18, 2019Updated 7 years ago
vub-dl / u-cmab
View on GitHub
Uplifted Contextual Multi-Armed Bandit
☆19May 4, 2022Updated 4 years ago
lmb-freiburg / td-or-not-td
View on GitHub
Code for the paper "TD or not TD: Analyzing the Role of Temporal Differencing in Deep Reinforcement Learning", Artemij Amiranashvili, Ale…
☆12Aug 24, 2018Updated 7 years ago
mstaib / stochastic-barycenter-code
View on GitHub
Supporting code for "Parallel Streaming Wasserstein Barycenters"
☆11Nov 14, 2017Updated 8 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
aolabsai / OpenReason
View on GitHub
an open source dataset and generation pipeline for Large-Scale Reinforcement Learning
☆17Apr 14, 2025Updated last year
KempnerInstitute / chess-research
View on GitHub
☆11Jun 17, 2024Updated 2 years ago
miryoosefi / ConRL
View on GitHub
Constrained episodic reinforcement learning in concave-convex and knapsack settings
☆11Oct 3, 2023Updated 2 years ago
sisl / pomdpland
View on GitHub
A tour of Pomdpland
☆10Aug 10, 2022Updated 3 years ago
ievron / RegularizationAnimation
View on GitHub
☆11Dec 27, 2021Updated 4 years ago
rgiordan / CovariancesRobustnessVBPaper
View on GitHub
☆12Jul 16, 2023Updated 3 years ago
dtak / POPCORN-POMDP
View on GitHub
Implementation of "POPCORN: Partially Observed Prediction Constrained Reinforcement Learning" (Futoma, Hughes, Doshi-Velez, AISTATS 2020)
☆11May 19, 2021Updated 5 years ago