☆13May 30, 2019Updated 6 years ago
Alternatives and similar repositories for maxent
Users that are interested in maxent are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for ICLR 2022 Paper (HyperDQN: A Randomized Exploration Method for Deep Reinforcement Learning)☆12Nov 28, 2023Updated 2 years ago
- Code for generating options for planning and reinforcement learning☆12Feb 18, 2021Updated 5 years ago
- Code for Optimistic Exploration even with a Pessimistic Initialisation☆14Aug 4, 2020Updated 5 years ago
- Just another DAgger algorithm implementation☆14Apr 10, 2017Updated 8 years ago
- Revisiting Peng's Q(lambda) for Modern Reinforcement Learning☆15Jul 23, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Code for paper "Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning" by David Janz*, Jiri Hron*, Przemys…☆21Feb 24, 2023Updated 3 years ago
- Official implementation for the paper: "Shallow Updates for Deep Reinforcement Learning"☆18Nov 2, 2017Updated 8 years ago
- Reinforcement Learning via Latent State Decoding☆29Jun 12, 2023Updated 2 years ago
- Density Constrained Reinforcement Learning☆12Mar 24, 2023Updated 3 years ago
- Code for "Best arm identification in multi-armed bandits with delayed feedback", AISTATS 2018.☆20Apr 3, 2018Updated 7 years ago
- ☆12Oct 20, 2020Updated 5 years ago
- Implementations of the ICML 2017 paper (with Yarin Gal)☆38Dec 15, 2017Updated 8 years ago
- A description for setting up an Amazon EC2 GPU instance for Deep Learning.☆16Jul 21, 2016Updated 9 years ago
- ☆13Jul 25, 2019Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆15Apr 5, 2017Updated 8 years ago
- ☆10Jul 13, 2024Updated last year
- Face Recognition on NVIDIA TX2☆10Sep 5, 2018Updated 7 years ago
- Safe SLAC, an algorithm for safe cost-constrained reinforcement learning in high-dimensional POMDPs.☆11Mar 1, 2023Updated 3 years ago
- Avoiding catastrophic failures in reinforcement learning by learning to shape rewards.☆10Nov 13, 2017Updated 8 years ago
- A helper package to get information of scholarly articles from DBLP using its public API☆15May 13, 2025Updated 10 months ago
- Code for Paper (Policy Optimization in RLHF: The Impact of Out-of-preference Data)☆29Dec 19, 2023Updated 2 years ago
- P3O paper code☆30Aug 7, 2019Updated 6 years ago
- Uplifted Contextual Multi-Armed Bandit☆19May 4, 2022Updated 3 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆31Jul 1, 2019Updated 6 years ago
- Count based exploration with the successor representation for Unity ML's Pyramid☆12Jun 19, 2019Updated 6 years ago
- Scalable Bayes via Barycenter in Wasserstein Space☆10Sep 7, 2017Updated 8 years ago
- Non-stationary Off-policy Evaluation☆13Nov 8, 2018Updated 7 years ago
- Probabilistic planning in continuous state-action MDPs in TensorFlow.☆13Jun 21, 2022Updated 3 years ago
- This is a project using Pytorch to fulfill reinforcement learning on a simple game - Gridworld☆13Jul 13, 2020Updated 5 years ago
- ☆18Oct 30, 2025Updated 4 months ago
- Constrained episodic reinforcement learning in concave-convex and knapsack settings☆11Oct 3, 2023Updated 2 years ago
- Code for the paper "TD or not TD: Analyzing the Role of Temporal Differencing in Deep Reinforcement Learning", Artemij Amiranashvili, Ale…☆12Aug 24, 2018Updated 7 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- A tour of Pomdpland☆10Aug 10, 2022Updated 3 years ago
- Supporting code for "Parallel Streaming Wasserstein Barycenters"☆10Nov 14, 2017Updated 8 years ago
- ☆12Jul 16, 2023Updated 2 years ago
- Implementation of "POPCORN: Partially Observed Prediction Constrained Reinforcement Learning" (Futoma, Hughes, Doshi-Velez, AISTATS 2020)☆11May 19, 2021Updated 4 years ago
- Implementation of Russo and Van Roy work on Information Directed Sampling (2017)☆21Jan 18, 2019Updated 7 years ago
- Code for 'Contrastive Multi-Document Question Generation'☆11Oct 16, 2022Updated 3 years ago
- The interface between probabilistic model checking and data-driven policy learning.☆16Mar 11, 2026Updated 2 weeks ago