☆14 · May 30, 2019 · Updated 6 years ago
Alternatives and similar repositories for maxent
Users who are interested in maxent are comparing it to the libraries listed below.
- Code for ICLR 2022 Paper (HyperDQN: A Randomized Exploration Method for Deep Reinforcement Learning) · ☆12 · Nov 28, 2023 · Updated 2 years ago
- Code for generating options for planning and reinforcement learning · ☆12 · Feb 18, 2021 · Updated 5 years ago
- Code for Optimistic Exploration even with a Pessimistic Initialisation · ☆14 · Aug 4, 2020 · Updated 5 years ago
- Just another DAgger algorithm implementation · ☆14 · Apr 10, 2017 · Updated 9 years ago
- Revisiting Peng's Q(lambda) for Modern Reinforcement Learning · ☆15 · Jul 23, 2021 · Updated 4 years ago
- Code for paper "Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning" by David Janz*, Jiri Hron*, Przemys… · ☆21 · Feb 24, 2023 · Updated 3 years ago
- Official implementation for the paper: "Shallow Updates for Deep Reinforcement Learning" · ☆18 · Nov 2, 2017 · Updated 8 years ago
- Reinforcement Learning via Latent State Decoding · ☆29 · Jun 12, 2023 · Updated 2 years ago
- Density Constrained Reinforcement Learning · ☆12 · Mar 24, 2023 · Updated 3 years ago
- Code for "Best arm identification in multi-armed bandits with delayed feedback", AISTATS 2018. · ☆20 · Apr 3, 2018 · Updated 8 years ago
- ☆12 · Oct 20, 2020 · Updated 5 years ago
- Implementations of the ICML 2017 paper (with Yarin Gal) · ☆38 · Dec 15, 2017 · Updated 8 years ago
- A description for setting up an Amazon EC2 GPU instance for Deep Learning. · ☆16 · Jul 21, 2016 · Updated 9 years ago
- ☆13 · Jul 25, 2019 · Updated 6 years ago
- ☆15 · Apr 5, 2017 · Updated 9 years ago
- ☆10 · Jul 13, 2024 · Updated last year
- Face Recognition on NVIDIA TX2 · ☆10 · Sep 5, 2018 · Updated 7 years ago
- Safe SLAC, an algorithm for safe cost-constrained reinforcement learning in high-dimensional POMDPs. · ☆11 · Mar 1, 2023 · Updated 3 years ago
- Code repo for "Collapsing Bandits and Their Applications to Public Health Interventions", (NeurIPS'20) · ☆10 · Dec 3, 2025 · Updated 5 months ago
- Avoiding catastrophic failures in reinforcement learning by learning to shape rewards. · ☆10 · Nov 13, 2017 · Updated 8 years ago
- Code for Paper (Policy Optimization in RLHF: The Impact of Out-of-preference Data) · ☆29 · Dec 19, 2023 · Updated 2 years ago
- P3O paper code · ☆30 · Aug 7, 2019 · Updated 6 years ago
- A helper package to get information of scholarly articles from DBLP using its public API · ☆16 · May 13, 2025 · Updated last year
- Uplifted Contextual Multi-Armed Bandit · ☆19 · May 4, 2022 · Updated 4 years ago
- ☆31 · Jul 1, 2019 · Updated 6 years ago
- Count based exploration with the successor representation for Unity ML's Pyramid · ☆12 · Jun 19, 2019 · Updated 6 years ago
- Scalable Bayes via Barycenter in Wasserstein Space · ☆10 · Sep 7, 2017 · Updated 8 years ago
- Non-stationary Off-policy Evaluation · ☆13 · Nov 8, 2018 · Updated 7 years ago
- This is a project using Pytorch to fulfill reinforcement learning on a simple game - Gridworld · ☆13 · Jul 13, 2020 · Updated 5 years ago
- Probabilistic planning in continuous state-action MDPs in TensorFlow. · ☆13 · Jun 21, 2022 · Updated 3 years ago
- ☆19 · Oct 30, 2025 · Updated 6 months ago
- Constrained episodic reinforcement learning in concave-convex and knapsack settings · ☆11 · Oct 3, 2023 · Updated 2 years ago
- Code for the paper "TD or not TD: Analyzing the Role of Temporal Differencing in Deep Reinforcement Learning", Artemij Amiranashvili, Ale… · ☆12 · Aug 24, 2018 · Updated 7 years ago
- A tour of Pomdpland · ☆10 · Aug 10, 2022 · Updated 3 years ago
- Supporting code for "Parallel Streaming Wasserstein Barycenters" · ☆11 · Nov 14, 2017 · Updated 8 years ago
- ☆12 · Jul 16, 2023 · Updated 2 years ago
- Implementation of "POPCORN: Partially Observed Prediction Constrained Reinforcement Learning" (Futoma, Hughes, Doshi-Velez, AISTATS 2020) · ☆11 · May 19, 2021 · Updated 5 years ago
- Implementation of Russo and Van Roy work on Information Directed Sampling (2017) · ☆21 · Jan 18, 2019 · Updated 7 years ago
- Code for 'Contrastive Multi-Document Question Generation' · ☆11 · Oct 16, 2022 · Updated 3 years ago