☆13 · May 30, 2019 · Updated 6 years ago
Alternatives and similar repositories for maxent
Users that are interested in maxent are comparing it to the libraries listed below
- Code for ICLR 2022 Paper (HyperDQN: A Randomized Exploration Method for Deep Reinforcement Learning) ☆12 · Nov 28, 2023 · Updated 2 years ago
- Code for generating options for planning and reinforcement learning ☆12 · Feb 18, 2021 · Updated 5 years ago
- Code for Optimistic Exploration even with a Pessimistic Initialisation ☆14 · Aug 4, 2020 · Updated 5 years ago
- Revisiting Peng's Q(lambda) for Modern Reinforcement Learning ☆15 · Jul 23, 2021 · Updated 4 years ago
- Official implementation for the paper: "Shallow Updates for Deep Reinforcement Learning" ☆18 · Nov 2, 2017 · Updated 8 years ago
- Code for "Best arm identification in multi-armed bandits with delayed feedback", AISTATS 2018. ☆19 · Apr 3, 2018 · Updated 7 years ago
- Code for paper "Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning" by David Janz*, Jiri Hron*, Przemys… ☆21 · Feb 24, 2023 · Updated 3 years ago
- Implementation of Russo and Van Roy work on Information Directed Sampling (2017) ☆21 · Jan 18, 2019 · Updated 7 years ago
- Reinforcement Learning via Latent State Decoding ☆29 · Jun 12, 2023 · Updated 2 years ago
- Code for Paper (Policy Optimization in RLHF: The Impact of Out-of-preference Data) ☆28 · Dec 19, 2023 · Updated 2 years ago
- ☆17 · Oct 30, 2025 · Updated 4 months ago
- ☆27 · May 17, 2019 · Updated 6 years ago
- ☆31 · Jul 1, 2019 · Updated 6 years ago
- P3O paper code ☆30 · Aug 7, 2019 · Updated 6 years ago
- Face Recognition on NVIDIA TX2 ☆10 · Sep 5, 2018 · Updated 7 years ago
- Code repo for Gradient Temporal-Difference Learning with Regularized Corrections paper. ☆37 · Oct 14, 2020 · Updated 5 years ago
- Aquarium: A Comprehensive Framework for Exploring Predator-Prey Dynamics through Multi-Agent Reinforcement Learning Algorithms ☆13 · Apr 3, 2024 · Updated last year
- Sawja provides a high level representation of Java bytecode programs and static analysis tools. ☆12 · Jul 4, 2024 · Updated last year
- ☆10 · Jul 13, 2024 · Updated last year
- A3C style Option-Critic with deliberation cost ☆40 · Jan 9, 2018 · Updated 8 years ago
- Safe SLAC, an algorithm for safe cost-constrained reinforcement learning in high-dimensional POMDPs. ☆11 · Mar 1, 2023 · Updated 3 years ago
- Tree-automata-based run-time type constraints for miniKanren ☆14 · Aug 3, 2023 · Updated 2 years ago
- Fish shell plugin for fzf git bindings ☆10 · Dec 13, 2021 · Updated 4 years ago
- ☆12 · Mar 3, 2023 · Updated 3 years ago
- A transient UI for Cargo, Rust's package manager ☆11 · Dec 17, 2025 · Updated 2 months ago
- ☆12 · Jun 17, 2022 · Updated 3 years ago
- A Dependently Typed Esolang ☆10 · Aug 4, 2017 · Updated 8 years ago
- Generate release names based on a git SHA ☆30 · Dec 21, 2018 · Updated 7 years ago
- Avoiding catastrophic failures in reinforcement learning by learning to shape rewards. ☆10 · Nov 13, 2017 · Updated 8 years ago
- Mix task for running tests for a distributed application ☆10 · Apr 3, 2022 · Updated 3 years ago
- Tested nixpkgs pins that work with devenv ☆24 · Updated this week
- Scalable Bayes via Barycenter in Wasserstein Space ☆10 · Sep 7, 2017 · Updated 8 years ago
- Translations for the Calendar library. ☆11 · Feb 21, 2018 · Updated 8 years ago
- Code for 'Contrastive Multi-Document Question Generation' ☆11 · Oct 16, 2022 · Updated 3 years ago
- Gazelle is a Javascripty Lisp for Javascript. ☆69 · Aug 24, 2013 · Updated 12 years ago
- A statically-typed lisp for the BEAM ☆12 · Aug 28, 2021 · Updated 4 years ago
- Adaptation of Simple Approach to Ordinal Classification for sklearn framework ☆12 · May 18, 2022 · Updated 3 years ago
- Some starter code for training/testing some basic CNN models given our data. ☆10 · Feb 15, 2017 · Updated 9 years ago
- Density Constrained Reinforcement Learning ☆12 · Mar 24, 2023 · Updated 2 years ago