Code for abstracting, evaluating, and visualizing Markov Decision Processes.
☆10Jan 12, 2017Updated 9 years ago
Alternatives and similar repositories for state_abstraction
Users that are interested in state_abstraction are comparing it to the libraries listed below
Sorting:
- ☆31Feb 20, 2021Updated 5 years ago
- Author's PyTorch Implementation of Deep Homomorphic Policy Gradient (DHPG) - NeurIPS 2022 and JMLR 2024☆24Apr 8, 2024Updated last year
- PyTorch Implementation of COPA for coordinating teams that can dynamically change.☆22Apr 16, 2022Updated 3 years ago
- A library to benchmark reinforcement learning algorithms☆21Apr 18, 2018Updated 7 years ago
- Image-based gridworld experiment for learning Markov state abstractions☆21Sep 16, 2024Updated last year
- Code for "oi-VAE: Output Interpretable VAEs for Nonlinear Group Factor Analysis"☆27Feb 24, 2020Updated 6 years ago
- A comparison of parameter space noise methods for exploration in deep reinforcement learning☆30Mar 14, 2019Updated 6 years ago
- ☆31Sep 22, 2021Updated 4 years ago
- Implementation of the skill discovery algorithm described in ICLR submission "Option Discovery using Deep Skill Chaining"☆30Sep 24, 2019Updated 6 years ago
- (NeurIPS 2021) Neural Auto-Curricula in Two-Player Zero-Sum Games.☆28Nov 19, 2021Updated 4 years ago
- Mutual Information State Intrinsic Control (ICLR 2021 Spotlight)☆38Mar 1, 2021Updated 5 years ago
- Implementations of SAILR, PDO, and CSC☆31Jul 15, 2024Updated last year
- Published by Packt☆11Jan 18, 2021Updated 5 years ago
- A projet for simulating the rescue after a disaster☆10Dec 4, 2020Updated 5 years ago
- PyTorch Package For Quasimetric Learning☆48Oct 31, 2024Updated last year
- JAX code for the paper "Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation"☆44Jun 14, 2021Updated 4 years ago
- Implementation of the Deep Deterministic Policy Gradient(DDPG) in bullet Gym using pytorch☆41Mar 23, 2018Updated 7 years ago
- The code for paper, "Episodic Multi-agent Reinforcement Learning with Curiosity-driven Exploration", NeurIPS 2021.☆41Feb 16, 2023Updated 3 years ago
- Source files for the 2020 ICAPS Online Summer School Lab on Plan Execution.☆11Oct 16, 2020Updated 5 years ago
- 股票高频数据(数据来源:新浪)☆13Jan 29, 2020Updated 6 years ago
- N-Tuple Bandit Evolutionary Algorithm☆14May 8, 2020Updated 5 years ago
- Codes for Evolving Plastic ANNs☆14Dec 18, 2022Updated 3 years ago
- ☆10Mar 10, 2021Updated 4 years ago
- Public package to compute translationally and rotationally invariant wavelet-based statistics on images.☆10Aug 25, 2023Updated 2 years ago
- ☆12Jun 2, 2021Updated 4 years ago
- ☆11Sep 30, 2022Updated 3 years ago
- An algorithm that calculates images of the sky based on light scattering phenomena☆19Oct 12, 2020Updated 5 years ago
- modified datasets for remote sensing image caption☆11Apr 23, 2019Updated 6 years ago
- code for paper -- "Seamless Satellite-image Synthesis"☆17Jul 30, 2024Updated last year
- This is the numerical approach proposed in the paper "Optimal Incentives to Mitigate Epidemics: A Stackelberg Mean Field Game Approach" b…☆12Nov 22, 2021Updated 4 years ago
- ☆13Apr 11, 2022Updated 3 years ago
- ☆11Jun 28, 2022Updated 3 years ago
- Implementation prototype of the Deep Deterministic Off-Policy Gradient (DD-OPG) method.☆11Jun 12, 2019Updated 6 years ago
- Salient Objects in Clutter, arXiv, 2021 (ECCV2018 extenstion).☆11Jun 17, 2021Updated 4 years ago
- The code and data from my personal project at Zipfian, in which I create a quantitative measure of similarities between neighborhoods in …☆18Dec 9, 2016Updated 9 years ago
- Learning bisimulation metrics for control, particularly suited to sparse reward settings☆10Feb 28, 2023Updated 3 years ago
- ☆10Nov 23, 2023Updated 2 years ago
- A multi-source cross-modal retrieval network☆14Jan 8, 2024Updated 2 years ago
- This is the code repository for the paper "Zero-Sum Stochastic Stackelberg Games".☆16Oct 12, 2022Updated 3 years ago