alexlioralexli / rllab-finetuning
☆30Jan 19, 2023Updated 3 years ago
Alternatives and similar repositories for rllab-finetuning
Users that are interested in rllab-finetuning are comparing it to the libraries listed below
- N-Layered FeUdal Networks based on FeUdal Networks adapted to suit PySC2 observations☆18Sep 17, 2019Updated 6 years ago
- ☆10Aug 17, 2022Updated 3 years ago
- Systematic generalization test for CLEVR☆15Mar 11, 2020Updated 5 years ago
- Latent World Models For Intrinsically Motivated Exploration | Official repository☆22Apr 28, 2021Updated 4 years ago
- Code to reproduce the results of "Curiosity Driven Exploration of Learned Disentangled Goal Spaces"☆19Oct 26, 2018Updated 7 years ago
- Implementation of Data Efficient Reinforcement Learning in Pytorch☆20Aug 6, 2019Updated 6 years ago
- A collection of meta-learning algorithms in Jax☆24Sep 3, 2022Updated 3 years ago
- Unified notation for Markov Decision Processes PO(MDP)s☆24Apr 27, 2018Updated 7 years ago
- An implementation of the Escape Room domain for Hierarchical Reinforcement Learning.☆25May 15, 2019Updated 6 years ago
- MADDPG in Ray/RLlib☆24Jul 22, 2020Updated 5 years ago
- Options of Interest: Temporal Abstraction with Interest Functions AAAI 2020☆25Jul 31, 2020Updated 5 years ago
- Proximal Policy Option-Critic☆26Jan 4, 2019Updated 7 years ago
- P3O paper code☆30Aug 7, 2019Updated 6 years ago
- Implementation of HindSight Experience Replay paper with Pytorch☆31Apr 28, 2021Updated 4 years ago
- Code accompanying HAAR paper, NeurIPS 2019 - Hierarchical Reinforcement Learning with Advantage-Based Auxiliary Rewards☆32Jan 19, 2023Updated 3 years ago
- This is a DQN-based recommendation system for item-list recommendation and it finally achieved second place in the competition of RL-base…☆11Oct 8, 2021Updated 4 years ago
- Information geometry and its extension information topology☆11Dec 2, 2017Updated 8 years ago
- A reinforcement learning package implemented in Torch☆11Jan 24, 2016Updated 10 years ago
- Implementation of the paper https://arxiv.org/abs/1603.00448.☆38Dec 31, 2020Updated 5 years ago
- Distributed data sync using trimerge☆11Mar 26, 2024Updated last year
- ☆10Dec 14, 2020Updated 5 years ago
- Soft Actor-Critic☆156Mar 13, 2018Updated 7 years ago
- Annotated bibliographies.☆40Aug 25, 2019Updated 6 years ago
- Decoupled Neural Interfaces Using Synthetic Gradients - under development☆11Jun 27, 2025Updated 7 months ago
- Study Materials, Lecture Notes, Free PDFs, PPTs, Codes & Videos☆16Oct 10, 2023Updated 2 years ago
- Goal-conditioned reinforcement learning like 🔥☆13Feb 3, 2024Updated 2 years ago
- ☆11May 27, 2021Updated 4 years ago
- A pathway and collection of resources for learning Jax from beginner to advanced.☆11Jan 2, 2021Updated 5 years ago
- Implementation prototype of the Deep Deterministic Off-Policy Gradient (DD-OPG) method.☆11Jun 12, 2019Updated 6 years ago
- Cloud task scheduling simulation platform☆12Mar 11, 2020Updated 5 years ago
- ☆17Jul 22, 2025Updated 6 months ago
- Arithmetic in Rust's Type System☆11Feb 18, 2024Updated 2 years ago
- Adaptation of Simple Approach to Ordinal Classification for sklearn framework☆12May 18, 2022Updated 3 years ago
- Learning globally stable dynamical systems policies through imitation. A modification of the original work, focussing on waypoint-based i…☆12Oct 12, 2024Updated last year
- Semi-Markov Afterstate Actor-Critic (SMAAC) with Maze☆11Nov 16, 2021Updated 4 years ago
- Functions for analysing public patenting data.☆15Oct 9, 2018Updated 7 years ago
- JAX implementations of core Deep RL algorithms☆83May 2, 2022Updated 3 years ago
- The source of the new Skin UI SDKs for both Android and iOS☆13Jul 8, 2023Updated 2 years ago
- Setting up highly available web application using Route53, CloudFront and S3☆13Jul 18, 2017Updated 8 years ago