☆10Aug 8, 2021Updated 4 years ago
Alternatives and similar repositories for MAML_Pytorch_RL
Users that are interested in MAML_Pytorch_RL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆15Dec 2, 2019Updated 6 years ago
- Study of paper "Meta reinforcement learning for sim-to-real domain adaptation"☆19Jun 3, 2022Updated 3 years ago
- ☆33Jun 16, 2023Updated 2 years ago
- TransMix: Transformer-based Value Function Decomposition for Cooperative Multi-agent Reinforcement Learning☆11Oct 18, 2022Updated 3 years ago
- ☆10Jun 22, 2021Updated 4 years ago
- Source code for "A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning" (ICML 2021)☆33Oct 6, 2022Updated 3 years ago
- ☆33Aug 30, 2024Updated last year
- ☆12Nov 11, 2021Updated 4 years ago
- ☆14Mar 5, 2024Updated 2 years ago
- ☆10Oct 11, 2022Updated 3 years ago
- Code snippets of Meta Reinforcement Learning algorithms☆39Sep 7, 2023Updated 2 years ago
- Peer DID method implementation in Python☆12Sep 27, 2023Updated 2 years ago
- DEPRECATED - please visit https://github.com/vwxyzjn/ppo-implementation-details☆46Apr 14, 2022Updated 3 years ago
- Triangulated irregular network☆11Mar 29, 2015Updated 10 years ago
- ☆14Dec 8, 2025Updated 3 months ago
- A repository of load frequency control models implemeted in matlab☆12Feb 22, 2025Updated last year
- Transient Stability Analysis of Networked Microgrids Using Rapid Neural Lyapunov Method☆14Sep 13, 2023Updated 2 years ago
- Reimplementing existing learning-based ABR algorithms for dynamic video streaming. These algorithms were implemented with Pytorch and pyt…☆40May 29, 2024Updated last year
- Reinforcement Learning with Model-Agnostic Meta-Learning in Pytorch☆877Dec 27, 2022Updated 3 years ago
- Python-based tool to generate anthropometric human whole-body models in a URDF format☆33Jun 20, 2025Updated 9 months ago
- ☆10Oct 15, 2020Updated 5 years ago
- An environment for mobile angets to interact with realistic android device or android emulator☆13Jul 19, 2024Updated last year
- [MLHC 2021] Model Selection for Offline RL: Practical Considerations for Healthcare Settings. https://arxiv.org/abs/2107.11003☆10Oct 6, 2022Updated 3 years ago
- ☆27Jun 19, 2025Updated 9 months ago
- Optimal probabilistic planning of the transmission network development with the consideration of wind resource uncertainty☆11Jun 1, 2019Updated 6 years ago
- ☆11May 23, 2023Updated 2 years ago
- ☆10Apr 23, 2021Updated 4 years ago
- ☆32Jan 30, 2026Updated last month
- This work proposes a planning methodology of distribution systems formulated as a nonlinear optimization problem, which was solved throug…☆20Apr 19, 2024Updated last year
- D3PE (Deep Data-Driven Policy Evaluation) aims to evaluation a large set of candidate policies from a fixed dataset to select best ones.☆11Jun 2, 2022Updated 3 years ago
- ☆12Jan 30, 2021Updated 5 years ago
- ☆11May 2, 2023Updated 2 years ago
- ☆15Oct 6, 2020Updated 5 years ago
- Implementation of ICLR 2025 paper "Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation"☆18Oct 5, 2024Updated last year
- Code to related to my NIPS 2016 paper☆10Dec 4, 2016Updated 9 years ago
- Code for Continual Learning of Control Primitives☆18Nov 11, 2020Updated 5 years ago
- This repo contains the scripts used to create the data for the ATC2020 paper "Reconstructing proprietary video streaming algorithms"☆14Mar 24, 2021Updated 5 years ago
- Source codes for "Preference-grounded Token-level Guidance for Language Model Fine-tuning" (NeurIPS 2023).☆17Jan 8, 2025Updated last year
- The repo for ACL2021 findings paper - Don't Miss the Labels: Label-semantic Argumented Meta-Learner for Few-Shot Text Classification☆15Mar 24, 2022Updated 4 years ago