Meta RL codebase for Unstable Baselines
☆22Dec 6, 2022Updated 3 years ago
Alternatives and similar repositories for meta_rl
Users that are interested in meta_rl are comparing it to the libraries listed below.
- ☆12May 14, 2024Updated last year
- ☆20Feb 8, 2023Updated 3 years ago
- Re-implementations of SOTA RL algorithms.☆137Sep 7, 2023Updated 2 years ago
- Implementation of Efficient Off-policy Meta-learning via Probabilistic Context Variables (PEARL)☆505Dec 1, 2022Updated 3 years ago
- ☆11Sep 8, 2023Updated 2 years ago
- Malicious URL Detection Using Deep Learning.☆15Jul 3, 2023Updated 2 years ago
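- TensorFlow implementation of SNAIL and RL2☆11Aug 17, 2019Updated 6 years ago
- First prize in Hebei Province for Problem C of the 2021 Higher Education Press Cup China Undergraduate Mathematical Contest in Modeling☆12Jun 15, 2023Updated 2 years ago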
- [ICML 2022] Robust Task Representations for Offline Meta-Reinforcement Learning via Contrastive Learning☆39Aug 17, 2022Updated 3 years ago
- Authors' PyTorch implementation of 'Recomposing the Reinforcement Learning Building-Blocks with Hypernetworks' (HypeRL)☆26Jun 9, 2021Updated 4 years ago
- For managing 2P imaging datasets from preprocessing to activity trace extraction☆10Apr 12, 2019Updated 6 years ago
- Explore and Control with Adversarial Surprise☆10Jul 20, 2021Updated 4 years ago
- ☆14Mar 5, 2024Updated 2 years ago
- Meta-reinforcement learning MAML implementation; fixes some outdated code that no longer ran, and lets you view training results directly via render☆11Dec 2, 2025Updated 3 months ago
- Format your bibtex (.bib) file to help standardize citations for conference and journal submissions☆14Nov 23, 2025Updated 4 months ago
- Third-place submission to the NeurIPS MineRL competition 2019☆10Mar 24, 2023Updated 2 years ago
- The code of "Deep Regression Representation Learning with Topology" in ICML 2024☆14Jul 4, 2024Updated last year
- Advanced_Data_Integration_Project☆11Jul 31, 2018Updated 7 years ago
- ☆10Jun 27, 2024Updated last year
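- A personalized movie recommendation system built with Spark SQL, Spark MLlib, and Spark Streaming, based on the latent factor model (LFM) and drawing on practical project experience☆10Aug 25, 2020Updated 5 years ago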
- Code for AAAI 2023 paper "Hypernetworks for Zero-shot Transfer in Reinforcement Learning"☆22Apr 26, 2023Updated 2 years ago
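- Focused on big-data machine learning with Spark ML: supervised and unsupervised learning, mainly classification, regression, clustering, recommendation, and frequent pattern mining algorithms☆17Nov 6, 2020Updated 5 years ago
- E-MAML and RL-MAML baselines implemented in TensorFlow v1☆17Dec 7, 2019Updated 6 years ago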
- Reproduction of Curiosity-driven Exploration by Self-supervised Prediction in PyTorch☆13Jun 10, 2019Updated 6 years ago
- Code for Datasets and Benchmarks submission, NeurIPS 2022☆13Aug 16, 2022Updated 3 years ago
- [ICLR 2026] The official repository for the paper "AdaReasoner: Dynamic Tool Orchestration for Iterative Visual Reasoning".☆78Feb 27, 2026Updated 3 weeks ago
- MetaLight: a value-based meta-reinforcement learning framework for traffic signal control☆44Jan 13, 2020Updated 6 years ago
- Official codebase for CuGRO: Continual Offline Reinforcement Learning via Diffusion-based Dual Generative Replay☆33Apr 14, 2024Updated last year
- Code for "Possibility Before Utility: Learning And Using Hierarchical Affordances" (ICLR 2022)☆14Mar 14, 2022Updated 4 years ago
- Critic Guided Segmentation of Rewarding Objects in First-Person Views.☆13May 21, 2022Updated 3 years ago
- The code of SpikingSSMs: Learning Long Sequences with Sparse and Parallel Spiking State Space Models☆22Apr 16, 2025Updated 11 months ago
- This repo is the official implementation of "Mask-based Latent Reconstruction for Reinforcement Learning" (NeurIPS 2022).☆29Jul 6, 2023Updated 2 years ago
- Code accompanying the paper "Score Regularized Policy Optimization through Diffusion Behavior" (ICLR 2024).☆48Feb 10, 2024Updated 2 years ago
- ☆19May 20, 2024Updated last year
- Evaluating different engineering tricks that make RL work☆15Jun 3, 2021Updated 4 years ago
- The official implementation of Residual-MPPI☆15Mar 22, 2025Updated last year
- Implementation of Isaac gym/sim for the Mars Rover 2.0☆16Sep 10, 2024Updated last year
- Implementation of ICLR 2025 paper "Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation"☆18Oct 5, 2024Updated last year
- Policy learning of in-hand manipulation. Proximal policy optimization trains the Allegro hand to learn a stabilizing grasp☆13Feb 5, 2024Updated 2 years ago