Guozheng-Ma / Adaptive-Replay-RatioView external linksLinks
[ICLR 2024] Adaptive Replay Ratio implementation from 'Revisiting Plasticity in Visual RL: Data, Modules and Training Stages'.
☆13Oct 9, 2024Updated last year
Alternatives and similar repositories for Adaptive-Replay-Ratio
Users that are interested in Adaptive-Replay-Ratio are comparing it to the libraries listed below
Sorting:
- Code for the paper "Stable Gradients for Stable Learning at Scale in Deep Reinforcement Learning". Great performance in many environments…☆35Oct 24, 2025Updated 3 months ago
- ☆25Aug 19, 2024Updated last year
- Accelerating Research in Plasticity-Motivated Deep Reinforcement Learning.☆35Feb 9, 2026Updated last week
- DrM, a visual RL algorithm, minimizes the dormant ratio to guide exploration-exploitation trade-offs, achieving significant improvements …☆78May 27, 2024Updated last year
- Unofficial baselines for ManiSkill, including RL and BC algorithms.☆17Jun 6, 2024Updated last year
- [NeurIPS 2024] PyTorch code for the paper "Making Offline RL Online: Collaborative World Models for Offline Visual Reinforcement Learning…☆23Oct 24, 2025Updated 3 months ago
- Collection of papers and resources for data augmentation (DA) in visual reinforcement learning (RL).☆80Mar 27, 2024Updated last year
- Code release for "HarmonyDream: Task Harmonization Inside World Models" (ICML 2024), https://arxiv.org/abs/2310.00344☆49Jun 27, 2024Updated last year
- ☆60Jan 30, 2026Updated 2 weeks ago
- Official release of the DMControl Generalization Benchmark 2 (DMC-GB2)☆22Jul 21, 2025Updated 6 months ago
- [ICML 2025 oral] Network Sparsity Unlocks the Scaling Potential of Deep Reinforcement Learning☆41Jun 5, 2025Updated 8 months ago
- GeRM: A Generalist Robotic Model with Mixture-of-Experts for Quadruped Robot https://songwxuan.github.io/GeRM/☆35Apr 29, 2025Updated 9 months ago
- Code for "SimbaV2: Hyperspherical Normalization for Scalable Deep Reinforcement Learning"☆91Nov 4, 2025Updated 3 months ago
- ☆39Feb 4, 2026Updated last week
- ☆86Jan 9, 2026Updated last month
- ☆34Oct 25, 2024Updated last year
- A framework for few-shot evaluation of autoregressive language models.☆12Jul 14, 2025Updated 7 months ago
- DOMAINEVAL is an auto-constructed benchmark for multi-domain code generation that consists of 2k+ subjects (i.e., description, reference …☆14Dec 12, 2024Updated last year
- ☆11Jan 21, 2026Updated 3 weeks ago
- [CVPR2024] Learning from Synthetic Human Group Activities☆14Feb 24, 2025Updated 11 months ago
- A Swedish Natural Language Understanding Benchmark☆11Dec 12, 2025Updated 2 months ago
- Create PyKDL chains from URDF robot descriptions☆13Jul 16, 2019Updated 6 years ago
- ☆12Jan 11, 2026Updated last month
- [ICLR 2025 Oral] PyTorch code for the paper "Open-World Reinforcement Learning over Long Short-Term Imagination"☆197Oct 16, 2025Updated 4 months ago
- A curated list of visual reinforcement learning resources☆465Nov 22, 2025Updated 2 months ago
- Align, a general text alignment function☆15Dec 7, 2023Updated 2 years ago
- Code for paper: "Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines"☆11Oct 11, 2024Updated last year
- benchmarks for evaluating MT models☆11Jun 26, 2024Updated last year
- 用于cpolar内网穿透后定时获取状态通过邮件发送自己☆10Jan 22, 2024Updated 2 years ago
- Dataset for AAAI paper "Natural Language Inference in Context - Investigating Contextual Reasoning over Long Texts"☆11Nov 18, 2022Updated 3 years ago
- Code and Data for GlitchBench☆13Feb 27, 2024Updated last year
- ☆12Mar 5, 2025Updated 11 months ago
- ☆12Nov 5, 2024Updated last year
- Website for release of TellMeWhy dataset for why question answering☆14Nov 11, 2022Updated 3 years ago
- ☆11Nov 5, 2024Updated last year
- LLM red teaming datasets from the paper 'Student-Teacher Prompting for Red Teaming to Improve Guardrails' for the ART of Safety Workshop …☆22Oct 12, 2023Updated 2 years ago
- Python package for single and dual robot arm motion planning.☆13Dec 9, 2025Updated 2 months ago
- SDK for Unitree A1, Co-Working with Wego Robotics☆10Feb 20, 2022Updated 3 years ago
- 中文金融大模型测评基准,六大类二十五任务、等级化评价,国内模型获得A级☆10May 6, 2024Updated last year