DEIR: Efficient and Robust Exploration through Discriminative-Model-Based Episodic Intrinsic Rewards
☆26May 6, 2024Updated 2 years ago
Alternatives and similar repositories for deir
Users that are interested in deir are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆40Nov 23, 2021Updated 4 years ago
- [ICML 2024]Exploration and Anti-exploration with Distributional Random Network Distillation☆17Oct 12, 2024Updated last year
- Sandbox environment for generalizable agent research☆27Aug 19, 2022Updated 3 years ago
- [ICLR 2025] Highly Efficient Self-Adaptive Reward Shaping for Reinforcement Learning (SASR)☆11Aug 26, 2025Updated 9 months ago
- Official codebase for "The Generalization Gap in Offline Reinforcement Learning" accepted to ICLR 2024☆30Apr 8, 2026Updated 2 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- impact-driven-exploration☆136Oct 3, 2023Updated 2 years ago
- ☆19Nov 25, 2022Updated 3 years ago
- HPO and Architecture Benchmarking for RL: Dynamically, Reactive and Efficient☆30Mar 16, 2026Updated 2 months ago
- Latent World Models For Intrinsically Motivated Exploration | Official repository☆23Apr 28, 2021Updated 5 years ago
- C311 Spring 2022☆13Mar 17, 2025Updated last year
- ☆18Jun 8, 2023Updated 3 years ago
- A variant of Varibad that is robust to difficult tasks☆11Aug 30, 2023Updated 2 years ago
- Peng et al. "RED-Net: A Recurrent Encoder–Decoder Network for Video-Based Face Alignment". IJCV, 2018.☆12Jul 19, 2018Updated 7 years ago
- SkillHack: A Benchmark for Skill Transfer in Open-Ended Reinforcement Learning☆17Oct 23, 2022Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆16Jul 28, 2022Updated 3 years ago
- Episodic Control☆22Sep 20, 2022Updated 3 years ago
- Gridworld domains in the gym interface☆29Oct 2, 2024Updated last year
- this is a work about UpliftRec☆10Dec 10, 2024Updated last year
- A customized docker for headless GPU rendering without host-side configuration☆11Aug 22, 2022Updated 3 years ago
- Accompanying Code for "Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning", ICML 2023☆24Dec 29, 2023Updated 2 years ago
- RLeXplore provides stable baselines of exploration methods in reinforcement learning, such as intrinsic curiosity module (ICM), random ne…☆466Apr 4, 2025Updated last year
- Code for simulations in "Computational mechanisms of curiosity and goal-directed exploration"☆11May 22, 2020Updated 6 years ago
- A collection of Deep Reinforcement Learning algorithms implemented with PyTorch to solve Atari games and classic control tasks like CartP…☆125Feb 21, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Pre-training Multi-task Contrastive Learning Models for Scientific Literature Understanding (Findings of EMNLP'23)☆11Aug 24, 2024Updated last year
- ☆14Jun 19, 2023Updated 2 years ago
- Teaching a humanoid to walk(ish), then displaying in your browser (using tensorflow.js and reinforcement learning)☆10Sep 7, 2020Updated 5 years ago
- 💬 Send iMessages using Python through the Shortcuts app.☆18May 25, 2024Updated 2 years ago
- hustpa ics2019☆10Jul 11, 2022Updated 3 years ago
- ☆16Feb 23, 2024Updated 2 years ago
- krazy grid world☆25Mar 2, 2020Updated 6 years ago
- Official repo for From Intention to Execution: Probing the Generalization Boundaries of Vision-Language-Action Models☆34Nov 2, 2025Updated 7 months ago
- LITEN: Learning from Inference Time Execution for VLAs☆27Oct 23, 2025Updated 7 months ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- The Mighty cRL library you've been looking for!☆59May 24, 2026Updated 2 weeks ago
- ☆11Jul 4, 2024Updated last year
- [ECCV] HYDRA: A Hyper Agent for Dynamic Compositional Visual Reasoning☆26Sep 6, 2025Updated 9 months ago
- ☆13Apr 28, 2019Updated 7 years ago
- WWW'24, Mirror Gradient (MG) makes multimodal recommendation models approach flat local minima easier compared to models with normal trai…☆17Nov 1, 2024Updated last year
- Official PyTorch implementation of POEM (Partial Observation Experts Modelling) as introduced in the paper Contrastive Meta-Learning for …☆12Nov 1, 2023Updated 2 years ago
- A Vision-Language-Model for Detecting and Reasoning Over Failures in Robotic Manipulation☆68Apr 1, 2025Updated last year