Code for paper: Reward Uncertainty for Exploration in Preference-based Reinforcement Learning
☆15May 26, 2022Updated 3 years ago
Alternatives and similar repositories for rune
Users that are interested in rune are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Pref-RL provides ready-to-use PbRL agents that are easily extensible.☆11Aug 31, 2022Updated 3 years ago
- Listwise Reward Estimation for Offline Preference-based Reinforcement Learning (ICML 2024)☆18Jun 18, 2024Updated last year
- PyTorch implementations for Offline Preference-Based RL (PbRL) algorithms☆21Mar 24, 2025Updated last year
- The source code of the paper "Towards Problem of First Miss under Mobile EdgeCaching"☆11Apr 12, 2021Updated 5 years ago
- code for "Decoupled Preference-based Reinforcement Learning for Personalized Human-Robot Interaction"☆11Jul 9, 2022Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆53Nov 10, 2022Updated 3 years ago
- Implementation of ICLR 2025 paper "Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation"☆18Oct 5, 2024Updated last year
- Inference API server with echo and gRPC to triton server (golang)☆13Nov 16, 2022Updated 3 years ago
- code for polite☆11Feb 28, 2024Updated 2 years ago
- [ICLR 2025] Highly Efficient Self-Adaptive Reward Shaping for Reinforcement Learning (SASR)☆11Aug 26, 2025Updated 8 months ago
- SPA: Efficient User-Preference Alignment against Uncertainty in Medical Image Segmentation (ICCV 2025)☆15Sep 26, 2025Updated 7 months ago
- Official code for ICML 2024 paper, "RIME: Robust Preference-based Reinforcement Learning with Noisy Preferences" (ICML 2024 Spotlight)☆37Oct 15, 2024Updated last year
- TensorFlow implementation for our paper "Learning Long-Term Reward Redistribution via Randomized Return Decomposition"☆19Mar 17, 2022Updated 4 years ago
- Implementation and evaluation of Almanac (Automaton/Logic Multi-Agent Natural Actor-Critic), an algorithm for multi-agent reinforcement l…☆10May 5, 2022Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆14Oct 11, 2022Updated 3 years ago
- ☆13Feb 5, 2025Updated last year
- Model Primitive Hierarchical Reinforcement Learning☆13Dec 8, 2022Updated 3 years ago
- RLHF-Blender: A Configurable Interactive Interface for Learning from Diverse Human Feedback☆14Apr 15, 2026Updated 3 weeks ago
- ☆26Feb 19, 2024Updated 2 years ago
- ☆12Nov 16, 2020Updated 5 years ago
- Template Code for the Paper: MILES: Making Imitation Learning Easy with Self-Supervision☆19Nov 14, 2024Updated last year
- The official implementation of "Mind the Gap: Offline Policy Optimization for Imperfect Rewards" (ICLR2023)☆16Mar 3, 2023Updated 3 years ago
- This repository accompanies the following paper: A Workflow for Offline Model-Free Robotic RL☆13Nov 5, 2021Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- [CVPR 2024] Domain Gap Embeddings for Generative Dataset Augmentation☆22Jun 19, 2024Updated last year
- ☆15Jul 24, 2022Updated 3 years ago
- Official Repository for 'Promptable Behaviors: Personalizing Multi-Objective Rewards from Human Preferences' (CVPR 2024)☆16Mar 29, 2024Updated 2 years ago
- official code for paper Probing the Decision Boundaries of In-context Learning in Large Language Models. https://arxiv.org/abs/2406.11233…☆20Jul 27, 2025Updated 9 months ago
- Mobility-Aware Proactive Edge Caching OptimizationScheme in Information-Centric IoV Networks☆21Jan 20, 2022Updated 4 years ago
- ☆11Feb 6, 2018Updated 8 years ago
- OCTCube-M: A 3D multimodal optical coherence tomography foundation model for retinal and systemic diseases with cross-cohort and cross-de…☆28Jun 30, 2025Updated 10 months ago
- Semantic Synthesis of Pedestrian Locomotion☆13Sep 13, 2023Updated 2 years ago
- Submission Under Review☆17May 15, 2025Updated 11 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Implementation for <Understanding Robust Overftting of Adversarial Training and Beyond> in ICML'22.☆13Jul 1, 2022Updated 3 years ago
- A Hierarchical Attention Model for Social Contextual Image Recommendation, TKDE2019☆13Feb 10, 2021Updated 5 years ago
- Official Codebase for TMLR 2023, Benchmarks and Algorithms for Offline Preference-Based Reward Learning☆20Dec 30, 2022Updated 3 years ago
- Learning from Guided Play: A Scheduled Hierarchical Approach for Improving Exploration in Adversarial Imitation Learning Source Code☆17Aug 23, 2024Updated last year
- [TMLR 2025] A collection of research papers on constraint inference within the field of RL☆11May 9, 2025Updated 11 months ago
- Training Deep Neural Networks via Direct Loss Minimization☆17Oct 24, 2021Updated 4 years ago
- Code for the paper "SmoothMix: Training Confidence-calibrated Smoothed Classifiers for Certified Robustness" (NeurIPS 2021)☆21Sep 27, 2022Updated 3 years ago