Code for paper: Reward Uncertainty for Exploration in Preference-based Reinforcement Learning
☆15May 26, 2022Updated 4 years ago
Alternatives and similar repositories for rune
Users that are interested in rune are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Pref-RL provides ready-to-use PbRL agents that are easily extensible.☆11Aug 31, 2022Updated 3 years ago
- ☆13Sep 24, 2024Updated last year
- PyTorch implementations for Offline Preference-Based RL (PbRL) algorithms☆21Mar 24, 2025Updated last year
- The source code of the paper "Towards Problem of First Miss under Mobile EdgeCaching"☆11Apr 12, 2021Updated 5 years ago
- code for "Decoupled Preference-based Reinforcement Learning for Personalized Human-Robot Interaction"☆11Jul 9, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆54Nov 10, 2022Updated 3 years ago
- Implementation of ICLR 2025 paper "Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation"☆18Oct 5, 2024Updated last year
- code for polite☆11Feb 28, 2024Updated 2 years ago
- SPA: Efficient User-Preference Alignment against Uncertainty in Medical Image Segmentation (ICCV 2025)☆15Sep 26, 2025Updated 8 months ago
- Official code for ICML 2024 paper, "RIME: Robust Preference-based Reinforcement Learning with Noisy Preferences" (ICML 2024 Spotlight)☆35Oct 15, 2024Updated last year
- Single-Life Reinforcement Learning☆14Dec 17, 2022Updated 3 years ago
- Context A real online retail transaction data set of two years. Content This Online Retail II data set contains all the transactions oc…☆18Jul 5, 2020Updated 5 years ago
- ☆14Jun 25, 2022Updated 3 years ago
- Code for AAAI 2021 long paper Learning from Crowds by Modeling Common Confusions.☆11Feb 6, 2021Updated 5 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- TensorFlow implementation for our paper "Learning Long-Term Reward Redistribution via Randomized Return Decomposition"☆19Mar 17, 2022Updated 4 years ago
- Code for the content caching algorithm in edge caching.☆22Sep 24, 2024Updated last year
- ☆11Aug 10, 2020Updated 5 years ago
- ☆14Oct 11, 2022Updated 3 years ago
- ☆13Feb 5, 2025Updated last year
- Collapsed Gibbs sampling for Latent Dirichlet Allocation☆18Jun 11, 2012Updated 13 years ago
- Code for our ACL 2019 long paper: "Ensuring Readability and Data-fidelity using Head-modifier Templates in Deep Type Description Generati…☆11Nov 5, 2022Updated 3 years ago
- Model Primitive Hierarchical Reinforcement Learning☆13Dec 8, 2022Updated 3 years ago
- Deep learning based predictive analytics for efficient content caching in edge network☆18Dec 26, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- RLHF-Blender: A Configurable Interactive Interface for Learning from Diverse Human Feedback☆14May 19, 2026Updated last week
- Attempted implementation of a Bi-directional GRU followed by a linear-chain-CRF (from scratch) for Named Entity Recognition.☆15Dec 5, 2017Updated 8 years ago
- Policy Transfer across Visual and Dynamics Domain Gaps via Iterative Grounding (RSS 2021)☆12Oct 22, 2021Updated 4 years ago
- ☆26Feb 19, 2024Updated 2 years ago
- Template Code for the Paper: MILES: Making Imitation Learning Easy with Self-Supervision☆19Nov 14, 2024Updated last year
- ☆12Nov 16, 2020Updated 5 years ago
- The official implementation of "Mind the Gap: Offline Policy Optimization for Imperfect Rewards" (ICLR2023)☆16Mar 3, 2023Updated 3 years ago
- Official repository for "Investigating Pre-Training Objectives for Generalization in Visual Reinforcement Learning" (ICML 2024)☆11Sep 16, 2025Updated 8 months ago
- GPT-Critic: Offline Reinforcement Learning for End-to-End Task-Oriented Dialogue Systems☆10Jul 7, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- [CVPR 2024] Domain Gap Embeddings for Generative Dataset Augmentation☆22Jun 19, 2024Updated last year
- Official Repository for 'Promptable Behaviors: Personalizing Multi-Objective Rewards from Human Preferences' (CVPR 2024)