Code for MOBILE: Model-Bellman Inconsistency Penalized Offline Policy Optimization
☆23 · Apr 17, 2024 · Updated 2 years ago
Alternatives and similar repositories for mobile
Users that are interested in mobile are comparing it to the libraries listed below.
- Some notes and solutions to "Machine Learning" authored by Zhi-Hua Zhou ☆11 · Jul 20, 2021 · Updated 4 years ago
- Faster RCNN using TensorFlow ☆10 · Jul 31, 2022 · Updated 3 years ago
- ☆10 · Mar 11, 2024 · Updated 2 years ago
- re-implementation of the offline model-based RL algorithm MOPO in pytorch ☆26 · Feb 28, 2022 · Updated 4 years ago
- An elegant PyTorch offline reinforcement learning library for researchers. ☆390 · May 2, 2026 · Updated 3 weeks ago
- Author's PyTorch implementation of ICML'23 paper "Policy Regularization with Dataset Constraint for Offline Reinforcement Learning" for D… ☆18 · Nov 8, 2024 · Updated last year
- [NeurIPS'20] Code for the paper "Offline Imitation Learning with a Misspecified Simulator" ☆12 · Nov 24, 2021 · Updated 4 years ago
- Official code for ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning (AAAI'24) ☆17 · Feb 10, 2024 · Updated 2 years ago
- PyTorch implementations for Offline Preference-Based RL (PbRL) algorithms ☆21 · Mar 24, 2025 · Updated last year
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations ☆115 · Apr 16, 2026 · Updated last month
- ☆27 · Apr 22, 2024 · Updated 2 years ago
- ☆11 · May 29, 2025 · Updated 11 months ago
- Author's PyTorch implementation of TD7 for online and offline RL ☆163 · Sep 12, 2023 · Updated 2 years ago
- ☆17 · May 1, 2023 · Updated 3 years ago
- The Official Code for Offline Model-based Adaptable Policy Learning (NeurIPS'21 & TPAMI) ☆25 · Jan 16, 2024 · Updated 2 years ago
- Official code for the ICLR 2025 paper, "Scaling Offline Model-Based RL via Jointly-Optimized World-Action Model Pretraining" ☆29 · Dec 1, 2024 · Updated last year
- ☆21 · Mar 19, 2024 · Updated 2 years ago
- Code for MOPO: Model-based Offline Policy Optimization ☆190 · May 17, 2022 · Updated 4 years ago
- Minimal RLHF implementation built on top of minGPT. ☆32 · Jul 4, 2024 · Updated last year
- ☆18 · Apr 17, 2026 · Updated last month
- Standalone library of frequently-used wrappers for dm_env environments. ☆19 · Jul 9, 2024 · Updated last year
- ☆23 · Apr 2, 2024 · Updated 2 years ago
- Official code for TLDR: Unsupervised Goal-Conditioned RL via Temporal Distance-Aware Representations ☆36 · Jan 24, 2026 · Updated 4 months ago
- ☆22 · May 27, 2024 · Updated last year
- ☆24 · Feb 8, 2024 · Updated 2 years ago
- Implementation of Direct Preference Optimization ☆17 · Jul 17, 2023 · Updated 2 years ago
- Conservative Q learning in Jax ☆57 · Feb 7, 2023 · Updated 3 years ago
- Official implementation of Harnessing Mixed Offline Reinforcement Learning Datasets via Trajectory Reweighting ☆16 · Feb 14, 2024 · Updated 2 years ago
- Pytorch implementation of Dreamer-v2: Visual Model Based RL Algorithm. ☆275 · Jul 29, 2023 · Updated 2 years ago
- Unofficial Pytorch code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models" ☆197 · Dec 8, 2022 · Updated 3 years ago
- A beginner-friendly repository on Deep Reinforcement Learning (RL), written in PyTorch. ☆27 · Mar 19, 2026 · Updated 2 months ago
- Public code for "Reinforcement Learning from Passive Data via Latent Intentions" ☆89 · Nov 19, 2023 · Updated 2 years ago
- Code for NeurIPS2023 accepted paper: Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning. ☆41 · Feb 18, 2025 · Updated last year
- [ICLR 2024] Closing the Gap between TD Learning and Supervised Learning - A Generalisation Point of View. ☆25 · Apr 19, 2024 · Updated 2 years ago
- Action Value Gradient Algorithm ☆28 · May 18, 2025 · Updated last year
- Python interface for accessing the near real-world offline reinforcement learning (NeoRL) benchmark datasets ☆136 · Nov 21, 2024 · Updated last year
- Auto-tune the Entropy Temperature of Soft Actor-Critic via Metagradient - 7th ICML AutoML workshop 2020 ☆33 · Jul 22, 2021 · Updated 4 years ago
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning" ☆107 · May 17, 2022 · Updated 4 years ago
- Model-based Offline Policy Optimization re-implement all by pytorch ☆41 · Sep 13, 2023 · Updated 2 years ago