Code for MOBILE: Model-Bellman Inconsistency Penalized Offline Policy Optimization
☆23Apr 17, 2024Updated last year
Alternatives and similar repositories for mobile
Users that are interested in mobile are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Some notes and solutions to "Machine Learning" authored by Zhi-Hua Zhou☆11Jul 20, 2021Updated 4 years ago
- Faster RCNN using TensorFlow☆10Jul 31, 2022Updated 3 years ago
- ☆10Mar 11, 2024Updated 2 years ago
- re-implementation of the offline model-based RL algorithm MOPO in pytorch☆26Feb 28, 2022Updated 4 years ago
- An elegant PyTorch offline reinforcement learning library for researchers.☆386Jul 11, 2025Updated 8 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Author's PyTorch implementation of ICML'23 paper "Policy Regularization with Dataset Constraint for Offline Reinforcement Learning" for D…☆18Nov 8, 2024Updated last year
- [NeurIPS'20] Code for the paper "Offline Imitation Learning with a Misspecified Simulator"☆12Nov 24, 2021Updated 4 years ago
- Official code for ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning (AAAI'24)☆17Feb 10, 2024Updated 2 years ago
- PyTorch implementations for Offline Preference-Based RL (PbRL) algorithms☆21Mar 24, 2025Updated last year
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations☆112May 27, 2024Updated last year
- ☆27Apr 22, 2024Updated last year
- Official code for "RAMBO: Robust Adversarial Model-Based Offline RL", NeurIPS 2022☆32Jun 2, 2023Updated 2 years ago
- ☆12May 29, 2025Updated 9 months ago
- Author's PyTorch implementation of TD7 for online and offline RL☆162Sep 12, 2023Updated 2 years ago
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- ☆16May 1, 2023Updated 2 years ago
- The Official Code for Offline Model-based Adaptable Policy Learning (NeurIPS'21 & TPAMI)☆25Jan 16, 2024Updated 2 years ago
- Official code for the ICLR 2025 paper, "Scaling Offline Model-Based RL via Jointly-Optimized World-Action Model Pretraining"☆29Dec 1, 2024Updated last year
- ☆21Mar 19, 2024Updated 2 years ago
- Minimal RLHF implementation built on top of minGPT.☆31Jul 4, 2024Updated last year
- Code for MOPO: Model-based Offline Policy Optimization☆190May 17, 2022Updated 3 years ago
- ☆18Mar 18, 2026Updated last week
- Standalone library of frequently-used wrappers for dm_env environments.☆19Jul 9, 2024Updated last year
- ☆23Apr 2, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Official code for TLDR: Unsupervised Goal-Conditioned RL via Temporal Distance-Aware Representations☆37Jan 24, 2026Updated 2 months ago
- ☆22May 27, 2024Updated last year
- ☆23Feb 8, 2024Updated 2 years ago
- Implementation of Direct Preference Optimization☆17Jul 17, 2023Updated 2 years ago
- Benchmarked implementations of Offline RL Algorithms.☆77Mar 4, 2025Updated last year
- Conservative Q learning in Jax☆57Feb 7, 2023Updated 3 years ago
- Official implementation of Harnessing Mixed Offline Reinforcement Learning Datasets via Trajectory Reweighting☆16Feb 14, 2024Updated 2 years ago
- Pytorch implementation of Dreamer-v2: Visual Model Based RL Algorithm.☆274Jul 29, 2023Updated 2 years ago
- Unofficial Pytorch code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"☆198Dec 8, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A beginner-friendly repository on Deep Reinforcement Learning (RL), written in PyTorch.☆27Mar 19, 2026Updated last week
- Public code for "Reinforcement Learning from Passive Data via Latent Intentions"☆89Nov 19, 2023Updated 2 years ago
- Code for NeurIPS2023 accepted paper: Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning.☆41Feb 18, 2025Updated last year
- [ICLR 2024] Closing the Gap between TD Learning and Supervised Learning - A Generalisation Point of View.☆23Apr 19, 2024Updated last year
- Python interface for accessing the near real-world offline reinforcement learning (NeoRL) benchmark datasets☆133Nov 21, 2024Updated last year
- Action Value Gradient Algorithm☆28May 18, 2025Updated 10 months ago
- Auto-tune the Entropy Temperature of Soft Actor-Critic via Metagradient - 7th ICML AutoML workshop 2020☆33Jul 22, 2021Updated 4 years ago