Official repository for paper "Conservative Offline Distributional Reinforcement Learning" (NeurIPS 2021)
☆22Aug 1, 2021Updated 4 years ago
Alternatives and similar repositories for CODAC
Users that are interested in CODAC are comparing it to the libraries listed below.
- ☆18Nov 22, 2023Updated 2 years ago
- Official Github Repository for "Spectral-Risk Safe Reinforcement Learning with Convergence Guarantees". (NeurIPS 2024)☆11Nov 30, 2025Updated 4 months ago
- Official implementation of the paper: Safe Model-Based Reinforcement Learning with an Uncertainty-Aware Reachability Certificate☆25Dec 4, 2023Updated 2 years ago
- Code to accompany the paper "The Information Geometry of Unsupervised Reinforcement Learning"☆20Oct 6, 2021Updated 4 years ago
- Implementation of Tactical Optimistic and Pessimistic value estimation☆25Jul 18, 2023Updated 2 years ago
- Scaling safe exploration to vision control☆14Feb 19, 2025Updated last year
- Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning☆29Feb 21, 2022Updated 4 years ago
- Deep PILCO PyTorch Implementation☆15Mar 25, 2023Updated 3 years ago
- Author's PyTorch implementation of SR-DICE for marginalized importance sampling☆28Dec 7, 2021Updated 4 years ago
- Implementation of Robust Adversarial Reinforcement Learning☆14Nov 27, 2017Updated 8 years ago
- PyTorch Implementation of Implicit Quantile Networks (IQN) for Distributional Reinforcement Learning with additional extensions like PER,…☆94Mar 4, 2023Updated 3 years ago
- PyTorch implementation of FQF, IQN and QR-DQN.☆190Jul 25, 2024Updated last year
- safety analysis for hard-to-specify failures☆28Mar 8, 2026Updated last month
- Official code for "FIND: Fine-tuning Initial Noise Distribution with Policy Optimization for Diffusion Models" MM2024☆15Nov 3, 2024Updated last year
- Self-implemented code for Model-Based Meta-Reinforcement Learning☆17Apr 28, 2019Updated 6 years ago
- Offline Risk-Averse Actor-Critic (O-RAAC). A model-free RL algorithm for risk-averse RL in a fully offline setting☆35Feb 9, 2021Updated 5 years ago
- Computer Organization (計算機組織), academic year 108, second semester, 李毅郎☆11Feb 22, 2021Updated 5 years ago
- ☆19Jun 15, 2023Updated 2 years ago
- Diffusing States and Matching Scores: A New Framework for Imitation Learning☆22Nov 16, 2024Updated last year
- NeurIPS[2023] "Multi-Modal Inverse Constrained Reinforcement Learning from a Mixture of Demonstrations" official implementation☆10Feb 19, 2024Updated 2 years ago
- ☆10Apr 8, 2024Updated 2 years ago
- Code associated with our paper "Estimating Risk and Uncertainty in Reinforcement Learning"☆11Oct 3, 2023Updated 2 years ago
- ICML 2022: Learning Iterative Reasoning through Energy Minimization☆48Feb 27, 2023Updated 3 years ago
- Uncertainty-aware Latent Safety Filters for Avoiding Out-of-Distribution Failures (CoRL 2025)☆21Sep 23, 2025Updated 6 months ago
- A Deep Reinforcement Learning Strategy and Framework for Floating Waste Capture☆13Mar 13, 2025Updated last year
- Official code for "World Models via Policy-Guided Trajectory Diffusion", TMLR 2024☆75Mar 22, 2024Updated 2 years ago
- An index of algorithms for offline reinforcement learning (offline-rl)☆1,063May 23, 2024Updated last year
- OpenAI-gym-like Reinforcement Learning environment for Dispatching of Mobile Chargers with SUMO. Compatible with Gym and popular RL libra…☆15Mar 16, 2025Updated last year
- Gym environments modified with adversarial agents☆36Mar 21, 2017Updated 9 years ago
- Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"☆31Jul 27, 2021Updated 4 years ago
- ☆12Sep 15, 2021Updated 4 years ago
- ☆11Feb 11, 2024Updated 2 years ago
- Code for RSS 2025 paper "Can We Detect Failures Without Failure Data? Uncertainty-Aware Runtime Failure Detection for Imitation Learning …☆40Jun 18, 2025Updated 9 months ago
- Official repository for paper "Versatile Offline Imitation from Observations and Examples via Regularized State-Occupancy Matching" (ICML…☆29Jan 12, 2023Updated 3 years ago
- The Intermediate Goal of the project is to train a GPT like architecture to learn to summarise reddit posts from human preferences, as th…☆12Jul 14, 2021Updated 4 years ago
- Code space for L4DC paper "State-wise Safe Reinforcement Learning With Pixel Observations"☆11Apr 5, 2024Updated 2 years ago
- repository for "Exploiting Proximity-Aware Tasks for Embodied Social Navigation" paper code☆11Nov 16, 2023Updated 2 years ago
- KitchenShift: Evaluating Zero-Shot Generalization of Imitation-Based Policy Learning Under Domain Shifts☆19Jun 21, 2022Updated 3 years ago
- ☆21Nov 30, 2024Updated last year