☆10Oct 15, 2020Updated 5 years ago
Alternatives and similar repositories for Code-for-Error-Bounds-of-Imitating-Policies-and-Environments
Users that are interested in Code-for-Error-Bounds-of-Imitating-Policies-and-Environments are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [NeurIPS'20] Code for the paper "Offline Imitation Learning with a Misspecified Simulator"☆12Nov 24, 2021Updated 4 years ago
- Pytorch Implementation of AAMAS 2021 paper <Energy-Based Imitation Learning>☆11Oct 8, 2021Updated 4 years ago
- Theory of Reinforcement Learning☆18Apr 20, 2021Updated 4 years ago
- A beamer template for LAMDA lab at NJU☆16Oct 17, 2020Updated 5 years ago
- Implementation of ICLR 2025 paper "Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation"☆18Oct 5, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆26Jun 14, 2022Updated 3 years ago
- ☆19Oct 27, 2025Updated 5 months ago
- PyTorch implementations for Offline Preference-Based RL (PbRL) algorithms☆21Mar 24, 2025Updated last year
- Random parameter environments using gym 0.7.4 and mujoco-py 0.5.7☆20Feb 14, 2019Updated 7 years ago
- Pytorch code for "State-only Imitation with Transition Dynamics Mismatch" (ICLR 2020)☆20Feb 29, 2020Updated 6 years ago
- Minimal RLHF implementation built on top of minGPT.☆32Jul 4, 2024Updated last year
- Anti exploration in offline reinforcement learning☆11May 17, 2021Updated 4 years ago
- Code for Adapting Environment Sudden Changes by Learning Context Sensitive Policy☆21Jun 1, 2022Updated 3 years ago
- Counterfactual explanations for Reinforcement Learning agents on Atari☆12Apr 3, 2023Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A python module designed for agile RL algorithm developing.☆26Jul 11, 2024Updated last year
- ☆10Sep 19, 2023Updated 2 years ago
- Author implementation of Monte Carlo Augmented Actor Critic in PyTorch☆18Oct 24, 2022Updated 3 years ago
- A boilerplate (dbs, envs, teleop, models, web-apps) for robotic learning experiments & a Pytorch Implementation of "Learning Latent Plans…☆11Oct 23, 2020Updated 5 years ago
- ☆14Mar 5, 2024Updated 2 years ago
- [ICLR 2024 Spotlight] Code for the paper "Decision ConvFormer: Local Filtering in MetaFormer is Sufficient for Decision Making"☆12Apr 22, 2024Updated last year
- ☆33Aug 30, 2024Updated last year
- Generalised UDRL☆37May 12, 2022Updated 3 years ago
- [NeurIPS 2023] Conformal Prediction for Uncertainty-Aware Planning with Diffusion Dynamics Model☆20Dec 9, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Related papers for offline reforcement learning (we mainly focus on representation and sequence modeling and conventional offline RL)☆18Apr 21, 2022Updated 3 years ago
- ☆17Sep 23, 2022Updated 3 years ago
- ☆10Aug 8, 2021Updated 4 years ago
- A benchmark for evaluating reinforcement learning algorithms that train the policies using imaginary rollouts from LLMs.☆14Nov 4, 2025Updated 5 months ago
- ☆12Sep 15, 2021Updated 4 years ago
- Parses a document (scanned or phone captured) and returns the underlying question - answer layout structured capture by LayoutXLM model☆10Jun 14, 2021Updated 4 years ago
- [ICML 2022] Robust Task Representations for Offline Meta-Reinforcement Learning via Contrastive Learning☆40Aug 17, 2022Updated 3 years ago
- ☆25Nov 6, 2025Updated 5 months ago
- ☆16May 4, 2021Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [ICLR 2024 Spotlight] Code for ICLR 2024 paper "Towards Robust Offline Reinforcement Learning under Diverse Data Corruption"☆20Nov 25, 2024Updated last year
- ☆10Jun 4, 2024Updated last year
- [ICLR 2022] "Bayesian Modeling and Uncertainty Quantification for Learning to Optimize: What, Why, and How" by Yuning You, Yue Cao, Tianl…☆14Aug 19, 2022Updated 3 years ago
- Source codes for "Preference-grounded Token-level Guidance for Language Model Fine-tuning" (NeurIPS 2023).☆17Jan 8, 2025Updated last year
- This is the source code of FUSION, a safety-aware causal representation for generalizable driving agents.☆26Oct 23, 2024Updated last year
- D2C(Data-driven Control Library) is a library for data-driven control based on reinforcement learning.☆32Oct 18, 2023Updated 2 years ago
- send and receive message and file by python3 socket☆12May 24, 2018Updated 7 years ago