Exercise Solutions for Reinforcement Learning: An Introduction [2nd Edition]
☆16Jul 17, 2020Updated 5 years ago
Alternatives and similar repositories for rlai-exercises
Users that are interested in rlai-exercises are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- paper <<Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation>> python implementation☆10Mar 27, 2018Updated 8 years ago
- RefTeacher is a strong baseline method for Semi-Supervised Referring Expression Comprehension.☆13May 26, 2023Updated 2 years ago
- Plan time-optimal paths with both speed and turn-rate controls☆10May 15, 2021Updated 4 years ago
- Official code for AAAI 2026 paper (One-Step Generative Policies with Q-Learning: A Reformulation of MeanFlow)☆31Dec 15, 2025Updated 4 months ago
- Reinforcement Learning Practice for Multi and Single-Agent Autonomous vehicle☆13Dec 11, 2020Updated 5 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Code for submission to 2024 submission to Automatica titled "Closed-loop Data-enabled Predictive Control and its equivalence with Closed-…☆14Sep 26, 2024Updated last year
- This is a collection of Matlab functions that are useful in the development of target tracking algorithms.☆15Sep 3, 2015Updated 10 years ago
- Benchmark result of different RL algorithms on MetaDrive environments, including Multi-agent RL (IPPO, centralized critics, CoPO).☆16Oct 25, 2022Updated 3 years ago
- ☆20Mar 10, 2025Updated last year
- ☆18Oct 6, 2021Updated 4 years ago
- ☆11May 15, 2024Updated last year
- ☆29Apr 23, 2025Updated 11 months ago
- GCN CAV☆14Mar 29, 2021Updated 5 years ago
- Code for Learned Thresholds Token Merging and Pruning for Vision Transformers (LTMP). A technique to reduce the size of Vision Transforme…☆17Nov 24, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Kuka Reacher Reinforcement Learning Sim2Real Environment for Omniverse Isaac Gym/Sim☆21Nov 22, 2023Updated 2 years ago
- 哈工大深圳 校园网全自动登录☆10Feb 7, 2023Updated 3 years ago
- [CVPR 2023] Official code release of Cafi-Net: Self-Supervised Learning of Pose-Canonicalized Neural Fields☆15Jul 14, 2023Updated 2 years ago
- This is a tool for creating symbolic links for Isaac Sim Python packages, designed to improve code auto-completion in IDEs like VSCode. B…☆22Aug 4, 2025Updated 8 months ago
- Procedural Data Generation for Cloth Manipulation - codebase for IEEE RA-L paper☆18Jun 3, 2025Updated 10 months ago
- ☆12Sep 7, 2024Updated last year
- My Solutions to Sutton and Barto exercises, 2nd edition☆14Apr 27, 2018Updated 7 years ago
- PPDDL plan evalutation simulator☆15Dec 30, 2019Updated 6 years ago
- Benchmarking general decision-making with open & random worlds☆20Mar 27, 2026Updated 3 weeks ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Non-official implementation of paper "In-context Reinforcement Learning with Algorithm Distillation"☆12Aug 15, 2024Updated last year
- pybullet grasping with time contrastive network embeddings☆22Jun 18, 2019Updated 6 years ago
- Simulate Grasp Dataset Generation (未完成版)主要实现的功能是,使用Antipodal算法对虚拟的物理环境中的Mesh模型进行6-Dof抓取采样☆14Jun 14, 2022Updated 3 years ago
- ☆30Aug 20, 2021Updated 4 years ago
- Code of the article "Benchmarking the Sim-to-Real Gap in Cloth Manipulation"☆23May 26, 2025Updated 10 months ago
- PyBullet simulator for Franka Emika Panda☆15Jul 30, 2020Updated 5 years ago
- Implementation of Proximal Policy Optimization algorithm on a custom Unity environment.☆17Feb 3, 2022Updated 4 years ago
- 🧙🏻 Code and benchmark for our Findings of ACL 2024 paper - "TimeChara: Evaluating Point-in-Time Character Hallucination of Role-Playing…☆21Dec 20, 2024Updated last year
- Python D* Lite☆28Sep 18, 2018Updated 7 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- [ICLR25 Oral] RL framework for manipulation of diverse shapes and deformable objects☆30Apr 11, 2025Updated last year
- A short conceptual replication of "Prefrontal cortex as a meta-reinforcement learning system" in Jax.☆18Feb 27, 2023Updated 3 years ago
- academic pages for c2 group☆11Apr 11, 2023Updated 3 years ago
- ☆20Apr 24, 2022Updated 3 years ago
- In this repository you can find the resuls of the simulated evaluation of an innovative, optimized for real-life use, STC-based, multi-ro…☆15Jan 20, 2022Updated 4 years ago
- [NeurIPS 2022 Spotlight] Hand-Object Interaction Image Generation☆33Nov 29, 2022Updated 3 years ago
- Official implemention for Diffusion Models Are Innate One-Step Generators☆26Jun 25, 2025Updated 9 months ago