Assignment Solutions to CS234: Reinforcement learning course
☆36Aug 24, 2018Updated 7 years ago
Alternatives and similar repositories for Stanford-CS234
Users who are interested in Stanford-CS234 are comparing it to the libraries listed below.
- Stanford CS234: Reinforcement Learning Winter 2020☆19Mar 24, 2023Updated 3 years ago
- My Solutions of Assignments of CS234: Reinforcement Learning Winter 2019☆170Mar 24, 2023Updated 3 years ago
- 🐲 Stanford CS234 : Reinforcement Learning☆12Jan 14, 2019Updated 7 years ago
- Stanford CS234: Reinforcement Learning assignments and practices☆63Jul 31, 2024Updated last year
- A collection of reading material for the Workshop on "Structure & Priors in Reinforcement Learning" (SPiRL) at ICLR 2019.☆13May 5, 2021Updated 4 years ago
- NeurIPS[2023] "Multi-Modal Inverse Constrained Reinforcement Learning from a Mixture of Demonstrations" official implementation☆10Feb 19, 2024Updated 2 years ago
- A 2 month Ego-vision Dataset with Autographer Wearable Camera and 2 users☆11Apr 28, 2020Updated 5 years ago
- Code for reproducing the results from the paper Avoiding Side Effects in Complex Environments☆12Jun 3, 2021Updated 4 years ago
- Experiments in applying interpretability techniques to learned reward functions.☆10Dec 11, 2020Updated 5 years ago
- ☆13Dec 6, 2018Updated 7 years ago
- Codebase for Numerical Renaissance by Thomas Bewley☆16Mar 11, 2024Updated 2 years ago
- Dataset Bias correction (Python)☆20Jan 7, 2018Updated 8 years ago
- ☆12Jun 8, 2018Updated 7 years ago
- Primal-Dual Policy Learning Simple Example☆15Apr 12, 2021Updated 4 years ago
- ☆13May 14, 2017Updated 8 years ago
- Underactuated Robotics course project fall 2018 - SLQ MPC algorithm implementation for pendulum and cart pole☆12Mar 11, 2019Updated 7 years ago
- A simple and extensible Octave/Matlab library for Model Predictive Path Integral control scheme.☆18Dec 16, 2019Updated 6 years ago
- Companion code for ICML 2022 paper "Imitation Learning by Estimating Expertise of Demonstrators"☆11Jul 5, 2023Updated 2 years ago
- ☆11Sep 16, 2023Updated 2 years ago
- Code of Truly Batch Model-Free Inverse Reinforcement Learning about Multiple Intentions☆13May 22, 2023Updated 2 years ago
- Code for our SIGGRAPH 2023 paper, "Acting as Inverse Inverse Planning"☆20Apr 21, 2023Updated 2 years ago
- Accompanying code for the RSS 2019 paper, "Learning Reward Functions by Integrating Human Demonstrations and Preferences"☆12May 20, 2019Updated 6 years ago
- nonlinear solver for the constrained problem☆20Sep 18, 2023Updated 2 years ago
- Matlab interior point solver for quadratic programs☆14Jul 24, 2017Updated 8 years ago
- Leave No Trace is an algorithm for safe reinforcement learning.☆15Apr 30, 2018Updated 7 years ago
- Study materials about "Deep Learning for Molecular Applications".☆15Aug 5, 2019Updated 6 years ago
- ☆21Dec 17, 2020Updated 5 years ago
- Monte Carlo value iteration for continuous-state POMDPs☆12Sep 3, 2013Updated 12 years ago
- ROS 2 simulation packages for the Neobotix robots☆38May 12, 2025Updated 10 months ago
- PyTorch implementation of "Learning Stable Deep Dynamics Models" (https://papers.nips.cc/paper/9292-learning-stable-deep-dynamics-models)…☆17May 1, 2020Updated 5 years ago
- Pytorch Implementation of Deep Kalman Filter☆12Sep 30, 2025Updated 6 months ago
- Algorithms for Uni-Modal Inverse Reinforcement Learning☆22Sep 23, 2022Updated 3 years ago
- ☆18Nov 10, 2023Updated 2 years ago
- ROS wrapper for SMAC, a versatile tool for optimizing algorithm parameters☆11Jul 19, 2021Updated 4 years ago
- ☆14Jan 20, 2018Updated 8 years ago
- Source code and code description of Team6_ISU for NVIDIA AICity Challenge 2017 track 1☆22Sep 28, 2021Updated 4 years ago
- Some hard problems for reinforcement learning.☆32Oct 5, 2018Updated 7 years ago
- ☆33Sep 22, 2019Updated 6 years ago
- Infer how suboptimal agents are suboptimal while planning, for example if they are hyperbolic time discounters.☆25Sep 26, 2020Updated 5 years ago