Assignment Solutions to CS234: Reinforcement learning course
☆36Aug 24, 2018Updated 7 years ago
Alternatives and similar repositories for Stanford-CS234
Users that are interested in Stanford-CS234 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Stanford CS234: Reinforcement Learning Winter 2020☆19Mar 24, 2023Updated 3 years ago
- My Solution to Assignments of CS234☆94May 9, 2019Updated 7 years ago
- 🐲 Stanford CS234 : Reinforcement Learning☆13Jan 14, 2019Updated 7 years ago
- Implementation of the paper "Meta-Learning by Adjusting Priors Based on Extended PAC-Bayes Theory", Ron Amit and Ron Meir, ICML 2018☆22Oct 30, 2019Updated 6 years ago
- NeurIPS[2023] "Multi-Modal Inverse Constrained Reinforcement Learning from a Mixture of Demonstrations" official implement☆11Feb 19, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A 2 month Ego-vision Dataset with Autographer Wearable Camera and 2 users☆11Apr 28, 2020Updated 6 years ago
- Code for reproducing the results from the paper Avoiding Side Effects in Complex Environments☆12Jun 3, 2021Updated 4 years ago
- Experiments in applying interpretability techniques to learned reward functions.☆10Dec 11, 2020Updated 5 years ago
- ☆13Dec 6, 2018Updated 7 years ago
- Primal-Dual Policy Learning Simple Example☆15Apr 12, 2021Updated 5 years ago
- Almost Surely Stable Deep Dynamics [NeurIPS 2020]☆12Dec 8, 2022Updated 3 years ago
- Code for our SIGGRAPH 2023 paper, "Acting as Inverse Inverse Planning"☆20Apr 21, 2023Updated 3 years ago
- Accompanying code for the RSS 2019 paper, "Learning Reward Functions by Integrating Human Demonstrations and Preferences"☆12May 20, 2019Updated 6 years ago
- This is the Pytorch implementation of paper--Training deep neural-networks using a noise adaptation layer.☆10Apr 18, 2021Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- CNN using C++ and CUDA☆16May 21, 2019Updated 6 years ago
- Bivariate Shapley is a Shapley-based method of identifying directional feature interactions and feature redundancy☆20May 19, 2025Updated 11 months ago
- Leave No Trace is an algorithm for safe reinforcement learning.☆15Apr 30, 2018Updated 8 years ago
- Trajectory-ranked Reward EXtrapolation (T-REX) for Inverse Reinforcement Learning - A Tensorflow implementation trained on OpenAI Gym env…☆19Jul 4, 2019Updated 6 years ago
- ☆21Dec 17, 2020Updated 5 years ago
- Monte Carlo value iteration for continuous-state POMDPs☆12Sep 3, 2013Updated 12 years ago
- Implementation of Deep Variational Bayes Filter☆13Aug 9, 2019Updated 6 years ago
- PyTorch implementation of "Learning Stable Deep Dynamics Models" (https://papers.nips.cc/paper/9292-learning-stable-deep-dynamics-models)…☆17May 1, 2020Updated 6 years ago
- Adversarial Imitation Learning from Incomplete Demonstrations☆15Apr 2, 2020Updated 6 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Pytorch Implementation of Deep Kalman Filter☆12Sep 30, 2025Updated 7 months ago
- Algorithms for Uni-Modal Inverse Reinforcement Learning☆22Sep 23, 2022Updated 3 years ago
- Container system setup to use tensorflow and anaconda (and nvidia for gpu enabled systems)☆10Dec 19, 2016Updated 9 years ago
- ☆18Nov 10, 2023Updated 2 years ago
- Source code and code description of Team6_ISU for NVIDIA AICity Challenge 2017 track 1☆22Sep 28, 2021Updated 4 years ago
- Some hard problems for reinforcement learning.☆32Oct 5, 2018Updated 7 years ago
- ☆33Sep 22, 2019Updated 6 years ago
- Infer how suboptimal agents are suboptimal while planning, for example if they are hyperbolic time discounters.☆25Sep 26, 2020Updated 5 years ago
- ☆40Apr 15, 2020Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- A Small diffusion model in PyTorch.☆16Apr 18, 2024Updated 2 years ago
- intrinsic motivation in grid worlds☆26May 3, 2020Updated 6 years ago
- Convolutional Neural Network for Click-Through Rate prediction.☆15Sep 28, 2016Updated 9 years ago
- GPU-accelerated LLM Training Simulator☆51Jun 26, 2025Updated 10 months ago
- (Experimental) Inverse reinforcement learning from trajectories generated by multiple agents with different (but correlated) rewards☆27Jun 20, 2019Updated 6 years ago
- This is an implementation of the paper "Coordinated Multi Agent Imitation Learning", or the Sloan version "Data-Driven Ghosting using Dee…☆41Jun 28, 2018Updated 7 years ago
- TorchDriveEnv is a lightweight 2D driving reinforcement learning environment, supported by a solid simulator and smart non-playable chara…☆27Apr 8, 2025Updated last year