arowdy98/Stanford-CS234

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/arowdy98/Stanford-CS234)

arowdy98 / Stanford-CS234

Assignment Solutions to CS234: Reinforcement learning course

☆37

Alternatives and similar repositories for Stanford-CS234

Users that are interested in Stanford-CS234 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

NataliaDiaz / Egoshots
View on GitHub
A 2 month Ego-vision Dataset with Autographer Wearable Camera and 2 users
☆11Apr 28, 2020Updated 6 years ago
danielrherber / admm-qp
View on GitHub
☆13May 14, 2017Updated 9 years ago
neale / avoiding-side-effects
View on GitHub
Code for reproducing the results from the paper Avoiding Side Effects in Complex Environments
☆12Jun 3, 2021Updated 5 years ago
karthikncode / Grounded-RL-Transfer
View on GitHub
☆13Dec 6, 2018Updated 7 years ago
monimoyb / pd_polLearn
View on GitHub
Primal-Dual Policy Learning Simple Example
☆15Apr 12, 2021Updated 5 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
vvrs / Tube-MPC
View on GitHub
☆12Jun 8, 2018Updated 8 years ago
acxz / mppi
View on GitHub
A simple and extensible Octave/Matlab library for Model Predictive Path Integral control scheme.
☆19Dec 16, 2019Updated 6 years ago
oxfordcontrol / qpip
View on GitHub
Matlab interior point solver for quadratic programs
☆14Jul 24, 2017Updated 9 years ago
qiaoguanren / Multi-Modal-Inverse-Constrained-Reinforcement-Learning
View on GitHub
NeurIPS[2023] "Multi-Modal Inverse Constrained Reinforcement Learning from a Mixture of Demonstrations" official implement
☆13Feb 19, 2024Updated 2 years ago
Stanford-ILIAD / ILEED
View on GitHub
Companion code for ICML 2022 paper "Imitation Learning by Estimating Expertise of Demonstrators"
☆11Jul 5, 2023Updated 3 years ago
gioramponi / sigma-girl-MIIRL
View on GitHub
Code of Truly Batch Model-Free Inverse Reinforcement Learning about Multiple Intentions
☆13May 22, 2023Updated 3 years ago
jerrylin1121 / BCO
View on GitHub
Implementation of Behavioral Cloning from Observationmentation
☆16Nov 28, 2019Updated 6 years ago
ofirnachum / models
View on GitHub
Models built with TensorFlow
☆26Dec 5, 2018Updated 7 years ago
watcl-lab / cs886-winter-2025
View on GitHub
CS886: Graph Neural Networks
☆14Mar 28, 2025Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
dsbrown1331 / bayesianrex
View on GitHub
☆21Dec 17, 2020Updated 5 years ago
Jgmorton / vbf
View on GitHub
Implementation of Deep Variational Bayes Filter
☆13Aug 9, 2019Updated 6 years ago
harwiltz / stable-dynamics-models
View on GitHub
PyTorch implementation of "Learning Stable Deep Dynamics Models" (https://papers.nips.cc/paper/9292-learning-stable-deep-dynamics-models)…
☆17May 1, 2020Updated 6 years ago
morim3 / DeepKalmanFilter
View on GitHub
Pytorch Implementation of Deep Kalman Filter
☆12Sep 30, 2025Updated 9 months ago
snt-robotics / denmpc
View on GitHub
An event-based on-line adaptable fast nonlinear model predictive control framework
☆25Oct 29, 2018Updated 7 years ago
ykwon0407 / wdro_local_perturbation
View on GitHub
Principled learning method for Wasserstein distributionally robust optimization with local perturbations (ICML 2020)
☆21Mar 24, 2023Updated 3 years ago
d-biswa / Symplectic-ODENet
View on GitHub
☆16Dec 15, 2020Updated 5 years ago
aaronsnoswell / unimodal-irl
View on GitHub
Algorithms for Uni-Modal Inverse Reinforcement Learning
☆22Sep 23, 2022Updated 3 years ago
oscar-lima / autom_param_optimization
View on GitHub
ROS wrapper for SMAC, a versatile tool for optimizing algorithm parameters
☆11Jul 19, 2021Updated 5 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
stevemessinger / L1_MPC-HELI
View on GitHub
Non-Linear L1-Adaptive Model Predictive Control of a Micro 3D Helicopter
☆18May 16, 2022Updated 4 years ago
HumanCompatibleAI / learning_biases
View on GitHub
Infer how suboptimal agents are suboptimal while planning, for example if they are hyperbolic time discounters.
☆25Sep 26, 2020Updated 5 years ago
Mchristos / empowerment
View on GitHub
intrinsic motivation in grid worlds
☆26May 3, 2020Updated 6 years ago
m5823779 / pose-estimation
View on GitHub
A Convolutional Neural Network for Real Time robot pose estimation by RGB Image
☆13Nov 23, 2022Updated 3 years ago
Feesuu / MemoryTree
View on GitHub
An unofficial implementation of MemTree: From Isolated Conversations to Hierarchical Schemas: Dynamic Tree Memory Representation for LLMs
☆18Jul 14, 2025Updated last year
inverted-ai / torchdriveenv
View on GitHub
TorchDriveEnv is a lightweight 2D driving reinforcement learning environment, supported by a solid simulator and smart non-playable chara…
☆28Apr 8, 2025Updated last year
thowell / IterativeLQR.jl
View on GitHub
A Julia package for constrained iterative LQR (iLQR)
☆44Mar 17, 2023Updated 3 years ago
guytenn / Act2Vec
View on GitHub
☆13May 10, 2019Updated 7 years ago
HumanCompatibleAI / population-irl
View on GitHub
(Experimental) Inverse reinforcement learning from trajectories generated by multiple agents with different (but correlated) rewards
☆27Jun 20, 2019Updated 7 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
pfnet-research / capg
View on GitHub
Implementation of clipped action policy gradient (CAPG) with PPO and TRPO
☆31Jun 24, 2018Updated 8 years ago
IRLL / HIPPO_Gym
View on GitHub
☆20Sep 8, 2023Updated 2 years ago
AtlantiaKing / Procedural-2D-Dungeon-Generator
View on GitHub
A 2D Dungeon Generator using seperation steering behavior, triangulation and the minimum spanning tree algorithm with parameters that inf…
☆14Jan 12, 2023Updated 3 years ago
LIKERobo / SemanticMapGeneration
View on GitHub
Template repository for generating semantic maps
☆16Feb 4, 2019Updated 7 years ago
TUMFTM / MixNet
View on GitHub
Structured Deep Neural Motion Prediction of Opposing Vehicles for an Autonomous Racecar
☆25Apr 27, 2026Updated 3 months ago
mitsuhiko / small-ctor
View on GitHub
Minimal, dependency free implementation of the ctor crate
☆17Aug 1, 2024Updated last year
SchrodingerZhu / memcpy-amd64
View on GitHub
An implementation of memcpy for amd64 with clang/gcc
☆15Feb 7, 2022Updated 4 years ago