Assignment Solutions to CS234: Reinforcement learning course
β36Aug 24, 2018Updated 7 years ago
Alternatives and similar repositories for Stanford-CS234
Users that are interested in Stanford-CS234 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Stanford CS234: Reinforcement Learning Winter 2020β19Mar 24, 2023Updated 3 years ago
- π² Stanford CS234 : Reinforcement Learningβ13Jan 14, 2019Updated 7 years ago
- β10Jan 31, 2019Updated 7 years ago
- A collection of reading material for the Workshop on "Structure & Priors in Reinforcement Learning" (SPiRL) at ICLR 2019.β13May 5, 2021Updated 5 years ago
- NeurIPS[2023] "Multi-Modal Inverse Constrained Reinforcement Learning from a Mixture of Demonstrations" official implementβ13Feb 19, 2024Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI β’ AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- In this repository, I place my solution for the exercises in multiple famous math textbooks, including Stochastic Differential Equation, β¦β14Nov 13, 2023Updated 2 years ago
- Code for reproducing the results from the paper Avoiding Side Effects in Complex Environmentsβ12Jun 3, 2021Updated 5 years ago
- Codebase for Numerical Renaissance by Thomas Bewleyβ16Mar 11, 2024Updated 2 years ago
- Solutions to coding assignments of Stanford Reinforcement Learning course Winter 2021β13Aug 29, 2021Updated 4 years ago
- β12Jun 8, 2018Updated 8 years ago
- Primal-Dual Policy Learning Simple Exampleβ15Apr 12, 2021Updated 5 years ago
- A simple and extensible Octave/Matlab library for Model Predictive Path Integral control scheme.β19Dec 16, 2019Updated 6 years ago
- Almost Surely Stable Deep Dynamics [NeurIPS 2020]β12Dec 8, 2022Updated 3 years ago
- Companion code for ICML 2022 paper "Imitation Learning by Estimating Expertise of Demonstrators"β11Jul 5, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Code for our SIGGRAPH 2023 paper, "Acting as Inverse Inverse Planning"β20Apr 21, 2023Updated 3 years ago
- URDFs for the Stretch mobile manipulators from Hello Robot Inc.β16Jun 5, 2026Updated last week
- nonlinear solver for the constrained problemβ21Jun 7, 2026Updated last week
- This is the Pytorch implementation of paper--Training deep neural-networks using a noise adaptation layer.β10Apr 18, 2021Updated 5 years ago
- Bivariate Shapley is a Shapley-based method of identifying directional feature interactions and feature redundancyβ20May 19, 2025Updated last year
- Models built with TensorFlowβ26Dec 5, 2018Updated 7 years ago
- Study materials about "Deep Learning for Molecular Applications".β15Aug 5, 2019Updated 6 years ago
- A brief understanding of ffmpeg cli through pseudocodeβ11Dec 20, 2020Updated 5 years ago
- Implementation of Deep Variational Bayes Filterβ13Aug 9, 2019Updated 6 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits β’ AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Accompanying code for "Weak form generalized Hamiltonian learning"β11Feb 13, 2021Updated 5 years ago
- Library for parsing and analyzing DOT scriptsβ18Updated this week
- Adversarial Imitation Learning from Incomplete Demonstrationsβ15Apr 2, 2020Updated 6 years ago
- Pytorch Implementation of Deep Kalman Filterβ12Sep 30, 2025Updated 8 months ago
- Algorithms for Uni-Modal Inverse Reinforcement Learningβ22Sep 23, 2022Updated 3 years ago
- Container system setup to use tensorflow and anaconda (and nvidia for gpu enabled systems)β10Dec 19, 2016Updated 9 years ago
- β19Nov 10, 2023Updated 2 years ago
- Some hard problems for reinforcement learning.β32Oct 5, 2018Updated 7 years ago
- β33Sep 22, 2019Updated 6 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Yet another Game Boy Emulatorβ14May 30, 2026Updated 2 weeks ago
- PyTorch implementation for Graph Gated Neural Network (for Knowledge Graphs)β48Oct 19, 2022Updated 3 years ago
- Tiva Ware C Series blinky example using CMake and Visual Studio Codeβ16Aug 5, 2019Updated 6 years ago
- Infer how suboptimal agents are suboptimal while planning, for example if they are hyperbolic time discounters.β25Sep 26, 2020Updated 5 years ago
- β16Dec 15, 2020Updated 5 years ago
- Dataset of conversations, generated by prompting Gemini Ultra. These are conversations between a teacher and a student, where the teacherβ¦β36Oct 29, 2024Updated last year
- GPU-accelerated LLM Training Simulatorβ52Jun 26, 2025Updated 11 months ago