Stanford CS234: Reinforcement Learning
☆183 Oct 3, 2019 Updated 6 years ago
Alternatives and similar repositories for stanford-cs234
Users that are interested in stanford-cs234 are comparing it to the libraries listed below.
- Stanford CS234: Reinforcement Learning assignments and practices ☆63 Jul 31, 2024 Updated last year
- Stanford CS234: Reinforcement Learning Winter 2020 ☆19 Mar 24, 2023 Updated 3 years ago
- My lecture notes on the RL series provided by Stanford. ☆15 Aug 31, 2022 Updated 3 years ago
- Machine Learning and Reinforcement Learning in Finance Specialization (MOOC) Assignments ☆12 Nov 4, 2021 Updated 4 years ago
- ☆10 Jul 26, 2024 Updated last year
- Simple tic-tac-toe using `brick` ☆12 Nov 18, 2021 Updated 4 years ago
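- Reproduction of the DQN model and results from the Huatai Securities research report "A First Look at Reinforcement Learning and DQN Market Timing" ☆40 Oct 4, 2022 Updated 3 years ago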
- ☆10 Jan 31, 2019 Updated 7 years ago
- Collection of dynamic-graph entities aimed at implementing torque control on different robots. ☆14 Nov 24, 2025 Updated 4 months ago
- Turtlesim-PID-Controller: A ROS 2 node designed to navigate a turtle in the turtlesim simulator using Proportional control. Aims to drive… ☆14 Feb 26, 2025 Updated last year
- Anything I read, whether it's a paper, a book, or an article, I'll post here. ☆11 Feb 13, 2025 Updated last year
- https://cs330.stanford.edu/ ☆62 Mar 24, 2023 Updated 3 years ago
- Pytorch implementation of SelectiveNet https://arxiv.org/abs/1901.09192 ☆12 Oct 28, 2020 Updated 5 years ago
- Dataset Bias correction (Python) ☆20 Jan 7, 2018 Updated 8 years ago
- Experiments with reinforcement learning and recurrent neural networks ☆115 Oct 27, 2023 Updated 2 years ago
- Model-Predictive Control for realtime synthesis of agile motor control using MuJoCo. ☆10 May 17, 2024 Updated last year
- ☆11 Jan 5, 2017 Updated 9 years ago
- A collection of the pytorch implementation of neural bandit algorithm includes neuralUCB(Neural Contextual Bandits with UCB-based Explora… ☆27 Jul 15, 2025 Updated 8 months ago
- ☆12 Aug 6, 2024 Updated last year
- Solutions from "C++ Crash Course A Fast-Paced Introduction" by Josh Lospinoso. ☆12 May 10, 2023 Updated 2 years ago
- Learning to race on F1 tracks using Deep Reinforcement Learning ☆17 Dec 24, 2022 Updated 3 years ago
- This is a quadruped simulated on pybullet physics engine, walking using trot and bound mechanisms ☆16 Feb 24, 2024 Updated 2 years ago
- Resources regarding evML (edge verified machine learning) ☆22 Jan 4, 2025 Updated last year
- Experiments for "A Closer Look at In-Context Learning under Distribution Shifts" ☆19 May 29, 2023 Updated 2 years ago
- Introduction to Quantitative Risk Management ☆18 Sep 26, 2022 Updated 3 years ago
- DEPRECATED, please use upstream at @sjtug ☆13 Dec 26, 2017 Updated 8 years ago
- Assignments for Berkeley CS 285: Deep Reinforcement Learning (Fall 2021) ☆145 Jul 2, 2022 Updated 3 years ago
- ☆13 Jul 10, 2024 Updated last year
- Implementation of the Recursive Logit model for prediction and estimation ☆21 Jan 28, 2025 Updated last year
- Bridge between Isaac ROS VSLAM and PX4 via DDS ☆13 Feb 19, 2024 Updated 2 years ago
- Pick your next book from your Goodreads reading list ☆16 Jan 4, 2023 Updated 3 years ago
- Homework 3 for Berkeley CS 280: our version of the MIT Mini Places challenge ☆12 Mar 5, 2016 Updated 10 years ago
- [Humanoids 2024 award finalist] Online DNN-Driven Nonlinear MPC for Stylistic Humanoid Robot Walking with Step Adjustment ☆20 Feb 5, 2025 Updated last year
- Code for our paper "Decomposing The Dark Matter of Sparse Autoencoders" ☆23 Feb 6, 2025 Updated last year
- Solutions to C++ crash course : a fast-paced introduction ☆14 Jun 20, 2020 Updated 5 years ago
- SG-Bench: Evaluating LLM Safety Generalization Across Diverse Tasks and Prompt Types ☆24 Nov 29, 2024 Updated last year
- Benchmarking Tool for Model Predictive Control based stable walking for humanoid robot ☆21 Nov 6, 2024 Updated last year
- Package for solving dial-a-ride problems. ☆11 Apr 16, 2024 Updated last year
- All notes and materials for the CS229: Machine Learning course by Stanford University ☆3,162 Feb 14, 2025 Updated last year