Stanford CS234 : Reinforcement Learning
☆189Oct 3, 2019Updated 6 years ago
Alternatives and similar repositories for stanford-cs234
Users that are interested in stanford-cs234 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Stanford CS234: Reinforcement Learning assignments and practices☆63Jul 31, 2024Updated last year
- Stanford CS234: Reinforcement Learning Winter 2020☆19Mar 24, 2023Updated 3 years ago
- My Solutions of Assignments of CS234: Reinforcement Learning Winter 2019☆170Mar 24, 2023Updated 3 years ago
- My lecture notes on the RL series provided by Stanford.☆16Aug 31, 2022Updated 3 years ago
- Assignment Solutions to CS234: Reinforcement learning course☆36Aug 24, 2018Updated 7 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- 🐲 Stanford CS234 : Reinforcement Learning☆13Jan 14, 2019Updated 7 years ago
- 🤖ConvRe🤯: An Investigation of LLMs’ Inefficacy in Understanding Converse Relations (EMNLP 2023)☆24Oct 10, 2023Updated 2 years ago
- 🕹️ CS234: Reinforcement Learning, Winter 2019 | YouTube videos 👉☆312Mar 25, 2023Updated 3 years ago
- Single notebook implementation of Deep RL algorithms☆35Sep 17, 2020Updated 5 years ago
- Winning 3rd Place solution for HubMap - Hacking the Human Vasculature hosted on Kaggle☆14Aug 10, 2023Updated 2 years ago
- Code for ASE'24 paper "B4: Towards Optimal Assessment of Plausible Code Solutions with Plausible Tests"☆11Sep 10, 2024Updated last year
- An Erasure Code Library with Efficient Repair and Update Features☆11Jan 3, 2022Updated 4 years ago
- 复现华泰证券《强化学习初探与DQN择时》研报中的DQN模型与效果☆42Oct 4, 2022Updated 3 years ago
- Corpus and code for Aligned Recipe Actions (ARA) corpus, EMNLP 2021☆10May 22, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Turtlesim-PID-Controller: A ROS 2 node designed to navigate a turtle in the turtlesim simulator using Proportional control. Aims to drive…☆15Feb 26, 2025Updated last year
- [NeurIPS 2024] The official implementation of "Image Copy Detection for Diffusion Models"☆18Oct 1, 2024Updated last year
- Project for my graduate neural networks course - combining RL with VAEs☆22Nov 10, 2019Updated 6 years ago
- https://cs330.stanford.edu/☆62Mar 24, 2023Updated 3 years ago
- Pytorch implementation of SelectiveNet https://arxiv.org/abs/1901.09192☆12Oct 28, 2020Updated 5 years ago
- This repository contains the code to train the baseline agent provided in the 2022 edition of Learning to Run a Power Network and to recr…☆15Aug 2, 2022Updated 3 years ago
- Experiments with reinforcement learning and recurrent neural networks☆116Oct 27, 2023Updated 2 years ago
- The official code for paper entitled "TRCA-Net: Using TRCA filters to boost the SSVEP classification with convolutional neural network" a…☆14Jul 18, 2023Updated 2 years ago
- Official code for "Evaluations of Machine Learning Privacy Defenses are Misleading" (https://arxiv.org/abs/2404.17399)☆13Apr 29, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Reinforcement learning algorithms A2C, A3C and DQN☆18Oct 3, 2023Updated 2 years ago
- ☆12Jun 30, 2019Updated 7 years ago
- Code for Representation Bending Paper☆17Jul 15, 2025Updated 11 months ago
- This is a quadruped simulated on pybullet physics engine, walking using trot and bound mechanisms☆16Feb 24, 2024Updated 2 years ago
- 6-DoF wheeled biped robot☆18Jan 19, 2022Updated 4 years ago
- Experiments for "A Closer Look at In-Context Learning under Distribution Shifts"☆18May 29, 2023Updated 3 years ago
- The different parts of the F651 Hexacopter was developed in Solid Works and you find the models here for 3D printing. The drawings were d…☆23Nov 16, 2018Updated 7 years ago
- Implementation of the Recursive Logit model for prediction and estimation☆22Jan 28, 2025Updated last year
- Homework 3 for Berkeley CS 280: our version of the MIT Mini Places challenge☆12Mar 5, 2016Updated 10 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆43May 7, 2026Updated last month
- [Humanoids 2024 award finalist] Online DNN-Driven Nonlinear MPC for Stylistic Humanoid Robot Walking with Step Adjustment☆20Feb 5, 2025Updated last year
- SG-Bench: Evaluating LLM Safety Generalization Across Diverse Tasks and Prompt Types☆26Nov 29, 2024Updated last year
- ☆19Jan 2, 2024Updated 2 years ago
- ☆26Feb 16, 2022Updated 4 years ago
- Official implementation of (ICML 2026) Training-Free Vector Quantization via Gaussian VAEs☆23Jan 3, 2026Updated 5 months ago
- Deep Reinforcement Learning Hands-On, 3E_Published by Packt☆468Mar 2, 2026Updated 4 months ago