π² Stanford CS234 : Reinforcement Learning
β27Jun 8, 2019Updated 7 years ago
Alternatives and similar repositories for CS234_RL
Users that are interested in CS234_RL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Stanford CS234 : Reinforcement Learningβ187Oct 3, 2019Updated 6 years ago
- Stanford CS234: Reinforcement Learning Winter 2020β19Mar 24, 2023Updated 3 years ago
- Minimal example to access PyBullet using C++β12Mar 19, 2021Updated 5 years ago
- Reinforcement Learning Seminar at the Chinese University of Hong Kong, Shenzhen, China.β21Nov 17, 2023Updated 2 years ago
- Corpus and code for Aligned Recipe Actions (ARA) corpus, EMNLP 2021β10May 22, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- My lecture notes on the RL series provided by Stanford.β15Aug 31, 2022Updated 3 years ago
- β12Jan 11, 2018Updated 8 years ago
- BUILD YOUR OWN BLOCKCHAIN: A PYTHON TUTORIAL Download the full Jupyter/iPython notebook from Github here Build Your Own Blockchain β Theβ¦β19Jun 15, 2019Updated 6 years ago
- Teleop Twist Keyboard for ROS2β25Oct 29, 2023Updated 2 years ago
- OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimationβ16Aug 3, 2023Updated 2 years ago
- β13Feb 24, 2020Updated 6 years ago
- β21Mar 28, 2018Updated 8 years ago
- Implementation of "Learning Across Tasks and Domains" ICCV 2019β15Mar 24, 2023Updated 3 years ago
- Benchmark result of different RL algorithms on MetaDrive environments, including Multi-agent RL (IPPO, centralized critics, CoPO).β16Oct 25, 2022Updated 3 years ago
- GPUs on demand by Runpod - Special Offer Available β’ AdRun AI, ML, and HPC workloads on powerful cloud GPUsβwithout limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- simulation of "A novel reinforcement learning algorithm for virtual network emb e dding" paperβ18Jan 16, 2020Updated 6 years ago
- Water Hackweek Machine Learning workshopβ15Sep 2, 2020Updated 5 years ago
- A SCADA system that uses prime for intrusion tolerance. Using PVBrowser as an HMIβ10May 27, 2015Updated 11 years ago
- Notes from Reinforcement Learning Specialisaitonβ10Jul 6, 2021Updated 4 years ago
- Reward shaping approach for instruction following settings, leveraging language at multiple levels of abstraction.β21Mar 9, 2021Updated 5 years ago
- Dummy package and node for ROS2 GDB debuggingβ10Sep 3, 2020Updated 5 years ago
- C++ implementation of the algorithm in "Fast and Accurate Least-Mean-Squares Solvers", NIPS19β11Mar 4, 2020Updated 6 years ago
- Python Advance Levelβ18Oct 6, 2024Updated last year
- Flow-based programming frameworkβ16Apr 9, 2018Updated 8 years ago
- Deploy on Railway without the complexity - Free Credits Offer β’ AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- State Space Models for Reinforcement Learning in Tensorflowβ19Jan 27, 2019Updated 7 years ago
- β10Apr 16, 2020Updated 6 years ago
- Code to reproduce the experiments in The Mirage of Action-Dependent Baselines in Reinforcement Learning.β17Aug 2, 2018Updated 7 years ago
- Urho3D extra minimal examples and demos. Tested in Ubuntu 18.04.β11Feb 25, 2022Updated 4 years ago
- β12May 20, 2026Updated 3 weeks ago
- Official codebase for paper "Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning" (ICML22)β23Jul 16, 2022Updated 3 years ago
- Variants for ROS (implemented as metapackages)β11May 31, 2025Updated last year
- Visualising what each LSTM cell learns from data.β24Jan 26, 2020Updated 6 years ago
- The project tracks a person wearing a specific logo t-shirt from video and estimates height of the personβ18May 28, 2018Updated 8 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- β15Nov 11, 2014Updated 11 years ago
- Tock Tracker (a tock is like a pomodoro but longer)β11Jul 28, 2014Updated 11 years ago
- Auxiliary variable Markov chain Monte Carlo methodsβ10Oct 24, 2017Updated 8 years ago
- Landing a Spaceship using Upside-Down Reinforcement Learning (a.k.a β κ€)β13Oct 25, 2023Updated 2 years ago
- Use your iPad as a Gobanβ30Aug 4, 2010Updated 15 years ago
- Linear Algebra for Machine Learning Book Exercisesβ13May 19, 2019Updated 7 years ago
- Deep Developmental Reinforcement Learningβ29Jul 1, 2020Updated 5 years ago