Stanford CS234: Reinforcement Learning Winter 2020
☆19Mar 24, 2023Updated 2 years ago
Alternatives and similar repositories for CS234-2020
Users that are interested in CS234-2020 are comparing it to the libraries listed below
Sorting:
- Assignment Solutions to CS234: Reinforcement learning course☆36Aug 24, 2018Updated 7 years ago
- Stanford CS234: Reinforcement Learning assignments and practices☆63Jul 31, 2024Updated last year
- Solutions to coding assignments of Stanford Reinforcement Learning course Winter 2021☆13Aug 29, 2021Updated 4 years ago
- My Solutions of Assignments of CS234: Reinforcement Learning Winter 2019☆170Mar 24, 2023Updated 2 years ago
- Stanford CS234 : Reinforcement Learning☆182Oct 3, 2019Updated 6 years ago
- CFR implementation of a poker bot.☆12Feb 17, 2023Updated 3 years ago
- 🕹️ CS234: Reinforcement Learning, Winter 2019 | YouTube videos 👉☆311Mar 25, 2023Updated 2 years ago
- Reinforcement Learning | Multi-Agent RL | Self-Play | Proximal Policy Optimization Algorithm (PPO) agent | Unity Tennis environment☆20Dec 2, 2025Updated 3 months ago
- A collection of different PyTorch wrappers for training neural networks and reinforcement algorithms☆13Dec 15, 2022Updated 3 years ago
- Tutorial: Writing R and Python Packages with Multithreaded C++ Code using BLAS, AVX2/AVX512, OpenMP, C++11 Threads and Cuda GPU accelerat…☆13Nov 27, 2022Updated 3 years ago
- Proximal Policy Optimization (Continuous Version) in PyTorch.☆29May 12, 2025Updated 9 months ago
- ☆33Apr 29, 2023Updated 2 years ago
- ROS 2 simulation packages for the Neobotix robots☆38May 12, 2025Updated 9 months ago
- A QA system based on k8s-specific knowledge build on ChatGLM2-6B, serving by Ray.☆10Sep 14, 2023Updated 2 years ago
- 🌿快速生成文件夹目录结构,支持定义目录层级,支持生成到 markdown 文件。☆13Oct 19, 2022Updated 3 years ago
- 🚀全流程自己训练一个VLA 「大模型」1小时从0训练26M参数的视觉多模态VLM!🌏 Train a 26M-parameter VLM from scratch in just 1 hours!☆27Oct 16, 2025Updated 4 months ago
- Python Implementation of the CME FedWatch Tool for Estimating Probabilities of Federal Funds Rate Changes at Upcoming FOMC Meetings.☆11Sep 20, 2023Updated 2 years ago
- A brief understanding of ffmpeg cli through pseudocode☆11Dec 20, 2020Updated 5 years ago
- Poker hand evaluation for Go☆12Feb 7, 2014Updated 12 years ago
- An implementation of the AlphaZero algorithm for adversarial games to be used with the machine learning framework of your choice☆12Aug 30, 2020Updated 5 years ago
- Swarm learning algorithm☆11Jun 2, 2021Updated 4 years ago
- Streamlit Dashboard over Superstore Data stored in Postgres Docker container. With SQLAlchemy + Plotly Express☆13Oct 16, 2024Updated last year
- CFR-based Texas Hold'em AI☆11Jan 30, 2021Updated 5 years ago
- Gym wrapper for pysc2☆10Sep 16, 2022Updated 3 years ago
- 3rd placed submission to the NeurIPS MineRL competition 2019☆10Mar 24, 2023Updated 2 years ago
- About Homework assignments for the Stanford CS 330 (Deep Multi-Task and Meta Learning) class offered in Fall 2022.☆10Dec 15, 2023Updated 2 years ago
- Solutions from "C++ Crash Course A Fast-Paced Introduction" by Josh Lospinoso.☆11May 10, 2023Updated 2 years ago
- MLflow App Using React, Hooks, RabbitMQ, FastAPI Server, Celery, Microservices☆11Sep 25, 2022Updated 3 years ago
- nd009-cn-advanced-p5,针对Udacity CN MLND P5项目☆14Jun 27, 2022Updated 3 years ago
- Simple PyTorch profiler that combines DeepSpeed Flops Profiler and TorchInfo☆11Feb 12, 2023Updated 3 years ago
- Reinforcement learning training project for a SLG game☆13Dec 21, 2017Updated 8 years ago
- ☆11Sep 17, 2020Updated 5 years ago
- A Simple Game Using Unity ML-Agents☆10Nov 20, 2020Updated 5 years ago
- ☆16Jul 13, 2022Updated 3 years ago
- paddle cifar100 training☆14May 28, 2021Updated 4 years ago
- Implementation of elo rating for large competitions☆10Nov 25, 2016Updated 9 years ago
- Tuning the PI controller parameters by using a contextual bandit approach☆15Jan 13, 2022Updated 4 years ago
- LibGL for ExaGear☆11Feb 14, 2021Updated 5 years ago
- Code from the CMU LM inference fall 2025 edition.☆34Dec 7, 2025Updated 2 months ago