Stanford CS234: Reinforcement Learning assignments and practices
☆63Jul 31, 2024Updated last year
Alternatives and similar repositories for cs234-assignments
Users that are interested in cs234-assignments are comparing it to the libraries listed below
Sorting:
- Stanford CS234: Reinforcement Learning Winter 2020☆19Mar 24, 2023Updated 2 years ago
- Solutions to coding assignments of Stanford Reinforcement Learning course Winter 2021☆13Aug 29, 2021Updated 4 years ago
- Stanford CS234 : Reinforcement Learning☆183Oct 3, 2019Updated 6 years ago
- My Solutions of Assignments of CS234: Reinforcement Learning Winter 2019☆170Mar 24, 2023Updated 2 years ago
- Assignment Solutions to CS234: Reinforcement learning course☆36Aug 24, 2018Updated 7 years ago
- Proximal Policy Optimization (Continuous Version) in PyTorch.☆29May 12, 2025Updated 9 months ago
- CFR implementation of a poker bot.☆12Feb 17, 2023Updated 3 years ago
- Deep Reinforcement Learning for Continuous Control in PyTorch☆106Dec 31, 2021Updated 4 years ago
- 🕹️ CS234: Reinforcement Learning, Winter 2019 | YouTube videos 👉☆310Mar 25, 2023Updated 2 years ago
- Reinforcement Learning | Multi-Agent RL | Self-Play | Proximal Policy Optimization Algorithm (PPO) agent | Unity Tennis environment☆20Dec 2, 2025Updated 3 months ago
- A collection of different PyTorch wrappers for training neural networks and reinforcement algorithms☆13Dec 15, 2022Updated 3 years ago
- Experiment code for testing effect of various action space transformations in reinforcement learning☆30May 26, 2020Updated 5 years ago
- Experiments with reinforcement learning and recurrent neural networks☆114Oct 27, 2023Updated 2 years ago
- ☆33Apr 29, 2023Updated 2 years ago
- DiagnoSys is a comprehensive web application that provides advanced detection and analysis for various health conditions. This project le…☆14May 6, 2024Updated last year
- ☆10Feb 27, 2026Updated last week
- Representing robots as graphs for reinforcement-learning in PyBullet locomotion environments.☆35Apr 11, 2021Updated 4 years ago
- ROS 2 simulation packages for the Neobotix robots☆38May 12, 2025Updated 9 months ago
- ☆10Nov 16, 2023Updated 2 years ago
- cross lingual text classification on amazon reviews☆10Nov 4, 2019Updated 6 years ago
- 🌿快速生成文件夹目录结构,支持定义目录层级,支持生成到 markdown 文件。☆13Oct 19, 2022Updated 3 years ago
- Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes☆13Aug 13, 2025Updated 6 months ago
- This repository shows how to implement a basic model for multimodal entailment.☆10Aug 17, 2021Updated 4 years ago
- A Texas Holdem poker framework written in C++ 20.☆11Apr 23, 2023Updated 2 years ago
- 🚀全流程自己训练一个VLA 「大模型」1小时从0训练26M参数的视觉多模态VLM!🌏 Train a 26M-parameter VLM from scratch in just 1 hours!☆27Oct 16, 2025Updated 4 months ago
- This is a C++ implementation of cocoapi bbox evaluation code.☆11Dec 9, 2021Updated 4 years ago
- An implementation of the AlphaZero algorithm for adversarial games to be used with the machine learning framework of your choice☆12Aug 30, 2020Updated 5 years ago
- Autonomous Agent for Kubernetes☆14Feb 14, 2025Updated last year
- EasyRLHF aims to provide an easy and minimal interface to train aligned language models, using off-the-shelf solutions and datasets☆10Dec 12, 2023Updated 2 years ago
- Gym wrapper for pysc2☆10Sep 16, 2022Updated 3 years ago
- [CIKM-21] Pytorch implementation of LiteGT: Efficient and Lightweight Graph Transformers☆12Nov 16, 2021Updated 4 years ago
- an implementation of paper"Retentive Network: A Successor to Transformer for Large Language Models" https://arxiv.org/pdf/2307.08621.pdf☆11Jul 25, 2023Updated 2 years ago
- This is AlpaGasus2-QLoRA based on LLaMA2 with AlpaGasus mechanism using QLoRA!☆15Nov 22, 2023Updated 2 years ago
- Poker hand evaluation for Go☆12Feb 7, 2014Updated 12 years ago
- ROS wrapper for SMAC, a versatile tool for optimizing algorithm parameters☆11Jul 19, 2021Updated 4 years ago
- Implementation of Reinforce for educational purposes.☆12Jun 12, 2023Updated 2 years ago
- ☆10Mar 11, 2020Updated 5 years ago
- ☆15May 3, 2024Updated last year
- A Very Simple Demo of Fine Tuning Sentence Transformers☆15Jun 15, 2023Updated 2 years ago