ksang / cs234-assignmentsView external linksLinks
Stanford CS234: Reinforcement Learning assignments and practices
☆63Jul 31, 2024Updated last year
Alternatives and similar repositories for cs234-assignments
Users that are interested in cs234-assignments are comparing it to the libraries listed below
Sorting:
- Stanford CS234: Reinforcement Learning Winter 2020☆19Mar 24, 2023Updated 2 years ago
- Solutions to coding assignments of Stanford Reinforcement Learning course Winter 2021☆13Aug 29, 2021Updated 4 years ago
- Stanford CS234 : Reinforcement Learning☆179Oct 3, 2019Updated 6 years ago
- This repo mainly contains CS234 (Spring 2024) assignment's coding problems☆57Feb 4, 2025Updated last year
- My Solutions of Assignments of CS234: Reinforcement Learning Winter 2019☆170Mar 24, 2023Updated 2 years ago
- Assignment Solutions to CS234: Reinforcement learning course☆36Aug 24, 2018Updated 7 years ago
- ☆13Aug 1, 2021Updated 4 years ago
- 🐲 Stanford CS234 : Reinforcement Learning☆26Jun 8, 2019Updated 6 years ago
- Proximal Policy Optimization (Continuous Version) in PyTorch.☆29May 12, 2025Updated 9 months ago
- CFR implementation of a poker bot.☆12Feb 17, 2023Updated 2 years ago
- A better Alpaca Model Trained with Less Data (only 9k instructions of the original set)☆24Jul 26, 2024Updated last year
- Deep Reinforcement Learning for Continuous Control in PyTorch☆105Dec 31, 2021Updated 4 years ago
- 🕹️ CS234: Reinforcement Learning, Winter 2019 | YouTube videos 👉☆311Mar 25, 2023Updated 2 years ago
- Reinforcement Learning | Multi-Agent RL | Self-Play | Proximal Policy Optimization Algorithm (PPO) agent | Unity Tennis environment☆20Dec 2, 2025Updated 2 months ago
- A collection of different PyTorch wrappers for training neural networks and reinforcement algorithms☆13Dec 15, 2022Updated 3 years ago
- Experiment code for testing effect of various action space transformations in reinforcement learning☆30May 26, 2020Updated 5 years ago
- Experiments with reinforcement learning and recurrent neural networks☆114Oct 27, 2023Updated 2 years ago
- ☆33Apr 29, 2023Updated 2 years ago
- Representing robots as graphs for reinforcement-learning in PyBullet locomotion environments.☆35Apr 11, 2021Updated 4 years ago
- Series of lectures and hands-on tutorials organized to familiarize new lab entrants with the fundamental areas of robotics research.☆35Jul 3, 2021Updated 4 years ago
- Single notebook implementation of Deep RL algorithms☆34Sep 17, 2020Updated 5 years ago
- ROS 2 simulation packages for the Neobotix robots☆36May 12, 2025Updated 9 months ago
- A QA system based on k8s-specific knowledge build on ChatGLM2-6B, serving by Ray.☆10Sep 14, 2023Updated 2 years ago
- Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes☆13Aug 13, 2025Updated 6 months ago
- ☆10Nov 16, 2023Updated 2 years ago
- This repository shows how to implement a basic model for multimodal entailment.☆10Aug 17, 2021Updated 4 years ago
- 🌿快速生成文件夹目录结构,支持定义目录层级,支持生成到 markdown 文件。☆13Oct 19, 2022Updated 3 years ago
- 🚀全流程自己训练一个VLA 「大模型」1小时从0训练26M参数的视觉多模态VLM!🌏 Train a 26M-parameter VLM from scratch in just 1 hours!☆26Oct 16, 2025Updated 4 months ago
- A Very Simple Demo of Fine Tuning Sentence Transformers☆15Jun 15, 2023Updated 2 years ago
- Simple rules based grapheme to phoneme in Python☆11Sep 2, 2017Updated 8 years ago
- This is a C++ implementation of cocoapi bbox evaluation code.☆11Dec 9, 2021Updated 4 years ago
- Advanced_Data_Integration_Project☆11Jul 31, 2018Updated 7 years ago
- https://github.com/mitsuba-renderer/mitsuba2 in docker☆10Jun 13, 2020Updated 5 years ago
- Convert datasets from Hugging Face to FiftyOne for Visualization☆11Mar 15, 2024Updated last year
- ☆13Jul 10, 2024Updated last year
- High-performance tokenized language data-loader for Python C++ extension☆14Jul 22, 2024Updated last year
- Implementation of Reinforce for educational purposes.☆12Jun 12, 2023Updated 2 years ago
- Solutions from "C++ Crash Course A Fast-Paced Introduction" by Josh Lospinoso.☆11May 10, 2023Updated 2 years ago
- ROS wrapper for SMAC, a versatile tool for optimizing algorithm parameters☆11Jul 19, 2021Updated 4 years ago