Rhyme0730 / CS234-Reinforcement-LearningLinks
This repo mainly contains CS234 assignment's coding problems
☆39Updated 7 months ago
Alternatives and similar repositories for CS234-Reinforcement-Learning
Users that are interested in CS234-Reinforcement-Learning are comparing it to the libraries listed below
Sorting:
- Stanford CS234: Reinforcement Learning assignments and practices☆59Updated last year
- Stanford CS234 : Reinforcement Learning☆161Updated 5 years ago
- ☆173Updated last week
- ☆360Updated 8 months ago
- Awesome In-Context RL: A curated list of In-Context Reinforcement Learning - - —☆227Updated 2 weeks ago
- Advanced NLP, Spring 2025 https://cmu-l3.github.io/anlp-spring2025/☆65Updated 5 months ago
- ☆239Updated last year
- 🌀 Stanford CS 228 - Probabilistic Graphical Models☆122Updated 6 years ago
- ☆235Updated 2 years ago
- ☆68Updated 2 years ago
- ☆51Updated last year
- Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF"☆30Updated last year
- 🦍 Stanford CS236 : Deep Generative Models☆149Updated 6 years ago
- A template for starting reproducible Python machine-learning projects with hardware acceleration. Find an example at https://github.com/C…☆106Updated 3 months ago
- ☆186Updated last year
- 🌲 Stanford CS 228 - Probabilistic Graphical Models☆139Updated last year
- Material for the "Probabilistic Machine Learning" Course at the University of Tübingen, Summer Term 2023☆180Updated 2 years ago
- A curated list of papers of interesting empirical study and insight on deep learning. Continually updating...☆363Updated this week
- Interactive textbook on state-space models☆197Updated last year
- Minimal hackable GRPO implementation☆285Updated 7 months ago
- Notes and commented code for RLHF (PPO)☆107Updated last year
- Collecting research materials on EBM/EBL (Energy Based Models, Energy Based Learning)☆342Updated 2 months ago
- An implementation of PPO in Pytorch☆95Updated last month
- Solutions to exercises in Reinforcement Learning: An Introduction (2nd Edition).☆389Updated 2 years ago
- ☆96Updated 11 months ago
- ☆82Updated last year
- ☆150Updated 9 months ago
- Generative Flow Networks - GFlowNet☆275Updated this week
- Code and links for over 25,000 trained Atari agents☆98Updated last year
- Code for the paper: Why Transformers Need Adam: A Hessian Perspective☆62Updated 6 months ago