Stanford CS234: Reinforcement Learning assignments and practices
☆63Jul 31, 2024Updated last year
Alternatives and similar repositories for cs234-assignments
Users that are interested in cs234-assignments are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Stanford CS234: Reinforcement Learning Winter 2020☆19Mar 24, 2023Updated 3 years ago
- Stanford CS234 : Reinforcement Learning☆183Oct 3, 2019Updated 6 years ago
- Solutions to coding assignments of Stanford Reinforcement Learning course Winter 2021☆13Aug 29, 2021Updated 4 years ago
- Assignment Solutions to CS234: Reinforcement learning course☆36Aug 24, 2018Updated 7 years ago
- My Solutions of Assignments of CS234: Reinforcement Learning Winter 2019☆170Mar 24, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- 🐲 Stanford CS234 : Reinforcement Learning☆26Jun 8, 2019Updated 6 years ago
- 🐲 Stanford CS234 : Reinforcement Learning☆13Jan 14, 2019Updated 7 years ago
- Single notebook implementation of Deep RL algorithms☆34Sep 17, 2020Updated 5 years ago
- CS294/194-196 Large Language Model Agents☆46Dec 20, 2024Updated last year
- Test version of frechet☆14Mar 30, 2025Updated last year
- Collection of dynamic-graph entities aimed at implementing torque control on different robots.☆15Nov 24, 2025Updated 4 months ago
- My own version from "Writing a C Compiler" Book from NoStarchPress using C++ and LLVM libraries.☆38Mar 21, 2026Updated 3 weeks ago
- SSR: A Unifed Framework for Prediction in Secondary Structure of RNA. 整合了LinearFold, E2Efold, MXfold2算法,包含数据集,欢迎一键使用。☆10Dec 31, 2021Updated 4 years ago
- Dataset Bias correction (Python)☆20Jan 7, 2018Updated 8 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Proximal Policy Optimization (Continuous Version) in PyTorch.☆29May 12, 2025Updated 11 months ago
- TOKEN-IMPORTANCE GUIDED DIRECT PREFERENCE OPTIMIZATION☆29Jan 26, 2026Updated 2 months ago
- a benchmark to evaluate the situated inductive reasoning☆15Jan 7, 2025Updated last year
- Solutions from "C++ Crash Course A Fast-Paced Introduction" by Josh Lospinoso.☆13May 10, 2023Updated 2 years ago
- CS 70 - Discrete Mathematics and Probability Theory - UC Berkeley - Fall 2017☆12Jan 17, 2018Updated 8 years ago
- Experiment code for testing effect of various action space transformations in reinforcement learning☆30May 26, 2020Updated 5 years ago
- AU335:计算机视觉 课程大作业。Back to the Non-deep-learning Era: 非深度学习的车牌检测与识别算法。基于特征提取、模板匹配等技术。☆13Nov 22, 2021Updated 4 years ago
- ☆12Feb 19, 2024Updated 2 years ago
- ☆22Oct 22, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A prerender demo for Vue 3 base on Vite.☆10Jun 2, 2022Updated 3 years ago
- ☆13Jul 4, 2019Updated 6 years ago
- Deep Reinforcement Learning for Continuous Control in PyTorch☆106Dec 31, 2021Updated 4 years ago
- First instruction-tuning dataset distilled from Claude2 (52k Alpaca prompts)!☆13Oct 22, 2023Updated 2 years ago
- My Solution to Assignments of CS234☆94May 9, 2019Updated 6 years ago
- Code and written solutions of the assignments of the Stanford CS224N: Natural Language Processing with Deep Learning course from winter 2…☆275Mar 9, 2024Updated 2 years ago
- ☆10Apr 20, 2023Updated 2 years ago
- Experiments with reinforcement learning and recurrent neural networks☆115Oct 27, 2023Updated 2 years ago
- OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation☆16Aug 3, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Code Repository for Deep Learning with Real World Projects, Published by Packt☆18Jan 18, 2023Updated 3 years ago
- [Humanoids 2024 award finalist] Online DNN-Driven Nonlinear MPC for Stylistic Humanoid Robot Walking with Step Adjustment☆20Feb 5, 2025Updated last year
- This is AlpaGasus2-QLoRA based on LLaMA2 with AlpaGasus mechanism using QLoRA!