Stanford CS234: Reinforcement Learning assignments and practices
☆63Jul 31, 2024Updated last year
Alternatives and similar repositories for cs234-assignments
Users that are interested in cs234-assignments are comparing it to the libraries listed below.
- Stanford CS234: Reinforcement Learning Winter 2020☆19Mar 24, 2023Updated 3 years ago
- Stanford CS234 : Reinforcement Learning☆186Oct 3, 2019Updated 6 years ago
- Solutions to coding assignments of Stanford Reinforcement Learning course Winter 2021☆13Aug 29, 2021Updated 4 years ago
- 🐲 Stanford CS234 : Reinforcement Learning☆13Jan 14, 2019Updated 7 years ago
- CS294/194-196 Large Language Model Agents☆47Dec 20, 2024Updated last year
- Variational Inference for a Normal Distribution☆13Mar 11, 2018Updated 8 years ago
- ☆10Jan 31, 2019Updated 7 years ago
- Collection of dynamic-graph entities aimed at implementing torque control on different robots.☆15Nov 24, 2025Updated 6 months ago
- SSR: A Unified Framework for Prediction in Secondary Structure of RNA. Integrates the LinearFold, E2Efold, and MXfold2 algorithms and includes datasets; ready for one-click use.☆10Dec 31, 2021Updated 4 years ago
- *Practical Master Course for ROS Beginners*☆10Jul 20, 2020Updated 5 years ago
- Livox device driver for ROS☆11Feb 4, 2022Updated 4 years ago
- Proximal Policy Optimization (Continuous Version) in PyTorch.☆27May 12, 2025Updated last year
- 🕹️ CS234: Reinforcement Learning, Winter 2019 | YouTube videos 👉☆312Mar 25, 2023Updated 3 years ago
- ☆10Apr 23, 2026Updated last month
- Vanilla implementation of Rapidly Exploring Random Tree (RRT), Rapidly Exploring Random Graph (RRG) and Rapidly Exploring Random Tree* (R…☆16Dec 10, 2017Updated 8 years ago
- Experiment code for testing effect of various action space transformations in reinforcement learning☆30May 26, 2020Updated 6 years ago
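- AU335: Computer Vision course project. Back to the Non-deep-learning Era: a non-deep-learning license plate detection and recognition algorithm, based on feature extraction, template matching, and similar techniques.☆13Nov 22, 2021Updated 4 years ago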
- [ICLR 2026] The official implementation of the paper “Anchored Supervised Fine-Tuning”☆41May 8, 2026Updated 3 weeks ago
- A prerender demo for Vue 3 based on Vite.☆10Jun 2, 2022Updated 3 years ago
- TOKEN-IMPORTANCE GUIDED DIRECT PREFERENCE OPTIMIZATION☆36Jan 26, 2026Updated 4 months ago
- Transpile JSON Schema to Type aliases for many languages☆27Jan 3, 2025Updated last year
- First instruction-tuning dataset distilled from Claude2 (52k Alpaca prompts)!☆13Oct 22, 2023Updated 2 years ago
- Code and written solutions of the assignments of the Stanford CS224N: Natural Language Processing with Deep Learning course from winter 2…☆275Mar 9, 2024Updated 2 years ago
- MishformerLens intends to be a drop-in replacement for TransformerLens that AST patches HuggingFace Transformers rather than implementing…☆10Oct 7, 2024Updated last year
- ☆10Apr 20, 2023Updated 3 years ago
- Official Implementation of "Maximum Likelihood Reinforcement Learning (MaxRL)"☆179May 14, 2026Updated 2 weeks ago
- Bridge between Isaac ROS VSLAM and PX4 via DDS☆13Feb 19, 2024Updated 2 years ago
- Experiments with reinforcement learning and recurrent neural networks☆116Oct 27, 2023Updated 2 years ago
- PX4 Simulink Software-In-Loop Simulation☆19Feb 13, 2026Updated 3 months ago
- OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation☆16Aug 3, 2023Updated 2 years ago
- ☆12Sep 15, 2021Updated 4 years ago
- Device emulator for ckb-next☆15Aug 31, 2020Updated 5 years ago
- [Humanoids 2024 award finalist] Online DNN-Driven Nonlinear MPC for Stylistic Humanoid Robot Walking with Step Adjustment☆19Feb 5, 2025Updated last year
- Implementation of Q-Learning using TD error to navigate a maze avoiding obstacles and a moving enemy☆10Mar 4, 2018Updated 8 years ago
- Benchmarking Tool for Model Predictive Control based stable walking for humanoid robot☆22Nov 6, 2024Updated last year
- Official code for "Traffic Speed Imputation with Spatio-Temporal Attentions and Cycle-Perceptual Training" (CIKM'22).☆13Mar 8, 2024Updated 2 years ago
- Make/Encode some basic logic puzzles☆18Jul 10, 2024Updated last year
- ☆17Jul 12, 2025Updated 10 months ago
- ☆22Feb 4, 2026Updated 3 months ago