Stanford CS234: Reinforcement Learning assignments and practices
☆63Jul 31, 2024Updated last year
Alternatives and similar repositories for cs234-assignments
Users that are interested in cs234-assignments are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Stanford CS234: Reinforcement Learning Winter 2020☆19Mar 24, 2023Updated 3 years ago
- Stanford CS234 : Reinforcement Learning☆183Oct 3, 2019Updated 6 years ago
- This repo mainly contains CS234 (Spring 2024) assignment's coding problems☆58Feb 4, 2025Updated last year
- Assignment Solutions to CS234: Reinforcement learning course☆36Aug 24, 2018Updated 7 years ago
- My Solutions of Assignments of CS234: Reinforcement Learning Winter 2019☆170Mar 24, 2023Updated 3 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [NIPS 2021] Code release for "Pareto Domain Adaptation"☆11Dec 13, 2021Updated 4 years ago
- CS294/194-196 Large Language Model Agents☆47Dec 20, 2024Updated last year
- TOKEN-IMPORTANCE GUIDED DIRECT PREFERENCE OPTIMIZATION☆24Jan 26, 2026Updated 2 months ago
- ☆10Jan 31, 2019Updated 7 years ago
- The official implementation of our work SQLFixAgent: Towards Semantic-Accurate Text-to-SQL Parsing via Consistency-Enhanced Multi-Agent C…☆24May 2, 2025Updated 10 months ago
- Proximal Policy Optimization (Continuous Version) in PyTorch.☆29May 12, 2025Updated 10 months ago
- Dataset Bias correction (Python)☆20Jan 7, 2018Updated 8 years ago
- Изучение C++ и алгоритмов CV на спец. курсе для 11 классов ФМЛ №239☆10Sep 5, 2024Updated last year
- A better Alpaca Model Trained with Less Data (only 9k instructions of the original set)☆24Jul 26, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- 🕹️ CS234: Reinforcement Learning, Winter 2019 | YouTube videos 👉☆312Mar 25, 2023Updated 3 years ago
- Model-Predictive Control for realtime synthesis of agile motor control using MuJoCo.☆10May 17, 2024Updated last year
- ☆11Jan 5, 2017Updated 9 years ago
- CS 70 - Discrete Mathematics and Probability Theory - UC Berkeley - Fall 2017☆12Jan 17, 2018Updated 8 years ago
- Experiment code for testing effect of various action space transformations in reinforcement learning☆30May 26, 2020Updated 5 years ago
- Embedded segmental K-means (ES-KMeans) in Python.☆14Apr 22, 2024Updated last year
- ☆13Jul 4, 2019Updated 6 years ago
- Deep Reinforcement Learning for Continuous Control in PyTorch☆106Dec 31, 2021Updated 4 years ago
- First instruction-tuning dataset distilled from Claude2 (52k Alpaca prompts)!☆13Oct 22, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆13Jul 10, 2024Updated last year
- Code and written solutions of the assignments of the Stanford CS224N: Natural Language Processing with Deep Learning course from winter 2…☆273Mar 9, 2024Updated 2 years ago
- Experiments with reinforcement learning and recurrent neural networks☆115Oct 27, 2023Updated 2 years ago
- OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation☆16Aug 3, 2023Updated 2 years ago
- ☆12Sep 15, 2021Updated 4 years ago
- [CIKM-21] Pytorch implementation of LiteGT: Efficient and Lightweight Graph Transformers☆12Nov 16, 2021Updated 4 years ago
- This is AlpaGasus2-QLoRA based on LLaMA2 with AlpaGasus mechanism using QLoRA!☆15Nov 22, 2023Updated 2 years ago
- Official repository accompaying the ICDAR 2023 paper☆13Oct 3, 2023Updated 2 years ago
- About Homework assignments for the Stanford CS 330 (Deep Multi-Task and Meta Learning) class offered in Fall 2022.☆10Dec 15, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆17Jul 12, 2025Updated 8 months ago
- ☆22Feb 4, 2026Updated last month
- [NeurIPS 2023] Efficient Diffusion Policy☆112Oct 31, 2023Updated 2 years ago
- ロボットシステム入門 / Let's learn how to create intelligent robot systems with Roomba!☆23Aug 4, 2023Updated 2 years ago
- This model detects arabic fonts (نسخ, رقعة) given a picture of the text, Live https://calbot.hawzen.me/☆17May 27, 2023Updated 2 years ago
- Isaac Lab locomotion tasks for the Kangaroo robot☆17Aug 22, 2025Updated 7 months ago
- Hybrid Pointer Networks for Traveling Salesman Problems Optimization☆29Oct 13, 2022Updated 3 years ago