My lecture notes on the RL series provided by Stanford.
☆15Aug 31, 2022Updated 3 years ago
Alternatives and similar repositories for Stanford-CS234-RL---Lecture-Notes
Users that are interested in Stanford-CS234-RL---Lecture-Notes are comparing it to the libraries listed below.
- Benchmark Dataset, Env and Agent for DSN Scheduling☆12Mar 3, 2022Updated 4 years ago
- Belief state estimation for Stanford's CS238/AA228 Decision Making Under Uncertainty☆10Nov 16, 2023Updated 2 years ago
- Prompt Optimization with Human Feedback☆18Aug 7, 2024Updated last year
- Python library and demo code for processing and visualization of data from Ocean Observatories Initiative (OOI)☆17May 25, 2026Updated 3 weeks ago
- Ilya Sutskever “If you really learn all of these, you’ll know 90% of what matters today”. This repo records my thoughts on these papers.☆11May 24, 2024Updated 2 years ago
- Biodiversity detection for stationary cameras and ecological video.☆30Apr 28, 2026Updated last month
- Use the tokenizer in parallel to achieve superior acceleration☆20Mar 21, 2024Updated 2 years ago
- The first open-source synthetic dataset for collaborative perception focused on adverse weather conditions☆25Apr 29, 2025Updated last year
- This is a Python implementation of Hierarchical Image Matting Model for Segmentation.☆11Jun 21, 2022Updated 3 years ago
- Stanford CS234 : Reinforcement Learning☆187Oct 3, 2019Updated 6 years ago
- Training camp lecture notes☆21Jan 21, 2025Updated last year
- A collection of tools for the analysis of biological data☆12Mar 22, 2016Updated 10 years ago
- 🐲 Stanford CS234 : Reinforcement Learning☆27Jun 8, 2019Updated 7 years ago
- LW-BenchHub is a unified benchmark hub built on Isaac Lab–Arena for embodied AI, providing consistent interfaces, realistic environments,…☆166May 18, 2026Updated 3 weeks ago
- Code release for "Gaze-Assisted Medical Image Segmentation" [AIM-FM @ NeurIPS, 2024]☆14Oct 22, 2024Updated last year
- Programming assignments for Coursera's Machine Learning Course.☆15Sep 1, 2017Updated 8 years ago
- Open-source repository for the OOPSLA'24 paper "CYCLE: Learning to Self-Refine Code Generation"☆10Mar 8, 2024Updated 2 years ago
- Implementation of fast algorithms for Maximum Spanning Tree (MST) parsing that includes fast ArcMax+Reweighting+Tarjan algorithm for sing…☆16Aug 9, 2023Updated 2 years ago
- ICLR 2026: Towards Bridging the Gap between Large-Scale Pretraining and Efficient Finetuning for Humanoid Controls☆107Mar 28, 2026Updated 2 months ago
- Init with augmentation loss☆19Feb 7, 2023Updated 3 years ago
- 🏕️ An out-of-doors, make-it-yours programming adventure. July 28th to 31st 2023 in Vermont's Northeast Kingdom.☆81Jun 4, 2025Updated last year
- In order to demonstrate any signal accurately it is important to know the noise content in the signal. Thus, a fundamental measure is th…☆13May 10, 2021Updated 5 years ago
- Implementation of importance sampling, direct, and hybrid methods for off-policy evaluation.☆16Mar 28, 2020Updated 6 years ago
- HMM, CTC, RNN-Transducer, forward-backward algorithm☆20Sep 5, 2023Updated 2 years ago
- Python version of UMDHMM, including the forward-backward, Viterbi, and Baum-Welch algorithms.☆17Mar 21, 2013Updated 13 years ago
- A lightweight repository for exploring and experimenting with AI agents☆15Jul 22, 2025Updated 10 months ago
- simple-centerline-extraction☆18May 24, 2025Updated last year
- Suite of tools for noise analysis in Python☆17Sep 8, 2024Updated last year
- Enhance vessel structures in 3D images using Hessian/Frangi/eigenvalue filter through the ITK library☆19Jul 25, 2021Updated 4 years ago
- notes for convex optimization☆21Aug 30, 2024Updated last year
- Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval"☆27Apr 17, 2024Updated 2 years ago
- DREEM Relates Every Entities' Motion (DREEM). Global Tracking Transformers for biological multi-object tracking.☆17Mar 23, 2026Updated 2 months ago
- ☆26Nov 27, 2022Updated 3 years ago
- 2D/3D biopolymer network extraction and quantification.☆17Mar 29, 2026Updated 2 months ago
- Medical Assistant, Voice-Assistant using voice flow, Django Back-end API For Storing Client Records☆23Nov 27, 2020Updated 5 years ago
- Neural network backend for training and inference for animal pose estimation.☆20Jun 6, 2026Updated last week
- Cross view training for sequence labeling in pytorch☆20Jul 25, 2024Updated last year
- Offline evaluation of multi-armed bandit algorithms☆23Dec 1, 2020Updated 5 years ago
- Chinese edition of the Hugging Face Audio Course, helping learners get started quickly with the audio modality☆37May 25, 2024Updated 2 years ago