My lecture notes on the RL series provided by Stanford.
☆15Aug 31, 2022Updated 3 years ago
Alternatives and similar repositories for Stanford-CS234-RL---Lecture-Notes
Users that are interested in Stanford-CS234-RL---Lecture-Notes are comparing it to the libraries listed below.
- Benchmark Dataset, Env and Agent for DSN Scheduling☆12Mar 3, 2022Updated 4 years ago
- Researching the forward-backward algorithm☆11Aug 3, 2018Updated 7 years ago
- This is the official repository for the paper "Modeling Human Gaze Behavior with Diffusion Models for Unified Scanpath Prediction". ICCV …☆25Dec 4, 2025Updated 4 months ago
- Use the tokenizer in parallel to achieve superior acceleration☆20Mar 21, 2024Updated 2 years ago
- A Python implementation of the Hierarchical Image Matting Model for segmentation.☆11Jun 21, 2022Updated 3 years ago
- Stanford CS234 : Reinforcement Learning☆183Oct 3, 2019Updated 6 years ago
- ☆43Aug 3, 2025Updated 8 months ago
- ☆16Dec 4, 2025Updated 4 months ago
- A set of Python classes implementing several basic turbo-algorithms (e.g., turbo-decoding)☆13Aug 31, 2020Updated 5 years ago
- LW-BenchHub is a unified benchmark hub built on Isaac Lab–Arena for embodied AI, providing consistent interfaces, realistic environments,…☆128Updated this week
- A collection of tools for the analysis of biological data☆12Mar 22, 2016Updated 10 years ago
- 🐲 Stanford CS234 : Reinforcement Learning☆26Jun 8, 2019Updated 6 years ago
- What if the adversarial loss were removed from the adversarial motion prior? A pairwise motion prior solution inspired by https://arxiv.org/abs/1706.0…☆38Sep 22, 2024Updated last year
- Code release for "Gaze-Assisted Medical Image Segmentation" [AIM-FM @ NeurIPS, 2024]☆14Oct 22, 2024Updated last year
- ☆16Jan 24, 2018Updated 8 years ago
- Open-source repository for the OOPSLA'24 paper "CYCLE: Learning to Self-Refine Code Generation"☆10Mar 8, 2024Updated 2 years ago
- Implementation of fast algorithms for Maximum Spanning Tree (MST) parsing that includes fast ArcMax+Reweighting+Tarjan algorithm for sing…☆16Aug 9, 2023Updated 2 years ago
- Init with augmentation loss☆19Feb 7, 2023Updated 3 years ago
- 🏕️ An out-of-doors, make-it-yours programming adventure. July 28th to 31st 2023 in Vermont's Northeast Kingdom.☆82Jun 4, 2025Updated 10 months ago
- Implementation of importance sampling, direct, and hybrid methods for off-policy evaluation.☆16Mar 28, 2020Updated 6 years ago
- ☆14May 9, 2024Updated last year
- A lightweight repository for exploring and experimenting with AI agents☆14Jul 22, 2025Updated 8 months ago
- simple-centerline-extraction☆18May 24, 2025Updated 10 months ago
- HTK Toolkit with Linux 64 bit and Docker support☆20Oct 4, 2021Updated 4 years ago
- Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval"☆27Apr 17, 2024Updated last year
- DREEM Relates Every Entities' Motion (DREEM). Global Tracking Transformers for biological multi-object tracking.☆17Mar 23, 2026Updated 3 weeks ago
- ☆26Nov 27, 2022Updated 3 years ago
- 2D/3D biopolymer network extraction and quantification.☆16Mar 29, 2026Updated 2 weeks ago
- Medical assistant and voice assistant using voice flow, with a Django back-end API for storing client records☆23Nov 27, 2020Updated 5 years ago
- Cross-view training for sequence labeling in PyTorch☆20Jul 25, 2024Updated last year
- Offline evaluation of multi-armed bandit algorithms☆23Dec 1, 2020Updated 5 years ago
- The Speech Rate Meter (hereinafter SRM) software module is designed to measure a complex of characteristics of the tempo (rate) of oral s…☆23Jul 11, 2024Updated last year
- Chinese edition of the Hugging Face Audio Course, helping learners get started quickly with the audio modality☆37May 25, 2024Updated last year
- RootPainter3D: Interactive-machine-learning enables rapid and accurate contouring for radiotherapy☆24Jun 5, 2024Updated last year
- ☆26Jan 26, 2024Updated 2 years ago
- Implementation notebooks and scripts for deep reinforcement learning algorithms in PyTorch and TensorFlow.☆23Jan 16, 2020Updated 6 years ago
- Code associated with the paper Cascaded Multitask U-Net using topological loss for vessel segmentation and centerline extraction☆26Jul 22, 2024Updated last year
- ☆14Feb 26, 2024Updated 2 years ago
- The Hidden Markov Model Toolkit (HTK) from University of Cambridge, with fixed issues.☆33Nov 29, 2018Updated 7 years ago