Algorithms for Policy Evaluation, Estimation of Action Values, Policy Improvement, Policy Iteration, Truncated Policy Evaluation, Truncated Policy Iteration, Value Iteration . From Udacity's Deep Reinforcement Learning Nanodegree program.
☆11Apr 3, 2019Updated 7 years ago
Alternatives and similar repositories for Dynamic-Programming
Users that are interested in Dynamic-Programming are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A brief tutorial for eBPF: Verifier, observability, networking, and security.☆12Sep 19, 2024Updated last year
- Rajomon: Decentralized and Coordinated Overload Control for Latency-Sensitive Microservices☆12May 19, 2025Updated 11 months ago
- Classic environments for reinforcement learning and dynamic programming, implemented in OpenAI Gym and Gymnasium.☆21May 2, 2023Updated 2 years ago
- Eliminate compaction jobs in secondary nodes within a group of replicated RocksDB.☆10Jun 5, 2024Updated last year
- Learning Long-Horizon Robot Exploration Strategies for Multi-Object Search in Continuous Action Spaces. http://multi-object-search.cs.uni…☆13Nov 29, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Risk-sensitive Inverse Reinforcement Learning☆11Sep 11, 2019Updated 6 years ago
- Differential game theory for multi-agent collision avoidance. Simulations set up.☆12Jan 27, 2021Updated 5 years ago
- ☆11Dec 16, 2025Updated 4 months ago
- Simple q-learning implementation for taxi-v3 environment of Open AI gym.☆21Feb 16, 2022Updated 4 years ago
- Automatic code generator for training Reinforcement Learning policies☆11Jan 3, 2021Updated 5 years ago
- Python implement of paper "PD-FAC: Probability Density Factorized Multi-Agent Distributional Reinforcement Learning for Multi-Robot Relia…☆11Mar 5, 2022Updated 4 years ago
- ☆15Dec 13, 2024Updated last year
- Gymnasium environment for research of UAVs and risk constraints☆12Oct 29, 2024Updated last year
- Tacotron 2 training notebook supporting Japanese, French, and Mandarin☆11Nov 19, 2022Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Official implementation of the algorithmic approach presented in the research paper entitled "Risk-Sensitive Policy with Distributional R…☆15Dec 19, 2022Updated 3 years ago
- ☆12Mar 18, 2024Updated 2 years ago
- Codebase describing experiments in Truncation Sampling as Language Model Desmoothing☆13Dec 6, 2022Updated 3 years ago
- Electroplating simulation environment☆20Sep 26, 2024Updated last year
- Robust Reinforcement Learning Benchmark☆12Sep 22, 2024Updated last year
- ☆11Feb 29, 2024Updated 2 years ago
- 变邻域搜索算法(VNS)求解TSP(附C++详细代码及注释)☆10May 12, 2019Updated 6 years ago
- This project offers to solve Multi-Agent-Path-Finding(MAPF) problem optimally using Conflict-Based Search(CBS).☆13Aug 31, 2022Updated 3 years ago
- Tutorial on how to train a custom voice recognition model using Hugging face models.☆11Jul 2, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- EuroSys '24: "Trinity: A Fast Compressed Multi-attribute Data Store"☆18Mar 8, 2025Updated last year
- Scalable Monotonic Neural Networks☆12Mar 14, 2024Updated 2 years ago
- [RAL 2025] MTIL: Encoding Full History with Mamba for Temporal Imitation Learning☆43Apr 2, 2026Updated 3 weeks ago
- 桌面天气预报(基于Qt5,代码结构清晰并含有详细注释)☆11Jul 29, 2023Updated 2 years ago
- ☆10Jun 23, 2023Updated 2 years ago
- Code to accompany "Conformal Prediction as Bayesian Quadrature" by Jake Snell & Tom Griffiths (ICML 2025 Outstanding Paper)☆23Jul 14, 2025Updated 9 months ago
- A caching framework for microservice applications☆24Apr 22, 2024Updated 2 years ago
- Code for Learning to Defer to Multiple Experts: Consistent Surrogate Losses, Confidence Calibration, and Conformal Ensembles [AISTATS'23]☆13Jul 28, 2023Updated 2 years ago
- This project is from the Airbnb Recruitment Challenge on Kaggle. The challenge is to solve a multi-class classification problem of predic…☆11Feb 22, 2022Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- A survival guide for Vanderbilt Biostatistics first year comprehensive exams☆14May 12, 2020Updated 5 years ago
- A python implementation of the COACH algorithm for the Cartpole problem in OpenAI gym.☆11Mar 15, 2019Updated 7 years ago
- General information about DEEP BERLIN's AI for Good Hackathon 2020☆11Apr 14, 2020Updated 6 years ago
- Contains an implementation of "Imitation Learning via Kernel Mean Embedding (2018, AAAI)"☆11Oct 2, 2018Updated 7 years ago
- Variational Monte Carlo with generative flows for mini-BMN matrix models☆19Aug 20, 2020Updated 5 years ago
- Open DRUWA - Open Deep Realtime User Welcoming Assistant☆16Nov 4, 2022Updated 3 years ago
- Elegant and efficient error handling and reporting patterns for C☆10Oct 9, 2016Updated 9 years ago