☆49Jan 30, 2026Updated 2 months ago
Alternatives and similar repositories for RLpapersnote
Users that are interested in RLpapersnote are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆12Aug 24, 2021Updated 4 years ago
- Algorithms Library for Supply Chain Inventory Optimization☆19Feb 2, 2019Updated 7 years ago
- Implementation of some of the Deep Distributional Reinforcement Learning Algorithms.☆26Jun 17, 2025Updated 9 months ago
- Code to accompany "Conformal Prediction as Bayesian Quadrature" by Jake Snell & Tom Griffiths (ICML 2025 Outstanding Paper)☆23Jul 14, 2025Updated 9 months ago
- Paper notes☆12Apr 27, 2018Updated 7 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Code for paper "Learning to Guide: Guidance Law Based on Deep Meta-learning and Model Predictive Path Integral Control"☆10May 26, 2019Updated 6 years ago
- Code for NeurIPS2021 submission "A Surrogate Objective Framework for Prediction+Programming with Soft Constraints"☆13Aug 30, 2021Updated 4 years ago
- Official Repository for "Scaling Multi-Agent Reinforcement Learning with Selective Parameter Sharing" (ICML2021)☆25Oct 26, 2021Updated 4 years ago
- ☆25Aug 25, 2021Updated 4 years ago
- ☆18Apr 17, 2019Updated 6 years ago
- OpenAI Gym environment for AirSim☆22Nov 27, 2019Updated 6 years ago
- ☆72May 5, 2023Updated 2 years ago
- Official implementation of NeurIPS22 paper “Multi-agent Dynamic Algorithm Configuration”☆26Mar 6, 2023Updated 3 years ago
- ☆14Dec 5, 2024Updated last year
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆21Dec 22, 2020Updated 5 years ago
- This project uses gpt-4 to build agents to play one night werewolf.☆10Jul 14, 2023Updated 2 years ago
- ☆13Jan 15, 2022Updated 4 years ago
- RDFS: an erasure code based cloud storage system☆38Jul 28, 2014Updated 11 years ago
- Risk-sensitive Inverse Reinforcement Learning☆11Sep 11, 2019Updated 6 years ago
- Jaxplorer is a Jax reinforcement learning (RL) framework for exploring new ideas.☆13Jul 19, 2024Updated last year
- SIR, SEIR, and beyond☆10Jul 6, 2023Updated 2 years ago
- Implementation of Continuous Control RL Algorithms☆11Dec 8, 2022Updated 3 years ago
- On-the-fly Table Generation - SIGIR'18☆10Feb 1, 2020Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Erasure code library for Erlang☆12Sep 5, 2024Updated last year
- 本工具采用随机算法计算指定文件夹内两两 .docx 文件间的相似性。☆15Jun 15, 2020Updated 5 years ago
- Separating value functions across time-scales.☆17May 13, 2019Updated 6 years ago
- ☆25Sep 30, 2025Updated 6 months ago
- (NeurIPS 2021) Neural Auto-Curricula in Two-Player Zero-Sum Games.☆28Nov 19, 2021Updated 4 years ago
- Code of our work "Maneuver-based Anchor Trajectory Hypotheses at Roundabouts".☆13Sep 15, 2022Updated 3 years ago
- ☆30Aug 20, 2021Updated 4 years ago
- Designing an optimized path for multiple robots in a warehouse for picking and delivery operations using A* algorithm (shortest path) and…☆11Jul 28, 2023Updated 2 years ago
- Reinforcement Learning Algorithms Based on PyTorch☆452Oct 21, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Official implementation of the algorithmic approach presented in the research paper entitled "Risk-Sensitive Policy with Distributional R…☆15Dec 19, 2022Updated 3 years ago
- ☆16Updated this week
- ☆12Mar 18, 2024Updated 2 years ago
- Autonomous vehicle learn how to navigate efficiently at crossroad☆16Jan 31, 2018Updated 8 years ago
- Codebase describing experiments in Truncation Sampling as Language Model Desmoothing☆13Dec 6, 2022Updated 3 years ago
- Control LLM generation format efficiently. A simple version of microsoft/aici in vllm and transformers☆14Jun 7, 2024Updated last year
- Code for paper: Traffic expertise meets residual RL: Knowledge-informed model-based residual reinforcement learning for CAV trajectory co…☆23Mar 1, 2025Updated last year