☆15 · Mar 4, 2020 · Updated 6 years ago
Alternatives and similar repositories for reinforcement-learning-sutton
Users who are interested in reinforcement-learning-sutton are comparing it to the libraries listed below.
- An open source reinforcement learning codebase with a variety of intrinsic exploration methods implemented in PyTorch. ☆11 · Feb 6, 2023 · Updated 3 years ago
- Bayesian Regression Models using pymc3 ☆11 · Feb 4, 2017 · Updated 9 years ago
- Least Squares Policy Iteration (LSPI) in Python ☆11 · May 25, 2015 · Updated 11 years ago
- Multi-Agent training using Deep Deterministic Policy Gradient Networks, Solving the Tennis Environment ☆11 · Oct 20, 2018 · Updated 7 years ago
- ☆13 · Jul 26, 2014 · Updated 11 years ago
- Deep Learning Course Project ☆11 · Dec 9, 2017 · Updated 8 years ago
- Code for the Reset-free Trial and Error learning paper (RTE) experiments ☆10 · Jan 3, 2018 · Updated 8 years ago
- Learning Backtracking Models, ICLR'19 ☆10 · Feb 2, 2018 · Updated 8 years ago
- Prioritized Sequence Experience Replay ☆10 · Aug 16, 2021 · Updated 4 years ago
- Code for Policy Consolidation for Continual Reinforcement Learning ☆10 · May 12, 2019 · Updated 7 years ago
- Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy ☆23 · Oct 28, 2024 · Updated last year
- Reward Propagation using Graph Convolutional Networks ☆13 · Jun 19, 2021 · Updated 4 years ago
- Official implementation of Harnessing Mixed Offline Reinforcement Learning Datasets via Trajectory Reweighting ☆16 · Feb 14, 2024 · Updated 2 years ago
- Implementation of "Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update", NeurIPS 2019. ☆16 · Sep 24, 2019 · Updated 6 years ago
- CoCoFL: Communication- and Computation-Aware Federated Learning via Partial NN Freezing and Quantization ☆13 · Aug 3, 2024 · Updated last year
- EC EN 674 Flight Dynamics & Control Design Project ☆15 · May 24, 2023 · Updated 3 years ago
- [ICLR 2025] RaSA: Rank-Sharing Low-Rank Adaptation ☆10 · May 19, 2025 · Updated last year
- Reward Estimation for Variance Reduction in Deep Reinforcement Learning ☆22 · Oct 26, 2018 · Updated 7 years ago
- A list of top level domains (TLDs) in CSV-format ☆29 · Apr 14, 2023 · Updated 3 years ago
- Code accompanying "Learning What To Do by Simulating the Past", ICLR 2021. ☆27 · May 4, 2021 · Updated 5 years ago
- A mini book for java unit testing ☆13 · Dec 12, 2020 · Updated 5 years ago
- Translation and understanding of the Pop-art paper. ☆18 · Oct 21, 2019 · Updated 6 years ago
- The implementation of a video stabilization system of the NAO robot is presented through the data of the IMU, this stabilized image is us… ☆17 · Mar 20, 2020 · Updated 6 years ago
- OpenPifPaf plugin for Posetrack ☆32 · May 27, 2022 · Updated 4 years ago
- Lightweight python library for launching experiments and tuning hyperparameters, either locally or on a cluster ☆23 · Sep 29, 2023 · Updated 2 years ago
- Implementation of Population-Guided Parallel Policy Search for Reinforcement Learning ☆22 · Jan 9, 2020 · Updated 6 years ago
- ☆16 · Feb 8, 2024 · Updated 2 years ago
- [AAMAS 2025] Privacy-preserving and Personalized RLHF, with convergence guarantees. The Code contains experiments for training multiple i… ☆16 · Apr 16, 2025 · Updated last year
- 🛩️⚙️ 3D Planning, PID Control, Extended Kalman Filter for the Udacity Flying Car Nanodegree // FCND-Term1 ☆19 · May 22, 2020 · Updated 6 years ago
- CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery ☆86 · Jul 27, 2022 · Updated 3 years ago
- ☆13 · Jan 16, 2025 · Updated last year
- A PPO agent leveraging reinforcement learning performs Penetration Testing in a simulated computer network environment. The agent is trai… ☆29 · Apr 2, 2025 · Updated last year
- Visual Odometry System using MPU9250 IMU and Ueye Global Shutter Camera ☆24 · Mar 21, 2019 · Updated 7 years ago
- The first real-world FL benchmark for legal NLP ☆13 · Nov 29, 2023 · Updated 2 years ago
- OpenAI Gym compatible reinforcement learning environment for Space Fortress https://arxiv.org/abs/1809.02206 ☆11 · Aug 30, 2024 · Updated last year
- Codes for Evolving Plastic ANNs ☆14 · Dec 18, 2022 · Updated 3 years ago
- A high-level HTTP / REST client for Node ☆29 · Oct 2, 2020 · Updated 5 years ago
- Automatic Recall Machines: Internal Replay, Continual Learning and the Brain ☆11 · Jul 14, 2020 · Updated 5 years ago
- The official implementation of SGPT (CVPR2024) ☆18 · Jul 15, 2024 · Updated last year