☆15Mar 4, 2020Updated 6 years ago
Alternatives and similar repositories for reinforcement-learning-sutton
Users that are interested in reinforcement-learning-sutton are comparing it to the libraries listed below.
- A minimal Unreal Engine project for developing and testing UnrealCV☆17Nov 8, 2018Updated 7 years ago
- Bayesian Regression Models using pymc3☆11Feb 4, 2017Updated 9 years ago
- Learning Backtracking Models, ICLR'19☆10Feb 2, 2018Updated 8 years ago
- APM Plane, APM Copter, APM Rover, and APM Solo historical tagged releases☆14Nov 10, 2016Updated 9 years ago
- Code for Policy Consolidation for Continual Reinforcement Learning☆10May 12, 2019Updated 7 years ago
- Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy☆23Oct 28, 2024Updated last year
- Reward Propagation using Graph Convolutional Networks☆13Jun 19, 2021Updated 5 years ago
- Official implementation of Harnessing Mixed Offline Reinforcement Learning Datasets via Trajectory Reweighting☆16Feb 14, 2024Updated 2 years ago
- Implementation of "Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update", NeurIPS 2019.☆16Sep 24, 2019Updated 6 years ago
- BING++: A Fast High Quality Object Proposal Generator at 100fps☆16May 18, 2016Updated 10 years ago
- [Findings of ACL 2023] Communication Efficient Federated Learning for Multilingual Machine Translation with Adapter☆12Sep 4, 2023Updated 2 years ago
- CoCoFL: Communication- and Computation-Aware Federated Learning via Partial NN Freezing and Quantization☆13Aug 3, 2024Updated last year
- [ICLR 2025] RaSA: Rank-Sharing Low-Rank Adaptation☆10May 19, 2025Updated last year
- My defense presentation☆10Mar 7, 2022Updated 4 years ago
- Reward Estimation for Variance Reduction in Deep Reinforcement Learning☆23Oct 26, 2018Updated 7 years ago
- Code accompanying "Learning What To Do by Simulating the Past", ICLR 2021.☆27May 4, 2021Updated 5 years ago
- A mini book for java unit testing☆13Dec 12, 2020Updated 5 years ago
- Tensorflow code for "Learning Self-Imitating Diverse Policies" (ICLR 2019)☆20Nov 26, 2020Updated 5 years ago
- Translation and understanding of the Pop-art paper.☆18Oct 21, 2019Updated 6 years ago
- Code for the paper "Pretrained Models for Multilingual Federated Learning" at NAACL 2022☆11Aug 9, 2022Updated 3 years ago
- The implementation of a video stabilization system of the NAO robot is presented through the data of the IMU, this stabilized image is us…☆17Mar 20, 2020Updated 6 years ago
- Code for ijcai-24 paper "Federated Adaptation for Foundation Model-based Recommendations"☆12Apr 18, 2025Updated last year
- Lightweight python library for launching experiments and tuning hyperparameters, either locally or on a cluster☆24Sep 29, 2023Updated 2 years ago
- Implementation of Population-Guided Parallel Policy Search for Reinforcement Learning☆22Jan 9, 2020Updated 6 years ago
- ☆16Feb 8, 2024Updated 2 years ago
- [AAMAS 2025] Privacy-preserving and Personalized RLHF, with convergence guarantees. The Code contains experiments for training multiple i…☆16Apr 16, 2025Updated last year
- 🛩️⚙️ 3D Planning, PID Control, Extended Kalman Filter for the Udacity Flying Car Nanodegree // FCND-Term1☆20May 22, 2020Updated 6 years ago
- CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery☆88Jul 27, 2022Updated 3 years ago
- A PPO agent leveraging reinforcement learning performs Penetration Testing in a simulated computer network environment. The agent is trai…☆29Apr 2, 2025Updated last year
- Visual Odometry System using MPU9250 IMU and Ueye Global Shutter Camera☆24Mar 21, 2019Updated 7 years ago
- A Survey on Vulnerability of Federated Learning: An Algorithm Perspective☆18May 30, 2024Updated 2 years ago
- A federated image segmentation method based on style transfer☆16Sep 28, 2024Updated last year
- The first real-world FL benchmark for legal NLP☆13Nov 29, 2023Updated 2 years ago
- OpenAI Gym compatible reinforcement learning environment for Space Fortress https://arxiv.org/abs/1809.02206☆11Aug 30, 2024Updated last year
- Associated codebase for Byzantine-resilient distributed / decentralized machine learning papers from INSPIRE Lab☆14Oct 11, 2021Updated 4 years ago
- Common functionality for object detection☆16May 4, 2021Updated 5 years ago
- Codes for Evolving Plastic ANNs☆14Dec 18, 2022Updated 3 years ago
- ☆13Jun 27, 2024Updated last year
- The official implementation of SGPT (CVPR2024)☆18Jul 15, 2024Updated last year