[动手学强化学习]系列,基于pytorch。
☆59Jun 2, 2021Updated 4 years ago
Alternatives and similar repositories for reinforcement_learning
Users that are interested in reinforcement_learning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆11Feb 28, 2022Updated 4 years ago
- Multi-objective application placement in fog computing using graph neural network-based reinforcement learning☆10Oct 20, 2025Updated 5 months ago
- dqn autoplay mario bros☆21Jul 24, 2017Updated 8 years ago
- 动手学强化学习代码☆66Jan 17, 2024Updated 2 years ago
- pytorch implementation of DQN, NAF, DDPG☆13Jun 7, 2018Updated 7 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Repository for Robust Trajectory Optimization with Stochastic Complementarity☆12Dec 15, 2020Updated 5 years ago
- Reinforcement Learning for Self Organization and Power Control of Two-Tier Heterogeneous Networks☆20Jan 18, 2020Updated 6 years ago
- Source code for the paper "Policy Architectures for Compositional Generalization in Control"☆30May 19, 2022Updated 3 years ago
- [ICLR' 25] The PyTorch implementation of our paper: "Exponential Topology-enabled Scalable Communication in Multi-agent Reinforcement Lea…☆21Feb 26, 2025Updated last year
- Rethinking Graph Regularization for Graph Neural Networks (AAAI2021)☆34Jun 6, 2021Updated 4 years ago
- 引用整理https://blog.csdn.net/yellow_red_people/article/details/80465510 一文中PyTorch平台,利用DQN模型玩Flappy Bird游戏,是一个再励学习(强化学习)实验例子。☆52Feb 10, 2019Updated 7 years ago
- Parser for files in OpenDRIVE format, offers additional functions to navigate through the road network☆12Sep 6, 2017Updated 8 years ago
- A step by step implementation of building an AI agent that plays 3d shooting game☆21Jul 16, 2025Updated 8 months ago
- Repository for "Known Unknowns: Uncertainty Quality in Bayesian Neural Networks" paper.☆12Mar 3, 2017Updated 9 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- The Manta v1 software architecture for Autonomous Underwater Vehicles (AUVs) - Master's thesis☆10Aug 11, 2022Updated 3 years ago
- Hybrid Computational Offloading☆16Jul 6, 2022Updated 3 years ago
- ☆36Mar 12, 2019Updated 7 years ago
- A jax/stax implementation of: Mnih, V., Kavukcuoglu, K., Silver, D., Rusu, A.A., Veness, J., Bellemare, M.G., Graves, A., Riedmiller, M.,…☆10Dec 7, 2020Updated 5 years ago
- Deep Reinforcement Learning and BCD to solve phase shift and resource allocation of RIS and RSU☆32Jan 18, 2021Updated 5 years ago
- Repo for our AKBC-2021 paper: Abg-CoQA: Clarifying Ambiguity in Conversational Question Answering☆10Oct 10, 2021Updated 4 years ago
- 基于Deep Qlearning Network的股票交易模型☆57May 15, 2017Updated 8 years ago
- ☆15Dec 10, 2019Updated 6 years ago
- (Pattern Recognition 2025) Towards Trustworthy Dataset Distillation☆14Dec 8, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Gym Environment for AUV docking procedure☆11Sep 20, 2022Updated 3 years ago
- ROS wrapper for the Oculus M750d Multibeam Echosounder used in the Maritime Robotics Laboratory at KTH.☆13Sep 3, 2021Updated 4 years ago
- [NeurIPS 2022] Leveraging Factored Action Spaces for Efficient Offline RL in Healthcare. https://arxiv.org/abs/2305.01738☆11Nov 27, 2022Updated 3 years ago
- Application for detecting command and control (C2) communication through network traffic analysis.☆16May 12, 2023Updated 2 years ago
- Trying to come up with an innovative robotic arm trajectory generating controller.☆17Sep 18, 2020Updated 5 years ago
- A wrapper around SOEM to allow multiple masters and devices on EtherCAT☆18Feb 23, 2024Updated 2 years ago
- Implementation of Mutan+ArticleNet on OKVQA☆10Jan 11, 2021Updated 5 years ago
- Not All Patches Are Equal: Hierarchical Dataset Condensation for Single Image Super-Resolution☆11May 7, 2024Updated last year
- TensorFlow implementation of "A Relational Intervention Approach for Unsupervised Dynamics Generalization in Model-Based Reinforcement Le…☆16Jul 2, 2022Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- INSCIT: Information-Seeking Conversations with Mixed-Initiative Interactions☆16Jan 21, 2025Updated last year
- ☆19Feb 25, 2023Updated 3 years ago
- Exploiting Inter-sample and Inter-feature Relations in Dataset Distillation (CVPR24)☆11Jun 16, 2024Updated last year
- This is a official code implementation for Nonlinear RISE based Integral Reinforcement Learning algorithms for perturbed Bilateral Teleop…☆24Mar 26, 2025Updated last year
- Experiments on Model-Agnostic Meta-Learning on Few-Shot Image Classification and Meta-RL (Meta-World)☆17Mar 30, 2021Updated 5 years ago
- Source code for "Learning Deep Priors for Image Dehazing", ICCV 2019☆10Sep 18, 2020Updated 5 years ago
- ☆11Jul 30, 2025Updated 8 months ago