OpenAI团队的深度强化学习教程中文版
☆35May 16, 2020Updated 6 years ago
Alternatives and similar repositories for spinningup
Users that are interested in spinningup are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Codes for the paper "Multi-task Hierarchical Adversarial Inverse Reinforcement Learning"☆19May 20, 2023Updated 3 years ago
- 控制算法☆25May 21, 2025Updated last year
- [ICLR 2023] The official code for paper "Guarded Policy Optimization with Imperfect Online Demonstrations"☆14Apr 30, 2023Updated 3 years ago
- ROS配置和使用Xbox One无线手柄☆17Sep 12, 2018Updated 7 years ago
- PyTorch implementation of the paper Overcoming Exploration in Reinforcement Learning with Demonstrations in surgical robot manipulation t…☆12Aug 21, 2022Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A PyTorch implementation of SSINet.☆16Nov 10, 2020Updated 5 years ago
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023) 足球游戏智能体☆14May 25, 2023Updated 3 years ago
- Modified Pytorch Lightning implementation of paper:-https://jcheminf.biomedcentral.com/track/pdf/10.1186/s13321-019-0407-y☆10Dec 22, 2020Updated 5 years ago
- Neural Message Passing for NMR Chemical Shift Prediction☆11Aug 10, 2022Updated 3 years ago
- Implementation for mSAC methods in PyTorch☆42Oct 10, 2021Updated 4 years ago
- opencv调用jetson/rk3588 mpp硬解码,重写了open与read函数,支持h264/h265☆14Nov 27, 2025Updated 6 months ago
- ffmpeg+cuvid+tensorrt+multicamera☆12Dec 31, 2024Updated last year
- android compose catalog☆17Jul 4, 2025Updated 11 months ago
- Experimenting with meta-learning approaches to opponent modelling in MARL. Building upon previous public implementations of MADDPG and M3…☆14Apr 26, 2022Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Ossian generic framework☆12Aug 25, 2021Updated 4 years ago
- Visual Studio Code Extension that aligns all cursors using spaces.☆29Apr 29, 2026Updated last month
- GAMMA: A General Agent Motion Prediction Model for Autonomous Driving☆14Nov 17, 2021Updated 4 years ago
- Replication of "Taming the Factor Zoo: A Test of New Factors (Feng, Giglio, and Xiu, 2020, JF)"☆10Mar 4, 2024Updated 2 years ago
- ☆34Mar 24, 2023Updated 3 years ago
- ☆24Feb 24, 2023Updated 3 years ago
- Allegro hand controller package which works with ROS Noetic.☆24May 9, 2024Updated 2 years ago
- SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand Cores☆15Apr 24, 2024Updated 2 years ago
- MindSpore implementations of deep reinforcement learning algorithms and environments☆17Sep 3, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Made for a reading group at the Center for Safe AGI.☆12Feb 23, 2026Updated 3 months ago
- An implementation of HOME: Heatmap Output for future Motion Estimation☆13Feb 7, 2022Updated 4 years ago
- An reconstruction of RL Introduction and its course materials for a more efficient entry☆21Mar 4, 2026Updated 3 months ago
- Collision Avoidance using Buffered Voronoi Cell☆14Feb 10, 2017Updated 9 years ago
- ☆29Oct 10, 2018Updated 7 years ago
- Codes for Complex-Valued Spectrum Estimation Network and Applications in Super-Resolution HRRPs Analysis with Wideband Radars☆12Dec 10, 2021Updated 4 years ago
- ☆19Jun 30, 2024Updated last year
- Whale Optimization Algorithm used to train Neural Network☆19Nov 28, 2016Updated 9 years ago
- Maddpg_flight code☆10Jul 4, 2018Updated 7 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Reimplementation of Policy Optimization with Demonstrations (POfD) from ICML 2018.☆16Jun 5, 2019Updated 7 years ago
- machine learning algorithms source code☆25Jun 8, 2021Updated 5 years ago
- IJCAI 2019 - Regularized Opponent Model with Maximum Entropy Objective (ROMMEO)☆23Dec 8, 2022Updated 3 years ago
- Code of the paper "Universal Morphology Control via Contextual Modulation" at ICML 2023☆15Aug 3, 2023Updated 2 years ago
- Flash Artifact for SIGCOMM22☆14Jun 14, 2022Updated 4 years ago
- ☆21Jul 2, 2024Updated last year
- object tracking learning☆13Aug 29, 2020Updated 5 years ago