OpenAI团队的深度强化学习教程中文版
☆35May 16, 2020Updated 5 years ago
Alternatives and similar repositories for spinningup
Users that are interested in spinningup are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Codes for the paper "Multi-task Hierarchical Adversarial Inverse Reinforcement Learning"☆19May 20, 2023Updated 2 years ago
- Codebase for ReLMM☆22Apr 17, 2023Updated 2 years ago
- PyTorch implementation of the paper Overcoming Exploration in Reinforcement Learning with Demonstrations in surgical robot manipulation t…☆12Aug 21, 2022Updated 3 years ago
- A PyTorch implementation of SSINet.☆16Nov 10, 2020Updated 5 years ago
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023) 足球游戏智能体☆14May 25, 2023Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- OpenaAI Gym Franka Emika Panda robot environment based on PyBullet.☆11Sep 8, 2023Updated 2 years ago
- ☆14Oct 10, 2025Updated 5 months ago
- Open source code for paper "Learning World Models with Identifiable Factorization"☆13Mar 4, 2024Updated 2 years ago
- Ossian generic framework☆12Aug 25, 2021Updated 4 years ago
- [ICLR 2025 SynthData Workshop Spotlight] Empowering LLMs in Decision Games through Algorithmic Data Synthesis☆32Apr 27, 2025Updated 11 months ago
- ☆34Mar 24, 2023Updated 3 years ago
- Qt-like event loops, signals and slots for communication across threads and processes in Python☆14Mar 26, 2024Updated 2 years ago
- SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand Cores☆15Apr 24, 2024Updated last year
- MindSpore implementations of deep reinforcement learning algorithms and environments☆16Sep 3, 2023Updated 2 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Made for a reading group at the Center for Safe AGI.☆12Feb 23, 2026Updated last month
- An implementation of HOME: Heatmap Output for future Motion Estimation☆13Feb 7, 2022Updated 4 years ago
- This is the official repository for the paper "Guided Exploration with Proximal Policy Optimization using a Single Demonstration", https:…☆19Oct 5, 2021Updated 4 years ago
- PSO for Nash Equilibrium. This is the code for my undergraduate thesis.粒子群算法求解纳什均衡☆11Jan 5, 2023Updated 3 years ago
- 📜 [NeurIPS 2022] "Symbolic Distillation for Learned TCP Congestion Control", S P Sharan, Wenqing Zheng, Kuo-Feng Hsu, Jiarong Xing, Ang …☆16Oct 13, 2022Updated 3 years ago
- An reconstruction of RL Introduction and its course materials for a more efficient entry☆21Mar 4, 2026Updated 3 weeks ago
- Collision Avoidance using Buffered Voronoi Cell☆14Feb 10, 2017Updated 9 years ago
- ☆19Jun 30, 2024Updated last year
- machine learning algorithms source code☆24Jun 8, 2021Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- IJCAI 2019 - Regularized Opponent Model with Maximum Entropy Objective (ROMMEO)☆23Dec 8, 2022Updated 3 years ago
- Code of the paper "Universal Morphology Control via Contextual Modulation" at ICML 2023☆13Aug 3, 2023Updated 2 years ago
- 《Reinforcement Learning: An Introduction》(第二版)中文翻译☆657Apr 9, 2022Updated 3 years ago
- Superfast Line Spectral Estimation☆11Jan 27, 2023Updated 3 years ago
- MLOT - A machine learning algorithm, written for use with MATLAB, in order to track in 3D moving particles based on a training data set. …☆12Dec 24, 2018Updated 7 years ago
- 2D Simulator for Smart Decision in ICRA 2019 RoboMaster AI Challenge☆57Apr 6, 2021Updated 4 years ago
- ☆14Mar 26, 2019Updated 7 years ago
- ☆16Aug 30, 2022Updated 3 years ago
- Cebinae: Scalable In-network Fairness Augmentation (SIGCOMM 2022)☆22Jul 2, 2022Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- This repository contains all the projects, and necessary scripts and files developed for the anti-jamming projects. You can clone it to t…☆27Aug 13, 2023Updated 2 years ago
- Block-Recurrent Dynamics in ViTs 🦖☆34Dec 24, 2025Updated 3 months ago
- ☆18May 4, 2020Updated 5 years ago
- Adaptive Attention Span for Reinforcement Learning☆136May 11, 2020Updated 5 years ago
- Approximate Bayesian Inference Toolkit (Python, C++)☆14Apr 16, 2014Updated 11 years ago
- ☆93Feb 16, 2026Updated last month
- ☆17Oct 8, 2024Updated last year