OpenAI团队的深度强化学习教程中文版
☆35May 16, 2020Updated 5 years ago
Alternatives and similar repositories for spinningup
Users that are interested in spinningup are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Codes for the paper "Multi-task Hierarchical Adversarial Inverse Reinforcement Learning"☆19May 20, 2023Updated 2 years ago
- Codebase for ReLMM☆22Apr 17, 2023Updated 3 years ago
- PyTorch implementation of the paper Overcoming Exploration in Reinforcement Learning with Demonstrations in surgical robot manipulation t…☆12Aug 21, 2022Updated 3 years ago
- A PyTorch implementation of SSINet.☆16Nov 10, 2020Updated 5 years ago
- An easy to understand implementation of the paper "Model-Based Reinforcement Learning for Atari"☆18Sep 27, 2019Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- OpenAI团队的深度强化学习教程中文版☆91May 21, 2023Updated 2 years ago
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023) 足球游戏智能体☆14May 25, 2023Updated 2 years ago
- 臸娥粂陆亩竟☆10May 11, 2024Updated last year
- OpenaAI Gym Franka Emika Panda robot environment based on PyBullet.☆11Sep 8, 2023Updated 2 years ago
- Implementation for mSAC methods in PyTorch☆42Oct 10, 2021Updated 4 years ago
- Open source code for paper "Learning World Models with Identifiable Factorization"☆13Mar 4, 2024Updated 2 years ago
- opencv调用jetson/rk3588 mpp硬解码,重写了open与read函数,支持h264/h265☆14Nov 27, 2025Updated 5 months ago
- ffmpeg+cuvid+tensorrt+multicamera☆12Dec 31, 2024Updated last year
- Experimenting with meta-learning approaches to opponent modelling in MARL. Building upon previous public implementations of MADDPG and M3…☆14Apr 26, 2022Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Ossian generic framework☆12Aug 25, 2021Updated 4 years ago
- GAMMA: A General Agent Motion Prediction Model for Autonomous Driving☆14Nov 17, 2021Updated 4 years ago
- ☆34Mar 24, 2023Updated 3 years ago
- ☆24Feb 24, 2023Updated 3 years ago
- Made for a reading group at the Center for Safe AGI.☆12Feb 23, 2026Updated 2 months ago
- An implementation of HOME: Heatmap Output for future Motion Estimation☆13Feb 7, 2022Updated 4 years ago
- PSO for Nash Equilibrium. This is the code for my undergraduate thesis.粒子群算法求解纳什均衡☆11Jan 5, 2023Updated 3 years ago
- An reconstruction of RL Introduction and its course materials for a more efficient entry☆22Mar 4, 2026Updated 2 months ago
- ☆29Oct 10, 2018Updated 7 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Macro-Action Generator-Critic (MAGIC) - Learning Macro-actions for online POMDP planning☆17Feb 23, 2023Updated 3 years ago
- Maddpg_flight code☆10Jul 4, 2018Updated 7 years ago
- machine learning algorithms source code☆25Jun 8, 2021Updated 4 years ago
- Reimplementation of Policy Optimization with Demonstrations (POfD) from ICML 2018.☆16Jun 5, 2019Updated 6 years ago
- IJCAI 2019 - Regularized Opponent Model with Maximum Entropy Objective (ROMMEO)☆23Dec 8, 2022Updated 3 years ago
- ☆21Jul 2, 2024Updated last year
- object tracking learning☆13Aug 29, 2020Updated 5 years ago
- Reinforcement learning based multi object tracker☆10Jan 29, 2018Updated 8 years ago
- [NeurIPS, 2020 - Reproducibility Challenge]: [RE] Towards Interpretable Reinforcement Learning Using Attention Augmented Agents☆13Apr 26, 2021Updated 5 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Image Restoration via Multi-domain Learning☆26May 25, 2025Updated 11 months ago
- 《Reinforcement Learning: An Introduction》(第二版)中文翻译☆665Apr 9, 2022Updated 4 years ago
- AI model for making mazes that extends OpenAIs GPT2 model☆15Dec 21, 2023Updated 2 years ago
- 2D Simulator for Smart Decision in ICRA 2019 RoboMaster AI Challenge☆57Apr 6, 2021Updated 5 years ago
- A repository to introduce the algorithmic information theory. You could learn what is Kolmogorov complexity and why it is important here.☆13Jul 23, 2025Updated 9 months ago
- [ICLR 2025 SynthData Workshop Spotlight] Empowering LLMs in Decision Games through Algorithmic Data Synthesis☆42Apr 27, 2025Updated last year
- ☆11Oct 8, 2022Updated 3 years ago