A collection of deep reinforcement learning algorithm implementations
☆11Jan 9, 2020Updated 6 years ago
Alternatives and similar repositories for reinforcement_learning_collections
Users who are interested in reinforcement_learning_collections are comparing it to the libraries listed below
- Codes for "Efficient Offline Policy Optimization with a Learned Model", ICLR2023☆30Jul 18, 2023Updated 2 years ago
- Code for Unifying Gradient Estimators for Meta-Reinforcement Learning via Off-Policy Evaluation @ NeurIPS 2021☆13Nov 3, 2021Updated 4 years ago
- Scalable MCTS for team scenarios☆17Jun 14, 2024Updated last year
- ☆20Sep 14, 2019Updated 6 years ago
- curriculum☆27Feb 7, 2023Updated 3 years ago
- Implementation of DyMA-CL, MARL algorithm☆28Apr 18, 2020Updated 5 years ago
- (NeurIPS 2021) Neural Auto-Curricula in Two-Player Zero-Sum Games.☆28Nov 19, 2021Updated 4 years ago
- Codebase for [Order Matters: Agent-by-agent Policy Optimization](https://openreview.net/forum?id=Q-neeWNVv1)☆32Nov 22, 2025Updated 3 months ago
- Examples for the HEBI Robotics Python API☆14Jan 9, 2026Updated last month
- Multi-Agent RL Environment for the Stratego Board Game (and variants)☆34Oct 7, 2025Updated 4 months ago
- A project for simulating the rescue after a disaster☆10Dec 4, 2020Updated 5 years ago
- Course project: optimal control of a 2 dof manipulator exploiting the DDP algorithm☆13Mar 4, 2022Updated 4 years ago
- Public package to compute translationally and rotationally invariant wavelet-based statistics on images.☆10Aug 25, 2023Updated 2 years ago
- ☆10Apr 26, 2023Updated 2 years ago
- A multi-source cross-modal retrieval network☆14Jan 8, 2024Updated 2 years ago
- ☆10Nov 23, 2023Updated 2 years ago
- code for paper -- "Seamless Satellite-image Synthesis"☆17Jul 30, 2024Updated last year
- ☆11Sep 30, 2022Updated 3 years ago
- (TG'2023) Official code for the paper "Revisiting of AlphaStar" (previously called "Rethinking of AlphaStar"). It compares the raw interf…☆10Sep 6, 2021Updated 4 years ago
- Analysis of data from the videogame/eSport League of Legends☆12Nov 4, 2019Updated 6 years ago
- Detection of valid SET combinations from images with SET cards☆12Dec 8, 2022Updated 3 years ago
- Remote sensing Image Captioning is a special case of Image Captioning which solves the difficulties in processing the remote sensing imag…☆11Jun 16, 2021Updated 4 years ago
- This is the code repository for the paper "Zero-Sum Stochastic Stackelberg Games".☆16Oct 12, 2022Updated 3 years ago
- modified datasets for remote sensing image caption☆11Apr 23, 2019Updated 6 years ago
- The source code for an online Human-Agent Interaction (HAI) system, controlling directions (left or right) of Unity games using online EE…☆10Jan 17, 2021Updated 5 years ago
- yet another DL framework☆11Oct 28, 2018Updated 7 years ago
- N-Tuple Bandit Evolutionary Algorithm☆14May 8, 2020Updated 5 years ago
- Reflexxes Type II provides acceleration-limited trajectory smoothing☆11Feb 3, 2022Updated 4 years ago
- Source code for the Joint Shapley values: a measure of joint feature importance☆13Sep 14, 2021Updated 4 years ago
- Code for abstracting, evaluating, and visualizing Markov Decision Processes.☆10Jan 12, 2017Updated 9 years ago
- code for the paper "Adversarial Reinforced Instruction Attacker for Robust Vision-Language Navigation" (TPAMI 2021)☆10Jul 15, 2022Updated 3 years ago
- ☆10Mar 10, 2021Updated 4 years ago
- ☆10Feb 28, 2019Updated 7 years ago
- Codes for Evolving Plastic ANNs☆14Dec 18, 2022Updated 3 years ago
- ☆10Sep 3, 2021Updated 4 years ago
- This is the numerical approach proposed in the paper "Optimal Incentives to Mitigate Epidemics: A Stackelberg Mean Field Game Approach" b…☆12Nov 22, 2021Updated 4 years ago
- ☆10Nov 4, 2019Updated 6 years ago
- ☆11Jun 28, 2022Updated 3 years ago
- ♊ Minimal PyTorch Twin Delayed DDPG (TD3) implementation☆10Jun 20, 2021Updated 4 years ago