Applying the minimax-Q learning algorithm to two-agent games
☆33Nov 27, 2017Updated 8 years ago
Alternatives and similar repositories for MinimaxQ-Learning
Users who are interested in MinimaxQ-Learning are comparing it to the libraries listed below.
- We use policy gradient to help agents learn optimal policies in a competitive multi-agent contextual bandit setting☆12Mar 9, 2018Updated 8 years ago
- Testing different RL algorithms for multi-agent environments. From SARSA, QLearning to Independent Q-Learning, Joint Action Learning and …☆12Mar 29, 2019Updated 7 years ago
- ☆12Mar 21, 2024Updated 2 years ago
- A Survey on Wi-Fi Channel State Information Datasets for Human Activity Recognition☆14Aug 3, 2022Updated 3 years ago
- Differential game theory for multi-agent collision avoidance. Simulations set up.☆12Jan 27, 2021Updated 5 years ago
- multi-agent reinforcement learning for competitive environments using pytorch☆14Dec 31, 2019Updated 6 years ago
- A GPU-accelerated toolbox for hyperbolic PDEs in a weaker (viscosity) sense. It leverages the integral to the solution of the conservatio…☆14May 6, 2026Updated 3 weeks ago
- Pytorch implementation of the paper 'Towards Scenario Generalization for Vision-based Roadside 3D Object Detection'☆17Mar 9, 2025Updated last year
- Code to reproduce the experiments from the paper "Self-Compatibility: Evaluating Causal Discovery without Ground Truth"☆12Mar 9, 2024Updated 2 years ago
- ☆11Nov 8, 2022Updated 3 years ago
- Roshambo bots including Python translation of Dan Egnor's Iocaine Powder☆11Jul 28, 2012Updated 13 years ago
- Application of Deep Reinforcement Learning to Supply Chain management. Reference: https://blog.griddynamics.com/deep-reinforcement-learni…☆12Jul 21, 2021Updated 4 years ago
- A reinforcement deep learning approach for route planning.☆14Nov 6, 2020Updated 5 years ago
- Code for our NeurIPS 2020 paper Improving Generalization in Reinforcement Learning with Mixture Regularization☆34Oct 22, 2020Updated 5 years ago
- 🎾 Multi-Agent Proximal Policy Optimization approach to a competitive reinforcement learning problem☆22Sep 25, 2022Updated 3 years ago
- pytorch implementation of DQN, NAF, DDPG☆13Jun 7, 2018Updated 7 years ago
- OpenAI Gym environment for graph search problems such as shortest path.☆11Dec 24, 2019Updated 6 years ago
- Code for the paper "Functional Regularization for Reinforcement Learning via Learned Fourier Features"☆20Oct 2, 2022Updated 3 years ago
- Boolean satisfiability for propositional logic in Python☆14Aug 19, 2025Updated 9 months ago
- Learning Multiaspect Traffic Couplings by Multirelational Graph Attention Networks for Traffic Prediction☆13Oct 7, 2022Updated 3 years ago
- Link to paper: https://www.ssrn.com/abstract=3804655☆13Jul 27, 2021Updated 4 years ago
- ☆11Aug 9, 2017Updated 8 years ago
- JAX implementations of various deep reinforcement learning algorithms.☆25Feb 2, 2025Updated last year
- A python client library for microRTS.☆20Feb 5, 2020Updated 6 years ago
- Fictitious Self-play & Reinforcement Learning☆18Jan 26, 2018Updated 8 years ago
- ☆10Jun 26, 2024Updated last year
- ☆13Updated this week
- The Official Implementation of Domain Adaptive Imitation Learning (DAIL)☆24Oct 26, 2020Updated 5 years ago
- Code to reproduce the NeurIPS 2019 paper "Generalization in Reinforcement Learning with Selective Noise Injection and Information Bottlen…☆52Jun 28, 2020Updated 5 years ago
- Learning from Guided Play: A Scheduled Hierarchical Approach for Improving Exploration in Adversarial Imitation Learning Source Code☆17Aug 23, 2024Updated last year
- Simple Python implementation of a Complete/Systematic SAT Solver with the DPLL algorithm☆14May 24, 2018Updated 8 years ago
- Network Randomization: A Simple Technique for Generalization in Deep Reinforcement Learning / ICLR 2020☆56Apr 27, 2020Updated 6 years ago
- [NeurIPS'24] Unleashing the Potential of the Diffusion Model in Few-shot Semantic Segmentation (Diffews)☆51Apr 14, 2025Updated last year
- [IJCAI'23] Semantic-aware Generation of Multi-view Portrait Drawings (SAGE)☆10Feb 25, 2024Updated 2 years ago
- An M.Sc project on multi-agents AI using the Python module Pyke.☆13Jan 27, 2012Updated 14 years ago
- "Towards Improving Document Understanding: An Exploration on Text-Grounding via MLLMs" 2023☆16Nov 28, 2024Updated last year
- The goal of the project is to implement robotic agents that could rapidly build structures from random objects in a disaster/crisis situa…☆11Dec 8, 2017Updated 8 years ago
- Predicting stock value☆22Sep 9, 2018Updated 7 years ago
- ☆27Jan 20, 2021Updated 5 years ago