Applying the minimax-Q learning algorithm to two-agent games
☆33 · Nov 27, 2017 · Updated 8 years ago
Alternatives and similar repositories for MinimaxQ-Learning
Users who are interested in MinimaxQ-Learning are comparing it to the libraries listed below.
- We use policy gradient to help agents learn optimal policies in a competitive multi-agent contextual bandit setting ☆12 · Mar 9, 2018 · Updated 8 years ago
- Testing different RL algorithms for multi-agent environments. From SARSA, QLearning to Independent Q-Learning, Joint Action Learning and … ☆12 · Mar 29, 2019 · Updated 7 years ago
- ☆12 · Mar 21, 2024 · Updated 2 years ago
- A Survey on Wi-Fi Channel State Information Datasets for Human Activity Recognition ☆14 · Aug 3, 2022 · Updated 3 years ago
- The code to simulate spiking neural networks as used in the paper "Spiking Time-Dependent Plasticity Leads to Efficient Coding of Predict… ☆10 · Nov 24, 2019 · Updated 6 years ago
- Code to reproduce the experiments from the paper "Self-Compatibility: Evaluating Causal Discovery without Ground Truth" ☆12 · Mar 9, 2024 · Updated 2 years ago
- ☆17 · Feb 12, 2025 · Updated last year
- Application of Deep Reinforcement Learning to Supply Chain management. Reference: https://blog.griddynamics.com/deep-reinforcement-learni… ☆12 · Jul 21, 2021 · Updated 4 years ago
- 🎾 Multi-Agent Proximal Policy Optimization approach to a competitive reinforcement learning problem ☆22 · Sep 25, 2022 · Updated 3 years ago
- Code for the paper "Functional Regularization for Reinforcement Learning via Learned Fourier Features" ☆20 · Oct 2, 2022 · Updated 3 years ago
- Official PyTorch implementation of "Rethinking Value Function Learning for Generalization in Reinforcement Learning" (NeurIPS 2022) ☆15 · Feb 20, 2023 · Updated 3 years ago
- Learning Multiaspect Traffic Couplings by Multirelational Graph Attention Networks for Traffic Prediction ☆13 · Oct 7, 2022 · Updated 3 years ago
- Link to paper: https://www.ssrn.com/abstract=3804655 ☆14 · Jul 27, 2021 · Updated 4 years ago
- A ROS-independent Gazebo plugin for Ardupilot's SITL ☆10 · Aug 4, 2022 · Updated 3 years ago
- ☆11 · Aug 9, 2017 · Updated 8 years ago
- JAX implementations of various deep reinforcement learning algorithms. ☆25 · Feb 2, 2025 · Updated last year
- ☆24 · Aug 5, 2024 · Updated last year
- Stationary distributions for arbitrary finite state Markov processes, including specializations for the Moran, Wright-Fisher, and other … ☆22 · Aug 10, 2018 · Updated 7 years ago
- ☆10 · Jun 26, 2024 · Updated last year
- ☆13 · Updated this week
- Robotics in Python ☆13 · Feb 21, 2023 · Updated 3 years ago
- Some helpful glTF components for A-Frame ☆14 · Jan 18, 2023 · Updated 3 years ago
- Learning from Guided Play: A Scheduled Hierarchical Approach for Improving Exploration in Adversarial Imitation Learning Source Code ☆17 · Aug 23, 2024 · Updated last year
- Network Randomization: A Simple Technique for Generalization in Deep Reinforcement Learning / ICLR 2020 ☆56 · Apr 27, 2020 · Updated 5 years ago
- [NeurIPS'24] Unleashing the Potential of the Diffusion Model in Few-shot Semantic Segmentation (Diffews) ☆50 · Apr 14, 2025 · Updated last year
- Reimplementation of AAAI21 paper "Beyond Low-frequency Information in Graph Convolutional Networks" based on PyTorch and PyTorch Geometri… ☆24 · Sep 27, 2022 · Updated 3 years ago
- [IJCAI'23] Semantic-aware Generation of Multi-view Portrait Drawings (SAGE) ☆10 · Feb 25, 2024 · Updated 2 years ago
- An M.Sc project on multi-agents AI using the Python module Pyke. ☆13 · Jan 27, 2012 · Updated 14 years ago
- The goal of the project is to implement robotic agents that could rapidly build structures from random objects in a disaster/crisis situa… ☆11 · Dec 8, 2017 · Updated 8 years ago
- Predicting stock value ☆22 · Sep 9, 2018 · Updated 7 years ago
- Algorithms implemented in Go ☆11 · Jul 3, 2021 · Updated 4 years ago
- A tutorial on JAX (https://github.com/google/jax/) ☆47 · Jan 16, 2019 · Updated 7 years ago
- Stainless neural networks in JAX ☆34 · Feb 3, 2026 · Updated 2 months ago
- Documentation and guidelines for the Alan GPU cluster at the University of Liège. ☆21 · Jul 19, 2023 · Updated 2 years ago
- The official implementation of DropGNN: Random Dropouts Increase the Expressiveness of Graph Neural Networks (NeurIPS 2021) ☆26 · Jun 26, 2022 · Updated 3 years ago
- Official code for the paper, "TaCA: Upgrading Your Visual Foundation Model with Task-agnostic Compatible Adapter". ☆16 · Jun 20, 2023 · Updated 2 years ago
- PICABench: How Far Are We from Physically Realistic Image Editing? ☆36 · Nov 5, 2025 · Updated 5 months ago
- Acoustic Scene Classification using transfer learning on VGGish pre-trained model ☆11 · Jan 3, 2018 · Updated 8 years ago
- Object recognition with NAO using a deep learning model ☆16 · Sep 16, 2021 · Updated 4 years ago