CSKrishna/Optimal-bidding-policy-using-Policy-Gradient-in-a-Multi-agent-Contextual-Bandit-setting

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/CSKrishna/Optimal-bidding-policy-using-Policy-Gradient-in-a-Multi-agent-Contextual-Bandit-setting)

CSKrishna / Optimal-bidding-policy-using-Policy-Gradient-in-a-Multi-agent-Contextual-Bandit-setting

We use policy gradient to help agents learn optimal policies in a competitive multi-agent contextual bandit setting

☆12

Alternatives and similar repositories for Optimal-bidding-policy-using-Policy-Gradient-in-a-Multi-agent-Contextual-Bandit-setting

Users that are interested in Optimal-bidding-policy-using-Policy-Gradient-in-a-Multi-agent-Contextual-Bandit-setting are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

CSKrishna / Tree_LSTMs-for-Aspect-based-Sentiment-Analysis
View on GitHub
Tree Structured LSTM model for sentence level aspect based sentiment analysis
☆37Aug 16, 2017Updated 8 years ago
AmrinderRai / MARL-Cooperative-Path-Planning
View on GitHub
Multi-Agent Reinforcement Learning for Path Planning
☆15Jan 8, 2022Updated 4 years ago
NotAnyMike / RL-Football
View on GitHub
Testing different RL algorithms for multi-agent environments. From SARSA, QLearning to Independent Q-Learning, Joint Action Learning and …
☆12Mar 29, 2019Updated 7 years ago
Zhang-xie / CSI-HAR-dataset-survey
View on GitHub
A Survey on Wi-Fi Channel State Information Datasets for Human Activity Recognition
☆14Aug 3, 2022Updated 3 years ago
MahanFathi / TRPO-TensorFlow
View on GitHub
Trust Region Policy Optimization (TRPO) in pure TensorFlow
☆18Jun 7, 2018Updated 8 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
yjpark1 / competitiveMARL
View on GitHub
multi-agent reinforcement learning for competitive environments using pytorch
☆14Dec 31, 2019Updated 6 years ago
JenAlchimowicz / Multi-Agent-Reinforcement-Learning-simulating-collaboration-with-VDN-and-IQL
View on GitHub
Project explores collaboration capabilities of VDN and IQL agents on a custom MARL Food Collector environment
☆11Apr 6, 2022Updated 4 years ago
AppliedDynamicMechanics / Emergency-evacuation-Deep-reinforcement-learning
View on GitHub
Deep reinforcement learning with a particle dynamics environment applied to emergency evacuation of a room with obstacles
☆10Mar 6, 2026Updated 4 months ago
liumingrui814 / Multi-Agents-sorting
View on GitHub
Multi-agent-path-planning by Python,with 4 entrances, 4 target and 8 AGVs
☆27Jun 23, 2022Updated 4 years ago
CSKrishna / Recommender-Systems-for-Implicit-Feedback-datasets
View on GitHub
Matrix Factorization augmented with customer item meta data
☆22Nov 2, 2017Updated 8 years ago
Elucidation / mapf-multiagent-robot-planning
View on GitHub
Multi-Agent PathFinding (MAPF) for 2D Robots moving inventory on a grid - Practice building environment + robots + planning + inventory m…
☆16Nov 20, 2023Updated 2 years ago
xiaorancs / feature-select
View on GitHub
featselector是一个基于统计分析和模型选择的特征选择器.
☆14Mar 4, 2019Updated 7 years ago
godisreal / Evac-Network-Flow
View on GitHub
A Novel Network-Flow Model for Building Evacuation: Route Choices of Evacuees are Modeled with Herding Effect
☆11Sep 6, 2024Updated last year
SadAngelF / Distributed-Dueling-DQN
View on GitHub
Here is our algorithm for Pursuit Problem based on the Distributed Reinforcement Learning for Cooperative Multi-robot Pursuit
☆10Apr 17, 2019Updated 7 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
jk96491 / C-COMA
View on GitHub
Continual Multi-agent Reinforcement Learning in Dynamic Environments
☆11Jul 1, 2021Updated 5 years ago
christinakouridi / multiagent_gym
View on GitHub
Adaptation of DQN, DDQN and COMA for multi-agent Gym environments
☆10Oct 3, 2023Updated 2 years ago
sisl / PyroRL
View on GitHub
An RL environment made for wildfire evacuation.
☆18Apr 2, 2025Updated last year
restorenode / mappo-competitive-reinforcement
View on GitHub
🎾 Multi-Agent Proximal Policy Optimization approach to a competitive reinforcement learning problem
☆22Sep 25, 2022Updated 3 years ago
ZiyuanMa / MAPF_RL
View on GitHub
multi-agent pathfinding via dqn
☆16May 19, 2021Updated 5 years ago
xiaocao1991 / MADDPG-AUV
View on GitHub
This is a MADDPG algorithm to be used on particle environment styles. I use it to test my own scenarios for underwater target localizatio…
☆18Jun 23, 2021Updated 5 years ago
dennisushi / Multi-Agent-RL-DQN
View on GitHub
Deep Q Network for Multi-agent RL
☆15Oct 18, 2020Updated 5 years ago
AU-Master-Thesis / magics
View on GitHub
Master Thesis Project in Computer Engineering at Aarhus University 2024 on "Simulating Multi-agent Path Planning in Complex environments …
☆18Oct 12, 2025Updated 9 months ago
solmp / VideoMatting
View on GitHub
Windows 💻 RobustVideoMatting with ONNXRuntime/MNN/TNN C++/Python
☆12Mar 10, 2022Updated 4 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
belaalb / CEVAE-VampPrior
View on GitHub
CEVAE with VampPrior
☆11Jul 18, 2018Updated 8 years ago
CarloLonghi / PSO-PathPlanning
View on GitHub
Particle Swar Optimization algorithm applied to a path planning task
☆14Aug 11, 2021Updated 4 years ago
david-simoes-93 / Multi-agent-Double-Deep-Q-Networks
View on GitHub
A multi-agent version of the Double DQN algorithm, with Foraging Task and Pursuit Game test scenarios
☆12Apr 24, 2017Updated 9 years ago
pplonski / gafe
View on GitHub
Genetic Algorithm Feature Engineering
☆15Oct 3, 2017Updated 8 years ago
scullion / lrfu
View on GitHub
A simple implementation of the LRFU cache eviction policy in Python.
☆10Feb 1, 2015Updated 11 years ago
Aaricis / BioMARL
View on GitHub
基于生物启发式算法的多智能体强化学习算法
☆15Apr 14, 2021Updated 5 years ago
redapt / pyspark-s3-parquet-example
View on GitHub
This repo demonstrates how to load a sample Parquet formatted file from an AWS S3 Bucket. A python job will then be submitted to a Apach…
☆19Jun 23, 2016Updated 10 years ago
julia-bel / MAPF_G2RL
View on GitHub
Implementation of the G2RL approach in the POGEMA environment
☆15Jun 5, 2024Updated 2 years ago
PathPlanning / ManipulationPlanning-SI-RRT
View on GitHub
Combination of Rapidly-Exporing Random Trees (RRT) and Safe Interval Path Planning (SIPP) for high-DOF planning in dynamic environments,…
☆19May 17, 2026Updated 2 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
tsdataclinic / open-data-week
View on GitHub
This data analysis provided information for the March 6th, 2018, NYC Open Data Week event hosted by the Two Sigma Data Clinic, "The State…
☆13Jan 9, 2025Updated last year
cs224 / pybnl
View on GitHub
python interface to bnlearn and other probabilistic graphical model libraries
☆10Mar 26, 2020Updated 6 years ago
purpleleaves007 / WiGRUNT
View on GitHub
☆25Aug 5, 2024Updated last year
Abluceli / Multi-agent-Reinforcement-Learning-Algorithms
View on GitHub
Multi-agent Reinforcement Learning Algorithms(COMA, VDN, QMIX)
☆16May 24, 2020Updated 6 years ago
luogantt / onnxruntime_cpp_demo
View on GitHub
☆12Mar 2, 2022Updated 4 years ago
Whu-wxy / Enhanced_Qt-Opencv-DNN
View on GitHub
Deploy SSD object detector with opencv+Qt, it works on windows and android.
☆10Mar 3, 2019Updated 7 years ago
XD1227 / MultiExit-Rainbow
View on GitHub
Multi-Exit Evacuation simulation; Rainbow DQN application
☆17Sep 4, 2020Updated 5 years ago