CSKrishna / Optimal-bidding-policy-using-Policy-Gradient-in-a-Multi-agent-Contextual-Bandit-settingView external linksLinks
We use policy gradient to help agents learn optimal policies in a competitive multi-agent contextual bandit setting
☆12Mar 9, 2018Updated 7 years ago
Alternatives and similar repositories for Optimal-bidding-policy-using-Policy-Gradient-in-a-Multi-agent-Contextual-Bandit-setting
Users that are interested in Optimal-bidding-policy-using-Policy-Gradient-in-a-Multi-agent-Contextual-Bandit-setting are comparing it to the libraries listed below
Sorting:
- Multi-Agent Reinforcement Learning for Path Planning☆15Jan 8, 2022Updated 4 years ago
- Multi-agent-path-planning by Python,with 4 entrances, 4 target and 8 AGVs☆27Jun 23, 2022Updated 3 years ago
- featselector是一个基于统计分析和模型选择的特征选择器.☆14Mar 4, 2019Updated 6 years ago
- Project explores collaboration capabilities of VDN and IQL agents on a custom MARL Food Collector environment☆11Apr 6, 2022Updated 3 years ago
- Implementation of the G2RL approach in the POGEMA environment☆13Jun 5, 2024Updated last year
- Tree Structured LSTM model for sentence level aspect based sentiment analysis☆36Aug 16, 2017Updated 8 years ago
- Multi-Objective Reinforcement Learning sandbox☆12Dec 20, 2021Updated 4 years ago
- Raw data for numerical experiments exploring integer factorization on NISQ devices☆10Aug 30, 2018Updated 7 years ago
- Getting started with PyQuil: Quantum dice example☆14Jul 10, 2018Updated 7 years ago
- Website for the ECC 2022 paper titled "Learning Eco-Driving Strategies at Signalized Intersections"☆13May 30, 2023Updated 2 years ago
- My accepted! proposal for the unitary fund quantum computing grant. Also accepted as an abstract to a conference. See github.com/LSaldyt/…☆12Jan 12, 2019Updated 7 years ago
- Randomized Linear Algebra in Python☆13Mar 21, 2017Updated 8 years ago
- python interface to bnlearn and other probabilistic graphical model libraries☆10Mar 26, 2020Updated 5 years ago
- Implementation of "Reinforcement Learning in Possibly Nonstationary Environments"☆10Mar 10, 2025Updated 11 months ago
- A demo project on how to connect Materialize and Streamlit (using Redpanda & FastAPI)☆11Apr 18, 2022Updated 3 years ago
- Solutions to the book "Collection of Data Science TakeHome Challenges" in Python.☆10Nov 15, 2017Updated 8 years ago
- My Data Engineering project @ Insight Data Science☆10Jul 23, 2018Updated 7 years ago
- "Adaptive Cruise Control for a Hybrid Vehicle with Deep Policy Gradients". Final project for ECE 517/414 Reinforcement Learning.☆13Dec 8, 2021Updated 4 years ago
- A collection of tools that help me work with Avro☆24Jan 7, 2010Updated 16 years ago
- Quantum Connect Four☆15Dec 5, 2022Updated 3 years ago
- Master Thesis Project in Computer Engineering at Aarhus University 2024 on "Simulating Multi-agent Path Planning in Complex environments …☆16Oct 12, 2025Updated 4 months ago
- This data analysis provided information for the March 6th, 2018, NYC Open Data Week event hosted by the Two Sigma Data Clinic, "The State…☆13Jan 9, 2025Updated last year
- A python module that provides access to the Kaiko Bittrex Historical trade data☆10Aug 20, 2017Updated 8 years ago
- Offline RL algoritms implemented in Stable Baselines3 (pytorch)☆10Dec 7, 2021Updated 4 years ago
- ☆15Jan 21, 2022Updated 4 years ago
- Deploy SSD object detector with opencv+Qt, it works on windows and android.☆10Mar 3, 2019Updated 6 years ago
- Heterogeneous capacitated vehicle routing problem☆11Aug 4, 2018Updated 7 years ago
- Python package for Simulink-based reinforcement learning environments.☆11Aug 20, 2021Updated 4 years ago
- An AI for Hearthstone using deep reinforcement learning☆10Oct 6, 2017Updated 8 years ago
- The Modern Data Stack in a (Smaller) Box☆12Jan 28, 2023Updated 3 years ago
- CEVAE with VampPrior☆11Jul 18, 2018Updated 7 years ago
- Causal Feature Selection Tutorial for AMIA2018☆12Nov 3, 2018Updated 7 years ago
- ☆10May 10, 2017Updated 8 years ago
- A simple implementation of the LRFU cache eviction policy in Python.☆10Feb 1, 2015Updated 11 years ago
- An attempt to apply reinforcement learning to graph signal recovery problem☆11Aug 25, 2021Updated 4 years ago
- Simulation and testing of a torque vectoring controller for a 4WD electric vehicle☆10May 9, 2019Updated 6 years ago
- Code and example associated with the paper 'Model Predictive Control of Nonlinear Latent Force Models: A Scenario-based Approach' by T. W…☆11Jun 22, 2022Updated 3 years ago
- our secret vision for universal finance☆12Sep 22, 2020Updated 5 years ago
- Freddie Mac Single Loan Data Analysis & Machine Learning (Regression / Classification)☆12Jun 11, 2017Updated 8 years ago