CSKrishna / Optimal-bidding-policy-using-Policy-Gradient-in-a-Multi-agent-Contextual-Bandit-setting
We use policy gradient to help agents learn optimal policies in a competitive multi-agent contextual bandit setting
☆12Mar 9, 2018Updated 7 years ago
Alternatives and similar repositories for Optimal-bidding-policy-using-Policy-Gradient-in-a-Multi-agent-Contextual-Bandit-setting
Users who are interested in Optimal-bidding-policy-using-Policy-Gradient-in-a-Multi-agent-Contextual-Bandit-setting are comparing it to the libraries listed below
- Multi-Agent Reinforcement Learning for Path Planning☆15Jan 8, 2022Updated 4 years ago
- Multi-agent path planning in Python, with 4 entrances, 4 targets, and 8 AGVs☆27Jun 23, 2022Updated 3 years ago
- featselector is a feature selector based on statistical analysis and model selection.☆14Mar 4, 2019Updated 6 years ago
- Project explores collaboration capabilities of VDN and IQL agents on a custom MARL Food Collector environment☆11Apr 6, 2022Updated 3 years ago
- Implementation of the G2RL approach in the POGEMA environment☆13Jun 5, 2024Updated last year
- Tree Structured LSTM model for sentence level aspect based sentiment analysis☆36Aug 16, 2017Updated 8 years ago
- A collection of tools that help me work with Avro☆24Jan 7, 2010Updated 16 years ago
- My accepted! proposal for the unitary fund quantum computing grant. Also accepted as an abstract to a conference. See github.com/LSaldyt/…☆12Jan 12, 2019Updated 7 years ago
- Solutions to the book "Collection of Data Science TakeHome Challenges" in Python.☆10Nov 15, 2017Updated 8 years ago
- My Data Engineering project @ Insight Data Science☆10Jul 23, 2018Updated 7 years ago
- "Adaptive Cruise Control for a Hybrid Vehicle with Deep Policy Gradients". Final project for ECE 517/414 Reinforcement Learning.☆13Dec 8, 2021Updated 4 years ago
- Multi-Objective Reinforcement Learning sandbox☆12Dec 20, 2021Updated 4 years ago
- python interface to bnlearn and other probabilistic graphical model libraries☆10Mar 26, 2020Updated 5 years ago
- A demo project on how to connect Materialize and Streamlit (using Redpanda & FastAPI)☆11Apr 18, 2022Updated 3 years ago
- Website for the ECC 2022 paper titled "Learning Eco-Driving Strategies at Signalized Intersections"☆13May 30, 2023Updated 2 years ago
- Randomized Linear Algebra in Python☆13Mar 21, 2017Updated 8 years ago
- Raw data for numerical experiments exploring integer factorization on NISQ devices☆10Aug 30, 2018Updated 7 years ago
- Getting started with PyQuil: Quantum dice example☆14Jul 10, 2018Updated 7 years ago
- Implementation of "Reinforcement Learning in Possibly Nonstationary Environments"☆10Mar 10, 2025Updated 11 months ago
- ☆15Jan 21, 2022Updated 4 years ago
- Combination of Rapidly-Exploring Random Trees (RRT) and Safe Interval Path Planning (SIPP) for high-DOF planning in dynamic environments,…☆17Oct 3, 2025Updated 4 months ago
- Mixtures of Gaussian Process Experts in GPflow/TensorFlow☆12Aug 1, 2022Updated 3 years ago
- Simulation and testing of a torque vectoring controller for a 4WD electric vehicle☆10May 9, 2019Updated 6 years ago
- Code and example associated with the paper 'Model Predictive Control of Nonlinear Latent Force Models: A Scenario-based Approach' by T. W…☆11Jun 22, 2022Updated 3 years ago
- CEVAE with VampPrior☆11Jul 18, 2018Updated 7 years ago
- A Novel Network-Flow Model for Building Evacuation: Route Choices of Evacuees are Modeled with Herding Effect☆11Sep 6, 2024Updated last year
- Heterogeneous capacitated vehicle routing problem☆11Aug 4, 2018Updated 7 years ago
- Code for Deep Structured Mixtures of Gaussian Processes (DSMGPs)☆11Jan 27, 2022Updated 4 years ago
- A python module that provides access to the Kaiko Bittrex Historical trade data☆10Aug 20, 2017Updated 8 years ago
- our secret vision for universal finance☆12Sep 22, 2020Updated 5 years ago
- ☆10May 10, 2017Updated 8 years ago
- Coalesce 2022 Python models demo with Databricks. Not actively maintained.☆13Dec 4, 2024Updated last year
- An AI for Hearthstone using deep reinforcement learning☆10Oct 6, 2017Updated 8 years ago
- Causal Feature Selection Tutorial for AMIA2018☆12Nov 3, 2018Updated 7 years ago
- Freddie Mac Single Loan Data Analysis & Machine Learning (Regression / Classification)☆12Jun 11, 2017Updated 8 years ago
- Deploy an SSD object detector with OpenCV+Qt; it works on Windows and Android.☆10Mar 3, 2019Updated 6 years ago
- Master Thesis Project in Computer Engineering at Aarhus University 2024 on "Simulating Multi-agent Path Planning in Complex environments …☆16Oct 12, 2025Updated 4 months ago
- This data analysis provided information for the March 6th, 2018, NYC Open Data Week event hosted by the Two Sigma Data Clinic, "The State…☆13Jan 9, 2025Updated last year
- An attempt to apply reinforcement learning to graph signal recovery problem☆11Aug 25, 2021Updated 4 years ago