reproduce some RL or Multi-Agent models
☆35May 22, 2019Updated 6 years ago
Alternatives and similar repositories for ModelRepo
Users that are interested in ModelRepo are comparing it to the libraries listed below.
- an implementation of CommNet☆35Nov 14, 2017Updated 8 years ago
- ☆33Nov 21, 2022Updated 3 years ago
- FEN Code☆41Nov 4, 2019Updated 6 years ago
- Implementing different learning algorithms and analyzing their performance in a Markov game model called the Soccer Game☆23Jan 29, 2023Updated 3 years ago
- Rethinking Graph Regularization for Graph Neural Networks (AAAI2021)☆34Jun 6, 2021Updated 4 years ago
- Project 1 of Udacity's Deep Reinforcement Learning nanodegree program☆13Dec 2, 2018Updated 7 years ago
- A Toolkit for Mining Data in a Structural Fashion☆10Oct 16, 2022Updated 3 years ago
- EDGE: Scalable and optimum mutual information estimator for high-dimensional applications including deep learning☆39May 27, 2022Updated 3 years ago
- Extending rllab to event-driven multiagent environments☆13Oct 1, 2018Updated 7 years ago
- PyTorch RL for Pommerman☆38Sep 24, 2018Updated 7 years ago
- A collection of research and survey papers of hierarchical reinforcement learning (HRL).☆54Apr 27, 2020Updated 5 years ago
- Soft Actor-Critic implementation with SOTA model-free extension (REDQ) and SOTA model-based extension (MBPO).☆15Feb 21, 2021Updated 5 years ago
- Objective Quality-of-Experience Model Benchmark☆26Feb 26, 2020Updated 6 years ago
- Implementation of MAML in numpy, deriving gradients and implementing backprop manually☆14Nov 15, 2018Updated 7 years ago
- Deep Reinforcement Learning applied to trading☆15Jan 29, 2019Updated 7 years ago
- Mixed Integer Quadratic Programming for Python (using MINLP-solver Bonmin)☆14Mar 12, 2018Updated 8 years ago
- The code for the ACL 2017 paper "Scalable Bayesian Learning of Recurrent Neural Networks for Language Modeling"☆29Apr 30, 2017Updated 8 years ago
- (T-ASE 2024) LeTO: Learning Constrained Visuomotor Policy with Differentiable Trajectory Optimization☆14Oct 14, 2024Updated last year
- A collection of papers on reinforcement learning applied to NLP☆14Sep 7, 2018Updated 7 years ago
- (TG'2023) Official code for the paper "Revisiting of AlphaStar" (previously called "Rethinking of AlphaStar"). It compares the raw interf…☆10Sep 6, 2021Updated 4 years ago
- ☆14Jun 26, 2019Updated 6 years ago
- Multi-Agent Reinforcement Learning with Stable-Baselines3☆20Dec 3, 2021Updated 4 years ago
- ☆29Apr 13, 2019Updated 6 years ago
- Reward Estimation for Variance Reduction in Deep Reinforcement Learning☆10May 8, 2018Updated 7 years ago
- Variance Reduction for Reinforcement Learning in Input-Driven Environments (ICLR '19)☆31May 6, 2019Updated 6 years ago
- A Pytorch Implementation of Multi Agent Soft Actor Critic☆43Jan 29, 2019Updated 7 years ago
- Use Neural Network as a trading strategy model☆29Aug 15, 2017Updated 8 years ago
- A demo to show how to convert a TensorFlow model to TensorRT uff or PLAN☆11Jul 22, 2018Updated 7 years ago
- Sensor based Mission Planning for single robot using greedy based genetic algorithm☆11Apr 1, 2019Updated 6 years ago
- Learning to Communicate with Deep Multi-Agent Reinforcement Learning☆446Feb 21, 2019Updated 7 years ago
- Community Regularization of Visually Grounded Dialog https://arxiv.org/abs/1808.04359☆15May 16, 2019Updated 6 years ago
- Deep Deterministic Policy Gradients in TF r2.0☆13Feb 6, 2020Updated 6 years ago
- An attempt to apply reinforcement learning to graph signal recovery problem☆11Aug 25, 2021Updated 4 years ago
- Visual Transition State Clustering☆13Jan 6, 2018Updated 8 years ago
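- Ruankao (China's software qualification exam) review materials for passing the System Architecture Designer (senior level) exam, including 2009-2018 past papers on comprehensive knowledge and case analysis with detailed answers, plus textbooks and a complete set of teaching videos. The author is currently updating the 2018 materials.☆12Jan 26, 2024Updated 2 years ago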
- ☆10Nov 4, 2019Updated 6 years ago
- Contextual Combinatorial Cascading Bandits☆10Jun 30, 2016Updated 9 years ago
- 6-DoF wheeled biped robot☆18Jan 19, 2022Updated 4 years ago
- A pytorch implementation of MADDPG (multi-agent deep deterministic policy gradient)☆692Jun 5, 2018Updated 7 years ago