(ICLR 2021) Learning to Represent Action Values as a Hypergraph on the Action Vertices
☆23Jun 22, 2021Updated 4 years ago
Alternatives and similar repositories for action-hypergraph-networks
Users that are interested in action-hypergraph-networks are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆13Jul 2, 2020Updated 5 years ago
- Official implementation of "Know Your Action Set: Learning Action Relations for Reinforcement Learning", Jain et al., ICLR 2022.☆18Mar 16, 2022Updated 4 years ago
- Code for the CoRL 2019 paper AC-Teach: A Bayesian Actor-Critic Method for Policy Learning with an Ensemble of Suboptimal Teachers☆24Feb 15, 2023Updated 3 years ago
- Code repo for Gradient Temporal-Difference Learning with Regularized Corrections paper.☆38Oct 14, 2020Updated 5 years ago
- Related papers for offline reforcement learning (we mainly focus on representation and sequence modeling and conventional offline RL)☆18Apr 21, 2022Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Multi-agent active perception with prediction rewards☆12Nov 13, 2020Updated 5 years ago
- Applications of reinforcement learning to Groebner basis computation.☆14Jun 13, 2021Updated 5 years ago
- Implementing the Learning with Opponent Learning Awareness paper (https://blog.openai.com/learning-to-model-other-minds/)☆19Jul 20, 2018Updated 7 years ago
- Aligntune : A Modular Toolkit for Post Training Alignment of LLMs☆36Jun 5, 2026Updated last week
- Code for paper "Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning" by David Janz*, Jiri Hron*, Przemys…☆21Feb 24, 2023Updated 3 years ago
- MADRL project solving chess environment using PPO with two different methods: 2 agents/networks and a single agent/network.☆23Apr 1, 2023Updated 3 years ago
- Filters CSV files of wind sites and generates parameters and features used in predicting wind power using NumPy in Python. Evaluates perf…☆10Aug 10, 2018Updated 7 years ago
- (AAAI 2018) Action Branching Architectures for Deep Reinforcement Learning☆122Feb 3, 2023Updated 3 years ago
- ☆12Jul 15, 2020Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Implementation code for GraphMIX: Graph Convolutional Value Decomposition in Multi-Agent Reinforcement Learning☆36Feb 13, 2021Updated 5 years ago
- [NeurIPS 2022] Leveraging Factored Action Spaces for Efficient Offline RL in Healthcare. https://arxiv.org/abs/2305.01738☆11Nov 27, 2022Updated 3 years ago
- Multi-view Reinforcement Learning☆11Feb 9, 2020Updated 6 years ago
- Codes for the paper "SIDE: State Inference for Partially Observable Cooperative Multi-Agent Reinforcement Learning"☆11Jun 24, 2022Updated 3 years ago
- Code for the paper: Hierarchical Reinforcement Learning With Timed Subgoals, published at NeurIPS 2021☆37Jul 6, 2022Updated 3 years ago
- TensorFlow implementation of "A Relational Intervention Approach for Unsupervised Dynamics Generalization in Model-Based Reinforcement Le…☆16Jul 2, 2022Updated 3 years ago
- Tools to construct canonical and regular vines. StarVine can also be used as a bivariate copula fitting tool.☆15Oct 19, 2020Updated 5 years ago
- Using GNN and DQN to find a baetter branching heuristic for a CDCL Solver☆54Oct 20, 2020Updated 5 years ago
- free5GC 5GC & UERANSIM UE / RAN Sample Configuration - Select nearby UPF according to the connected gNodeB☆11Mar 31, 2024Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Codes of GoMARL accompanying the paper "Automatic Grouping for Efficient Cooperative Multi-Agent Reinforcement Learning"(NeurIPS 2023). G…☆34Aug 14, 2024Updated last year
- source code of paper 'Auto-STGCN: Autonomous Spatial-Temporal Graph Convolutional Network Search Based on Reinforcement Learning and Exis…☆11Jan 26, 2021Updated 5 years ago
- The code for paper 'STAS: Spatial-Temporal Return Decomposition for Multi-agent Reinforcement Learning'☆17Oct 6, 2024Updated last year
- A simple lightweight utility script to use Shecan DNS servers temporarily on Linux.☆18May 24, 2023Updated 3 years ago
- ☆11Nov 9, 2020Updated 5 years ago
- MultiTask Environments for Reinforcement Learning.☆78Aug 18, 2022Updated 3 years ago
- A trivial raycaster using minifb for rendering/input☆14Jan 2, 2023Updated 3 years ago
- ETSI TS 103 097 v1.2.1 - v.1.2.5 library (outdated)☆12Mar 21, 2018Updated 8 years ago
- ☆16Apr 28, 2023Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Open source code for paper "Learning World Models with Identifiable Factorization"☆13Mar 4, 2024Updated 2 years ago
- ICML'19: How does Disagreement Help Generalization against Label Corruption?☆22Jun 30, 2019Updated 6 years ago
- PyTorch implementation for "On the Critical Role of Conventions in Adaptive Human-AI Collaboration", ICLR 2021☆15Mar 9, 2021Updated 5 years ago
- Code to reproduce results in the paper "Learning to Predict Navigational Patterns from Partial Observations" (RA-L 2023)☆12Jun 30, 2023Updated 2 years ago
- PPG (Point Process Generator) is a Reinforcement Learning framework that is able to produce actions by imitating expert sequences.☆14May 17, 2019Updated 7 years ago
- Gym wrapper for pysc2☆10Sep 16, 2022Updated 3 years ago
- Multi-task Multi-agent Soft Actor Critic for SMAC☆15Jan 18, 2022Updated 4 years ago