(ICLR 2021) Learning to Represent Action Values as a Hypergraph on the Action Vertices
☆23Jun 22, 2021Updated 5 years ago
Alternatives and similar repositories for action-hypergraph-networks
Users that are interested in action-hypergraph-networks are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Contextual Bandits Action Elimination DQN☆21Jun 25, 2018Updated 8 years ago
- ☆23Nov 9, 2021Updated 4 years ago
- ☆13Jul 2, 2020Updated 6 years ago
- Official implementation of "Know Your Action Set: Learning Action Relations for Reinforcement Learning", Jain et al., ICLR 2022.☆18Mar 16, 2022Updated 4 years ago
- Code for the CoRL 2019 paper AC-Teach: A Bayesian Actor-Critic Method for Policy Learning with an Ensemble of Suboptimal Teachers☆24Feb 15, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Related papers for offline reforcement learning (we mainly focus on representation and sequence modeling and conventional offline RL)☆18Apr 21, 2022Updated 4 years ago
- Multi-agent active perception with prediction rewards☆12Nov 13, 2020Updated 5 years ago
- Applications of reinforcement learning to Groebner basis computation.☆14Jun 13, 2021Updated 5 years ago
- Implementing the Learning with Opponent Learning Awareness paper (https://blog.openai.com/learning-to-model-other-minds/)☆19Jul 20, 2018Updated 7 years ago
- Author's PyTorch Implementation of Deep Homomorphic Policy Gradient (DHPG) - NeurIPS 2022 and JMLR 2024☆24Apr 8, 2024Updated 2 years ago
- Aligntune : A Modular Toolkit for Post Training Alignment of LLMs☆37Jun 26, 2026Updated last week
- Code for paper "Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning" by David Janz*, Jiri Hron*, Przemys…☆21Feb 24, 2023Updated 3 years ago
- Github Repo for CARL: Cautious Adaptation for RL in Safety Critical Settings☆14Nov 22, 2022Updated 3 years ago
- [TMC’23] Preemptive Migration Prediction Network for Proactive Fault Tolerant Edge Computing☆11Sep 25, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- (AAAI 2018) Action Branching Architectures for Deep Reinforcement Learning☆122Feb 3, 2023Updated 3 years ago
- ☆12Jul 15, 2020Updated 5 years ago
- Never fill a sockaddr_in struct by hand again!☆13Apr 10, 2020Updated 6 years ago
- Implementation code for GraphMIX: Graph Convolutional Value Decomposition in Multi-Agent Reinforcement Learning☆36Feb 13, 2021Updated 5 years ago
- [NeurIPS 2022] Leveraging Factored Action Spaces for Efficient Offline RL in Healthcare. https://arxiv.org/abs/2305.01738☆11Nov 27, 2022Updated 3 years ago
- Multi-view Reinforcement Learning☆11Feb 9, 2020Updated 6 years ago
- Codes for the paper "SIDE: State Inference for Partially Observable Cooperative Multi-Agent Reinforcement Learning"☆11Jun 24, 2022Updated 4 years ago
- Code for the paper: Hierarchical Reinforcement Learning With Timed Subgoals, published at NeurIPS 2021☆37Jul 6, 2022Updated 3 years ago
- TensorFlow implementation of "A Relational Intervention Approach for Unsupervised Dynamics Generalization in Model-Based Reinforcement Le…☆16Jul 2, 2022Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- v2ray with GUI interface☆11Feb 1, 2023Updated 3 years ago
- Using GNN and DQN to find a baetter branching heuristic for a CDCL Solver☆54Oct 20, 2020Updated 5 years ago
- free5GC 5GC & UERANSIM UE / RAN Sample Configuration - Select nearby UPF according to the connected gNodeB☆11Mar 31, 2024Updated 2 years ago
- Reinforcement learning - Batched Impala - PyTorch - Mario Kart☆13Jul 21, 2020Updated 5 years ago
- Codes of GoMARL accompanying the paper "Automatic Grouping for Efficient Cooperative Multi-Agent Reinforcement Learning"(NeurIPS 2023). G…☆34Aug 14, 2024Updated last year
- OpenFlow protocol endpoint written in C++☆10Jun 19, 2026Updated 2 weeks ago
- source code of paper 'Auto-STGCN: Autonomous Spatial-Temporal Graph Convolutional Network Search Based on Reinforcement Learning and Exis…☆11Jan 26, 2021Updated 5 years ago
- Co-Adaptation of Algorithmic and Implementational Innovations in Inference-based Deep Reinforcement Learning (NeurIPS2021)☆20Oct 25, 2021Updated 4 years ago
- The code for paper 'STAS: Spatial-Temporal Return Decomposition for Multi-agent Reinforcement Learning'☆17Oct 6, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- OpenRAN Gym website☆14Jun 3, 2026Updated last month
- MultiTask Environments for Reinforcement Learning.☆78Aug 18, 2022Updated 3 years ago
- A trivial raycaster using minifb for rendering/input☆14Jan 2, 2023Updated 3 years ago
- Materials for my PyData Boston 2013 talk☆16Sep 26, 2013Updated 12 years ago
- ETSI TS 103 097 v1.2.1 - v.1.2.5 library (outdated)☆12Mar 21, 2018Updated 8 years ago
- ☆16Apr 28, 2023Updated 3 years ago
- PyTorch implementation for "On the Critical Role of Conventions in Adaptive Human-AI Collaboration", ICLR 2021☆15Mar 9, 2021Updated 5 years ago