(ICLR 2021) Learning to Represent Action Values as a Hypergraph on the Action Vertices
☆23Jun 22, 2021Updated 4 years ago
Alternatives and similar repositories for action-hypergraph-networks
Users that are interested in action-hypergraph-networks are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Contextual Bandits Action Elimination DQN☆21Jun 25, 2018Updated 7 years ago
- ☆23Nov 9, 2021Updated 4 years ago
- ☆13Jul 2, 2020Updated 5 years ago
- Official implementation of "Know Your Action Set: Learning Action Relations for Reinforcement Learning", Jain et al., ICLR 2022.☆18Mar 16, 2022Updated 4 years ago
- Code for the CoRL 2019 paper AC-Teach: A Bayesian Actor-Critic Method for Policy Learning with an Ensemble of Suboptimal Teachers☆24Feb 15, 2023Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Related papers for offline reforcement learning (we mainly focus on representation and sequence modeling and conventional offline RL)☆18Apr 21, 2022Updated 3 years ago
- Multi-agent active perception with prediction rewards☆11Nov 13, 2020Updated 5 years ago
- Applications of reinforcement learning to Groebner basis computation.☆14Jun 13, 2021Updated 4 years ago
- Implementing the Learning with Opponent Learning Awareness paper (https://blog.openai.com/learning-to-model-other-minds/)☆19Jul 20, 2018Updated 7 years ago
- Aligntune : A Modular Toolkit for Post Training Alignment of LLMs☆36Mar 23, 2026Updated 3 weeks ago
- Code for paper "Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning" by David Janz*, Jiri Hron*, Przemys…☆21Feb 24, 2023Updated 3 years ago
- MADRL project solving chess environment using PPO with two different methods: 2 agents/networks and a single agent/network.☆21Apr 1, 2023Updated 3 years ago
- Github Repo for CARL: Cautious Adaptation for RL in Safety Critical Settings☆14Nov 22, 2022Updated 3 years ago
- Repository for "Known Unknowns: Uncertainty Quality in Bayesian Neural Networks" paper.☆12Mar 3, 2017Updated 9 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- (AAAI 2018) Action Branching Architectures for Deep Reinforcement Learning☆121Feb 3, 2023Updated 3 years ago
- TapNet: Multivariate Time Series Classification withAttentional Prototypical Network☆11Dec 22, 2019Updated 6 years ago
- An implementation of Deep Q-Learning from Demonstrations (DQfD) for playing Atari 2600 video games☆31Dec 10, 2022Updated 3 years ago
- ☆12Jul 15, 2020Updated 5 years ago
- Tensorflow Implementation for "Pre-trained Deep Convolution Neural Network Model With Attention for Speech Emotion Recognition"☆10Dec 19, 2021Updated 4 years ago
- Multi-view Reinforcement Learning☆11Feb 9, 2020Updated 6 years ago
- Codes for the paper "SIDE: State Inference for Partially Observable Cooperative Multi-Agent Reinforcement Learning"☆10Jun 24, 2022Updated 3 years ago
- TensorFlow implementation of "A Relational Intervention Approach for Unsupervised Dynamics Generalization in Model-Based Reinforcement Le…☆16Jul 2, 2022Updated 3 years ago
- Tools to construct canonical and regular vines. StarVine can also be used as a bivariate copula fitting tool.☆15Oct 19, 2020Updated 5 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Using GNN and DQN to find a baetter branching heuristic for a CDCL Solver☆54Oct 20, 2020Updated 5 years ago
- Reinforcement learning - Batched Impala - PyTorch - Mario Kart☆13Jul 21, 2020Updated 5 years ago
- ☆14Aug 9, 2023Updated 2 years ago
- OpenRAN Gym website☆13Mar 30, 2026Updated 2 weeks ago
- source code of paper 'Auto-STGCN: Autonomous Spatial-Temporal Graph Convolutional Network Search Based on Reinforcement Learning and Exis…☆11Jan 26, 2021Updated 5 years ago
- Partial port of BoofCV to C++ to accelerate specific operations☆19Feb 21, 2022Updated 4 years ago
- gabor filter bank, sift and bag of visual words implementation☆11Jul 20, 2019Updated 6 years ago
- A simple lightweight utility script to use Shecan DNS servers temporarily on Linux.☆18May 24, 2023Updated 2 years ago
- Scaling Pareto-Efficient Decision Making via Offline Multi-Objective RL, published in ICLR 2023☆34Dec 7, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆12Nov 9, 2020Updated 5 years ago
- A toy example of meta learning on mnist☆17Mar 14, 2019Updated 7 years ago
- MultiTask Environments for Reinforcement Learning.☆79Aug 18, 2022Updated 3 years ago
- ☆16Apr 28, 2023Updated 2 years ago
- Code to reproduce results in the paper "Learning to Predict Navigational Patterns from Partial Observations" (RA-L 2023)☆12Jun 30, 2023Updated 2 years ago
- PyTorch implementation for "On the Critical Role of Conventions in Adaptive Human-AI Collaboration", ICLR 2021☆15Mar 9, 2021Updated 5 years ago
- Gym wrapper for pysc2☆10Sep 16, 2022Updated 3 years ago