diogohmcruz / DeepDipLinks
DeepDip, a DRL Gym agent that plays no-press Diplomacy in BANDANA
☆13Updated 6 years ago
Alternatives and similar repositories for DeepDip
Users that are interested in DeepDip are comparing it to the libraries listed below
Sorting:
- Neuro-Symbolic Reinforcement Learning: Logical Optimal Action (LOA), a novel RL with Logical Neural Network (LNN) on text-based games☆47Updated 3 months ago
- ☆57Updated last year
- Scalable Training of Propositional Logical Neural Networks.☆13Updated 3 years ago
- Pytorch implementation of the Gato paper from Deepmind☆12Updated 2 years ago
- Repo to reproduce the First-Explore paper results☆38Updated 7 months ago
- A toy poker simulator with a pluggable Player interface to implement Agents that play using both rules based strategies and llms.☆10Updated 2 years ago
- rock paper scissors game using Double-DQN against different random generator☆17Updated 6 years ago
- An OpenAI gym environment to evaluate the ability of LLMs (eg. GPT-4, Claude) in long-horizon reasoning and task planning in dynamic mult…☆69Updated 2 years ago
- we're building an AI to play the board game Diplomacy!☆32Updated 3 years ago
- Documentation for dynamic machine learning systems.☆29Updated 11 months ago
- Logic Reinforcement Learning☆16Updated last year
- ☆35Updated 2 years ago
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset☆17Updated last year
- Diplomacy: DATC-Compliant Game Engine with Web Interface☆153Updated last year
- A neurosymbolic T5 agent for playing text games, from the EACL 2023 paper "Behavior Cloned Transformers are Neurosymbolic Reasoners"☆19Updated 2 years ago
- We propose an evolution-based approach to meta-learn synthetic neural environments and reward neural networks for reinforcement learning.☆21Updated 2 years ago
- ☆27Updated last year
- Explore and Control with Adversarial Surprise☆10Updated 4 years ago
- Co-evolution of agents and environments in GVG-AI☆17Updated 4 years ago
- This repository is to develop novel AIs for complex C2 decision making. It consists of parallel branches for GUI and for AI development (…☆29Updated 2 years ago
- Neuro-Symbolic Visual Question Answering on Sort-of-CLEVR using PyTorch☆57Updated 3 years ago
- ☆62Updated 9 months ago
- Badger code samples☆28Updated 5 years ago
- Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models☆61Updated 5 months ago
- Code for Unifying Gradient Estimators for Meta-Reinforcement Learning via Off-Policy Evaluation @ NeurIPS 2021☆12Updated 3 years ago
- A web based platform for collecting human actions in reinforcement learning environments☆30Updated last year
- Hypothetical Minds is an autonomous LLM-based agent for diverse multi-agent settings, integrating a Theory of Mind module Theory of Mind …☆33Updated last year
- Evaluating different engineering tricks that make RL work☆15Updated 4 years ago
- The Medkit-Learn(ing) Environment: Medical Decision Modelling through Simulation (NeurIPS 2021) by Alex J. Chan, Ioana Bica, Alihan Huyuk…☆29Updated 3 years ago
- LLM Dynamic Planner - Combining LLM with PDDL Planners to solve an embodied task☆45Updated 7 months ago