BlueDi / DeepDip
DeepDip, a DRL Gym agent that plays no-press Diplomacy in BANDANA
☆12 Updated 5 years ago
Related projects:
- A neurosymbolic T5 agent for playing text games, from the EACL 2023 paper "Behavior Cloned Transformers are Neurosymbolic Reasoners" ☆19 Updated last year
- Repo to reproduce the First-Explore paper results ☆36 Updated last year
- Documentation for dynamic machine learning systems. ☆26 Updated last week
- Framework for building algorithms based on FractalAI theory ☆17 Updated 3 years ago
- LLM Dynamic Planner - Combining LLM with PDDL Planners to solve an embodied task ☆33 Updated last week
- Evaluation of neuro-symbolic engines ☆29 Updated last month
- PyTorch implementation of the Gato paper from DeepMind ☆13 Updated last year
- ☆17 Updated 3 years ago
- A virtual environment for developing and evaluating automated scientific discovery agents. ☆23 Updated 3 months ago
- Neuro-Symbolic Reinforcement Learning: Logical Optimal Action (LOA), a novel RL with Logical Neural Network (LNN) on text-based games ☆36 Updated last year
- Hypothetical Minds is an autonomous LLM-based agent for diverse multi-agent settings, integrating a Theory of Mind module Theory of Mind … ☆16 Updated 2 months ago
- A web-based platform for collecting human actions in reinforcement learning environments ☆26 Updated last year
- ☆20 Updated last year
- Elevate your language models with insightful diversity metrics. ☆9 Updated 7 months ago
- Scalable Training of Propositional Logical Neural Networks. ☆11 Updated 2 years ago
- Co-evolution of agents and environments in GVG-AI ☆16 Updated 3 years ago
- Produce intelligence by means of natural selection without objective/reward optimization ☆13 Updated 2 years ago
- We develop world models that can be adapted with natural language. Integrating these models into artificial agents allows humans to effe… ☆17 Updated 7 months ago
- Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models ☆41 Updated 3 months ago
- An OpenAI gym environment to evaluate the ability of LLMs (e.g. GPT-4, Claude) in long-horizon reasoning and task planning in dynamic mult… ☆61 Updated last year
- ☆17 Updated last year
- The intermediate goal of the project is to train a GPT-like architecture to learn to summarise reddit posts from human preferences, as th… ☆12 Updated 3 years ago
- Abstract Reasoning with Graph Abstractions (ARGA) implementation ☆53 Updated 2 months ago
- ☆14 Updated 5 months ago
- LMQL implementation of tree of thoughts ☆33 Updated 7 months ago
- PyTorch implementation of OpenAI's Procgen ppo-baseline, built from scratch. ☆14 Updated 4 months ago
- CodeNav is an LLM agent that navigates and leverages previously unseen code repositories to solve user queries. ☆15 Updated 3 weeks ago
- Search through Facebook Research's PyTorch BigGraph Wikidata-dataset with the Weaviate vector search engine ☆31 Updated 2 years ago
- An OpenAI wrapper for PyReason to use in a Grid World reinforcement learning setting ☆22 Updated 8 months ago
- Official code for the paper "Context-Aware Language Modeling for Goal-Oriented Dialogue Systems" ☆34 Updated last year