diogohmcruz / DeepDipLinks
DeepDip, a DRL Gym agent that plays no-press Diplomacy in BANDANA
☆13Updated 5 years ago
Alternatives and similar repositories for DeepDip
Users that are interested in DeepDip are comparing it to the libraries listed below
Sorting:
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset☆16Updated last year
- Repo to reproduce the First-Explore paper results☆37Updated 5 months ago
- A Python reimplementation of "Planning with Large Language Models for Code Generation" (https://arxiv.org/abs/2303.05510)☆17Updated last year
- Pytorch implementation of the Gato paper from Deepmind☆11Updated 2 years ago
- Flexible, efficient, and context-aware generation from large unstructured knowledge sources.☆16Updated last year
- Framework for building algorithms based on FractalAI theory☆19Updated 4 years ago
- a socketteer/loom reimplementation in obsidian☆12Updated last year
- An OpenAI wrapper for PyReason to use in a Grid World reinforcement learning setting☆31Updated last year
- A neurosymbolic T5 agent for playing text games, from the EACL 2023 paper "Behavior Cloned Transformers are Neurosymbolic Reasoners"☆19Updated 2 years ago
- Documentation for dynamic machine learning systems.☆29Updated 8 months ago
- Approximating the joint distribution of language models via MCTS☆21Updated 7 months ago
- Neuro-Symbolic Reinforcement Learning: Logical Optimal Action (LOA), a novel RL with Logical Neural Network (LNN) on text-based games☆46Updated 3 weeks ago
- LMQL implementation of tree of thoughts☆34Updated last year
- A repository of projects and datasets under active development by Alignment Lab AI☆22Updated last year
- Explore the use of DSPy for extracting features from PDFs 🔎☆40Updated last year
- ☆19Updated last week
- A repository re-creating the PromptBreeder Evolutionary Algorithm from the DeepMind Paper in Python using LMQL as the backend.☆26Updated last year
- Implementations of Curious Replay for model-based adaptation.☆40Updated last year
- Deploy your autonomous agents to production grade environments with 99% Uptime Guarantee, Infinite Scalability, and self-healing.☆36Updated 2 months ago
- Aigents Java Core Platform☆31Updated 7 months ago
- ☆55Updated last year
- LLM Dynamic Planner - Combining LLM with PDDL Planners to solve an embodied task☆44Updated 5 months ago
- examples and guides to using Nomic Atlas☆36Updated last month
- ☆19Updated 7 months ago
- An OpenAI gym environment to evaluate the ability of LLMs (eg. GPT-4, Claude) in long-horizon reasoning and task planning in dynamic mult…☆68Updated 2 years ago
- Writing Blog Posts with Generative Feedback Loops!☆48Updated last year
- A web based platform for collecting human actions in reinforcement learning environments☆30Updated last year
- Causal Analysis of Agent Behavior for AI Safety☆18Updated last year
- ☆21Updated 3 months ago
- Ludwig benchmark☆20Updated 3 years ago