BlueDi / DeepDip

DeepDip, a DRL Gym agent that plays no-press Diplomacy in BANDANA

☆13

Alternatives and similar repositories for DeepDip:

Users that are interested in DeepDip are comparing it to the libraries listed below

lab-v2 / langdiversity
Elevate your language models with insightful diversity metrics.
☆11Updated 11 months ago
Miffyli / rl-human-prior-tricks
Evaluating different engineering tricks that make RL work
☆15Updated 3 years ago
btnorman / First-Explore
Repo to reproduce the First-Explore paper results
☆37Updated 3 weeks ago
lab-v2 / pyreason-gym
An OpenAI wrapper for PyReason to use in a Grid World reinforcement learning setting
☆27Updated last year
google-deepmind / diplomacy
☆50Updated 8 months ago
jprivera44 / EscalAItion
Repo for the paper on Escalation Risks of AI systems
☆36Updated 9 months ago
plastic-labs / dspy-opentom
Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset
☆14Updated 10 months ago
ambroser53 / Prompt-Day-Care
A repository re-creating the PromptBreeder Evolutionary Algorithm from the DeepMind Paper in Python using LMQL as the backend.
☆27Updated last year
nomic-ai / cookbook
examples and guides to using Nomic Atlas
☆27Updated 4 months ago
gregorycoppola / bayes-star
Implementation
☆24Updated this week
CarletonCognitiveModelingLab / python_actr
A Python implementation of the ACT-R cognitive Architecture
☆28Updated last year
cognitiveailab / neurosymbolic
A neurosymbolic T5 agent for playing text games, from the EACL 2023 paper "Behavior Cloned Transformers are Neurosymbolic Reasoners"
☆19Updated last year
The-Swarm-Corporation / swarm-ecosystem
The Swarm Ecosystem
☆19Updated 5 months ago
cavaunpeu / mcts-llm-codegen
A Python reimplementation of "Planning with Large Language Models for Code Generation" (https://arxiv.org/abs/2303.05510)
☆17Updated last year
kyegomez / Exa
Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…
☆23Updated 2 months ago
luohongyin / EntST
Entailment self-training
☆25Updated last year
S1M0N38 / dspy-arxiv
Explore the use of DSPy for extracting features from PDFs 🔎
☆37Updated 10 months ago
IDSIA / automated-cl
Official repository for the paper "Automating Continual Learning"
☆12Updated 9 months ago
levitation-opensource / Manipulative-Expression-Recognition
MER is a software that identifies and highlights manipulative communication in text from human conversations and AI-generated responses. …
☆13Updated 5 months ago
Pervasive-AI-Lab / LuckyMera
☆13Updated 3 months ago
RewardReports / reward-reports
Documentation for dynamic machine learning systems.
☆29Updated 4 months ago
rjb7731 / LLM-demos
☆12Updated 9 months ago
ManifoldRG / AgentForge
The AgentForge project focuses on building general tooling to construct multicapability AI systems by composing skills and models togethe…
☆17Updated last year
facebookresearch / LIGHT
LIGHT is a platform for text-situated dialogue research. We originally hosted LIGHT as a live game with dialogue models in a grounded set…
☆68Updated last year
pietrolesci / anchoral
This is the official PyTorch implementation for our NAACL 2024 paper: "AnchorAL: Computationally Efficient Active Learning for Large and …
☆19Updated last month
microsoft / Alympics
☆48Updated 7 months ago
kyegomez / SelfExtend
Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" from Pytorch and Zeta
☆13Updated 2 months ago
EleutherAI / mdl
Minimum Description Length probing for neural network representations
☆18Updated last week