BlueDi / DeepDip
DeepDip, a DRL Gym agent that plays no-press Diplomacy in BANDANA
☆13Updated 5 years ago
Alternatives and similar repositories for DeepDip:
Users that are interested in DeepDip are comparing it to the libraries listed below
- Elevate your language models with insightful diversity metrics.☆11Updated 11 months ago
- Evaluating different engineering tricks that make RL work☆15Updated 3 years ago
- Repo to reproduce the First-Explore paper results☆37Updated 3 weeks ago
- An OpenAI wrapper for PyReason to use in a Grid World reinforcement learning setting☆27Updated last year
- ☆50Updated 8 months ago
- Repo for the paper on Escalation Risks of AI systems☆36Updated 9 months ago
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset☆14Updated 10 months ago
- A repository re-creating the PromptBreeder Evolutionary Algorithm from the DeepMind Paper in Python using LMQL as the backend.☆27Updated last year
- examples and guides to using Nomic Atlas☆27Updated 4 months ago
- Implementation☆24Updated this week
- A Python implementation of the ACT-R cognitive Architecture☆28Updated last year
- A neurosymbolic T5 agent for playing text games, from the EACL 2023 paper "Behavior Cloned Transformers are Neurosymbolic Reasoners"☆19Updated last year
- The Swarm Ecosystem☆19Updated 5 months ago
- A Python reimplementation of "Planning with Large Language Models for Code Generation" (https://arxiv.org/abs/2303.05510)☆17Updated last year
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆23Updated 2 months ago
- Entailment self-training☆25Updated last year
- Explore the use of DSPy for extracting features from PDFs 🔎☆37Updated 10 months ago
- Official repository for the paper "Automating Continual Learning"☆12Updated 9 months ago
- MER is a software that identifies and highlights manipulative communication in text from human conversations and AI-generated responses. …☆13Updated 5 months ago
- ☆13Updated 3 months ago
- Documentation for dynamic machine learning systems.☆29Updated 4 months ago
- ☆12Updated 9 months ago
- The AgentForge project focuses on building general tooling to construct multicapability AI systems by composing skills and models togethe…☆17Updated last year
- LIGHT is a platform for text-situated dialogue research. We originally hosted LIGHT as a live game with dialogue models in a grounded set…☆68Updated last year
- This is the official PyTorch implementation for our NAACL 2024 paper: "AnchorAL: Computationally Efficient Active Learning for Large and …☆19Updated last month
- ☆48Updated 7 months ago
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" from Pytorch and Zeta☆13Updated 2 months ago
- Minimum Description Length probing for neural network representations☆18Updated last week