Official code for the paper "Context-Aware Language Modeling for Goal-Oriented Dialogue Systems"
☆34Dec 9, 2022Updated 3 years ago
Alternatives and similar repositories for CALM-Dialogue
Users that are interested in CALM-Dialogue are comparing it to the libraries listed below.
- Official code from the paper "Offline RL for Natural Language Generation with Implicit Language Q Learning"☆211Jul 31, 2023Updated 2 years ago
- Comprehensive Implementation of Proximal Policy Optimization☆12Aug 3, 2021Updated 4 years ago
- ☆13May 26, 2022Updated 3 years ago
- Deep reinforcement learning implementation that trains AIs for the CodeCraft real-time strategy game.☆21May 22, 2023Updated 2 years ago
- OPE Tools based on Empirical Study of Off Policy Policy Estimation paper.☆62Aug 9, 2022Updated 3 years ago
- Automated Programming Framework☆15May 11, 2020Updated 5 years ago
- Model-Based Offline Reinforcement Learning☆51Jan 13, 2021Updated 5 years ago
- ☆47Apr 24, 2022Updated 3 years ago
- This repo contains a set of notebooks to reproduce reinforcement learning algorithms.☆16Nov 21, 2022Updated 3 years ago
- Variational Reinforcement Learning☆17Jul 25, 2024Updated last year
- Code of the paper: Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Function☆13Nov 22, 2022Updated 3 years ago
- Code for Generalization Guarantees for (Multi-Modal) Imitation Learning☆11Jul 14, 2022Updated 3 years ago
- Qualitative Numeric Planning☆10Dec 10, 2020Updated 5 years ago
- Reinforcement Learning via Supervised Learning☆72May 16, 2022Updated 3 years ago
- Integration of the planning system Fast Downward with the unified-planning framework.☆13Aug 5, 2025Updated 7 months ago
- Official codebase for "The Generalization Gap in Offline Reinforcement Learning" accepted to ICLR 2024☆28Feb 20, 2026Updated last month
- Minimal Decision Transformer Implementation written in Jax (Flax).☆17Aug 8, 2022Updated 3 years ago
- ☆32Nov 20, 2019Updated 6 years ago
- MiniZinc documentation☆16Feb 9, 2023Updated 3 years ago
- Learning to Retrieve by Trying - Source code for Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval☆52Oct 31, 2024Updated last year
- ☆30Jun 13, 2019Updated 6 years ago
- Code for our NeurIPS 2020 paper Improving Generalization in Reinforcement Learning with Mixture Regularization☆34Oct 22, 2020Updated 5 years ago
- Decision Transformer JAX - Reproduction of 'Decision Transformer: Reinforcement Learning via Sequence Modeling' in JAX and Haiku☆13Aug 14, 2024Updated last year
- ☆12Dec 13, 2022Updated 3 years ago
- The newly improved planner (and more) in the cloud.☆40Mar 18, 2026Updated last week
- Off-policy Learning in Two-stage Recommender Systems. https://dl.acm.org/doi/pdf/10.1145/3366423.3380130☆30Jun 11, 2020Updated 5 years ago
- Deep generative model for sentiment analysis☆34Mar 13, 2017Updated 9 years ago
- Code for An Inductive Bias for Distances: Neural Nets that Respect the Triangle Inequality (ICLR 2020)☆11Mar 24, 2023Updated 3 years ago
- RND1: Scaling Diffusion Language Models☆176Feb 22, 2026Updated last month
- The official implementation of "Helen: Optimizing CTR Prediction Models with Frequency-wise Hessian Eigenvalue Regularization"☆16Mar 14, 2024Updated 2 years ago
- Manipulate NNF (Negation Normal Form) logical sentences☆20Dec 13, 2022Updated 3 years ago
- ☆33Jul 30, 2024Updated last year
- Multi-Task Pre-Training for Plug-and-Play Task-Oriented Dialogue System (ACL 2022)☆162Dec 20, 2023Updated 2 years ago
- Learning Domain-Independent Planning Heuristics over Hypergraphs (ICAPS'20)☆14Mar 21, 2025Updated last year
- Unofficial JAX implementation of the SOAP optimizer (https://arxiv.org/abs/2409.11321)☆25Jan 9, 2026Updated 2 months ago
- Training chatbot models with reinforcement learning in ParlAI.☆17Dec 8, 2022Updated 3 years ago
- paper on dexpilot☆15Oct 14, 2019Updated 6 years ago
- ☆46Apr 10, 2023Updated 2 years ago
- ☆11Nov 18, 2023Updated 2 years ago