DaDaMrX / AutoTODLinks

Official repository of the ACL 2024 paper "Rethinking Task-Oriented Dialogue Systems: From Complex Modularity to Zero-Shot Autonomous Agent"

☆15

Alternatives and similar repositories for AutoTOD

Users that are interested in AutoTOD are comparing it to the libraries listed below

Sorting:

dengyang17 / ProactiveDialogues
Proactive Dialogue Systems - Paper Reading List
☆61Updated last year
nuochenpku / Awesome-Role-Play-Papers
Awesome papers for role-playing with language models
☆194Updated 8 months ago
IronBeliever / CaR
Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation
☆85Updated 8 months ago
bansky-cl / tods-arxiv-daily-paper
task-oriented dialogue system, especially for LLM, contain subtask: (1) intent-detection (2) slot filling (3) dialogue state tracking
☆117Updated this week
thu-coai / ComplexBench
Benchmarking Complex Instruction-Following with Multiple Constraints Composition (NeurIPS 2024 Datasets and Benchmarks Track)
☆87Updated 5 months ago
OpenMOSS / HalluQA
Dataset and evaluation script for "Evaluating Hallucinations in Chinese Large Language Models"
☆130Updated last year
CASIA-LM / MoDS
☆142Updated last year
pldlgb / nuggets
☆84Updated last year
hrwise-nlp / Survey-Evolution-DS
This is the repo which record the evolution of LM-based dialogue system. More details can be found in our original survey paper: A Survey…
☆62Updated 3 months ago
iwangjian / TopDial
Target-oriented Proactive Dialogue Systems with Personalization: Problem Formulation and Dataset Curation (EMNLP 2023)
☆30Updated last year
sugarandgugu / Simple-Trl-Training
基于DPO算法微调语言大模型，简单好上手。
☆40Updated last year
OpenMOSS / Say-I-Dont-Know
[ICML'2024] Can AI Assistants Know What They Don't Know?
☆81Updated last year
YJiangcm / FollowBench
[ACL 2024] FollowBench: A Multi-level Fine-grained Constraints Following Benchmark for Large Language Models
☆108Updated last month
morecry / CharacterChat
repository for CharacterChat, a personalized social support system
☆72Updated last year
fanqiwan / Explore-Instruct
EMNLP'2023: Explore-Instruct: Enhancing Domain-Specific Instruction Coverage through Active Exploration
☆36Updated last year
morecry / CharacterEval
☆253Updated last month
KbsdJames / MATH-Minos
The implementation of paper "LLM Critics Help Catch Bugs in Mathematics: Towards a Better Mathematical Verifier with Natural Language Fee…
☆38Updated 11 months ago
DaoD / SPRING
SPRING: Learning Scalable and Pluggable Virtual Tokens for Retrieval-Augmented Large Language Models
☆22Updated 6 months ago
xiami2019 / UAR
[Findings of EMNLP'2024] Unified Active Retrieval for Retrieval Augmented Generation
☆22Updated 9 months ago
xiami2019 / CLAIF
[Findings of ACL'2023] Improving Contrastive Learning of Sentence Embeddings from AI Feedback
☆39Updated last year
fanqiwan / KCA
EMNLP'2024: Knowledge Verification to Nip Hallucination in the Bud
☆22Updated last year
icip-cas / awesome-auto-alignment
Collection of papers for scalable automated alignment.
☆92Updated 8 months ago
qinyiwei / InfoBench
☆55Updated 10 months ago
haidequanbu / ESC-Eval
[EMNLP 2024] ”ESC-Eval: Evaluating Emotion Support Conversations in Large Language Models“
☆18Updated last year
RUC-NLPIR / CORAL
CORAL: Benchmarking Multi-turn Conversational Retrieval-Augmentation Generation
☆55Updated 2 months ago
WooooDyy / MathCritique
Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".
☆55Updated 7 months ago
Abbey4799 / CELLO
Code and data for the paper "Can Large Language Models Understand Real-World Complex Instructions?"(AAAI2024)
☆48Updated last year
October2001 / ProLong
[ACL 2024 (Oral)] A Prospector of Long-Dependency Data for Large Language Models
☆56Updated 11 months ago
tianyi-lab / Cherry_LLM
[NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other mo…
☆379Updated 3 weeks ago
csitfun / LogiQA2.0
Logiqa2.0 dataset - logical reasoning in MRC and NLI tasks
☆95Updated last year