bansky-cl / tods-arxiv-daily-paper
task-oriented dialogue system, especially for LLM, contain subtask: (1) intent-detection (2) slot filling (3) dialogue state tracking
☆54Updated this week
Related projects: ⓘ
- This is the repo which record the evolution of LM-based dialogue system. More details can be found in our original survey paper: A Survey…☆42Updated this week
- Dataset and evaluation script for "Evaluating Hallucinations in Chinese Large Language Models"☆105Updated 3 months ago
- ☆90Updated 6 months ago
- ☆111Updated 6 months ago
- Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation☆61Updated 4 months ago
- ☆156Updated last year
- Awesome papers for role-playing with language models☆88Updated last month
- 中文大语言模型评测第二期☆68Updated 10 months ago
- ☆46Updated 2 months ago
- A collection for math word problem (MWP) works, including datasets, algorithms and so on.☆28Updated 3 months ago
- MEASURING MASSIVE MULTITASK CHINESE UNDERSTANDING☆86Updated 5 months ago
- ☆34Updated last month
- ☆23Updated last year
- [ACL 2024] IEPile: A Large-Scale Information Extraction Corpus☆154Updated 2 months ago
- CRUD-RAG: A Comprehensive Chinese Benchmark for Retrieval-Augmented Generation of Large Language Models☆217Updated last month
- ☆158Updated 3 months ago
- ☆89Updated 9 months ago
- ☆109Updated 5 months ago
- Code and data for the paper "Can Large Language Models Understand Real-World Complex Instructions?"(AAAI2024)☆42Updated 5 months ago
- ☆124Updated 2 months ago
- Proactive Dialogue Systems - Paper Reading List☆38Updated 8 months ago
- 多轮共情对话模型PICA☆83Updated last year
- LAiW: A Chinese Legal Large Language Models Benchmark☆62Updated 2 months ago
- [NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other mo…☆281Updated 2 weeks ago
- repository for CharacterChat, a personalized social support system☆60Updated 2 months ago
- ☆260Updated 4 months ago
- YuLan-IR: Information Retrieval Boosted LMs☆211Updated 6 months ago
- 中文大语言模型评测第一期☆105Updated 10 months ago
- Code and dataset for our Bioinformatics 2022 paper: "A Benchmark for Automatic Medical Consultation System: Frameworks, Tasks and Datase…☆51Updated last year
- Repository for Interleaving Retrieval with Chain-of-Thought Reasoning for Knowledge-Intensive Multi-Step Questions, ACL23☆151Updated 3 months ago