lemon234071 / clean-dialogView external linksLinks
A framework for cleaning Chinese dialog data
☆273May 14, 2021Updated 4 years ago
Alternatives and similar repositories for clean-dialog
Users that are interested in clean-dialog are comparing it to the libraries listed below
Sorting:
- A Large-scale Chinese Short-Text Conversation Dataset and Chinese pre-training dialog models☆1,933Jun 12, 2023Updated 2 years ago
- EVA: Large-scale Pre-trained Chit-Chat Models☆306Mar 11, 2023Updated 2 years ago
- [LREC] MMChat: Multi-Modal Chat Dataset on Social Media☆108Sep 25, 2022Updated 3 years ago
- An Open-Source Package for Chinese Open-domain Conversational Chatbot (中文闲聊对话系统,一键部署微信闲聊机器人)☆108Jul 6, 2023Updated 2 years ago
- ☆82Jul 3, 2023Updated 2 years ago
- 中文对话数 据清洗☆32Nov 8, 2022Updated 3 years ago
- KdConv: A Chinese Multi-domain Dialogue Dataset Towards Multi-turn Knowledge-driven Conversation☆495May 8, 2023Updated 2 years ago
- ☆25Sep 29, 2021Updated 4 years ago
- Large-scale open domain KNOwledge grounded conVERsation system based on PaddlePaddle☆673Mar 6, 2024Updated last year
- Codes and data for the ACL 2021-Findings paper: CoMAE: A Multi-factor Hierarchical Framework for Empathetic Response Generation☆39May 8, 2023Updated 2 years ago
- GPT2 for Chinese chitchat/用于中文闲聊的GPT2模型(实现了DialoGPT的MMI思想)☆3,012Oct 30, 2023Updated 2 years ago
- OPD: Chinese Open-Domain Pre-trained Dialogue Model☆75Jun 5, 2023Updated 2 years ago
- ☆102Oct 10, 2020Updated 5 years ago
- A Large-Scale Chinese Cross-Domain Task-Oriented Dialogue Dataset☆711Jun 17, 2024Updated last year
- Finetune CPM-1☆75Mar 18, 2023Updated 2 years ago
- A Knowledge Grounded Conversation (KGC) Paper Reading List Maintained by Chuan Meng.☆260Sep 22, 2021Updated 4 years ago
- Dataset and Baseline for SMP-MCC2020☆23Jul 6, 2023Updated 2 years ago
- ☆442Mar 12, 2022Updated 3 years ago
- This repo contains our ACL 2017 paper data and source code☆729Sep 15, 2020Updated 5 years ago
- Unilm for Chinese Chitchat Robot.基于Unilm模型的夸夸式闲聊机器人项目。☆158Jan 21, 2021Updated 5 years ago
- Large-scale Pre-training Corpus for Chinese 100G 中文预训练语料☆997Feb 6, 2026Updated last week
- Tensorflow implementation for MRFN in Retrieval-based Chatbots☆49Mar 13, 2020Updated 5 years ago
- CPT: A Pre-Trained Unbalanced Transformer for Both Chinese Language Understanding and Generation☆495Dec 30, 2022Updated 3 years ago
- BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)☆8,283Oct 16, 2024Updated last year
- ☆313Apr 6, 2023Updated 2 years ago
- Code for paper "Stylized Dialogue Response Generation Using Stylized Unpaired Texts"☆31Aug 18, 2022Updated 3 years ago
- Difference-aware Knowledge Selection for Knowledge-grounded Conversation Generation☆31May 8, 2023Updated 2 years ago
- ☆15Nov 3, 2022Updated 3 years ago
- ☆21Dec 18, 2020Updated 5 years ago
- pCLUE: 1000000+多任务提示学习数据集☆506Oct 4, 2022Updated 3 years ago
- Panda项目是于2023年5月启动的开源海外中文大语言模型项目,致力于大模型时代探索整个技术栈,旨在推动中文自然语言处理领域的创新和合作。☆1,037Oct 19, 2023Updated 2 years ago
- a bert for retrieval and generation☆860Feb 26, 2021Updated 4 years ago
- ☆65Jun 9, 2022Updated 3 years ago
- Fine-grained Post-training for Improving Retrieval-based Dialogue Systems - NAACL 2021☆95Jul 8, 2021Updated 4 years ago
- ☆221Nov 12, 2019Updated 6 years ago
- 用bert4keras加载CDial-GPT☆38Nov 20, 2020Updated 5 years ago
- Conversational Toolkit. An Open-Source Toolkit for Fast Development and Fair Evaluation of Text Generation☆129Aug 31, 2020Updated 5 years ago
- ☆443Jul 1, 2022Updated 3 years ago
- Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo☆3,106May 9, 2024Updated last year