A framework for cleaning Chinese dialog data
☆273May 14, 2021Updated 4 years ago
Alternatives and similar repositories for clean-dialog
Users that are interested in clean-dialog are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A Large-scale Chinese Short-Text Conversation Dataset and Chinese pre-training dialog models☆1,943Jun 12, 2023Updated 2 years ago
- EVA: Large-scale Pre-trained Chit-Chat Models☆305Mar 11, 2023Updated 3 years ago
- [LREC] MMChat: Multi-Modal Chat Dataset on Social Media☆108Sep 25, 2022Updated 3 years ago
- An Open-Source Package for Chinese Open-domain Conversational Chatbot (中文闲聊对话系统,一键部署微信闲聊机器人)☆108Jul 6, 2023Updated 2 years ago
- ☆83Jul 3, 2023Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- OPD: Chinese Open-Domain Pre-trained Dialogue Model☆73Jun 5, 2023Updated 2 years ago
- ☆25Sep 29, 2021Updated 4 years ago
- Large-scale open domain KNOwledge grounded conVERsation system based on PaddlePaddle☆672Mar 6, 2024Updated 2 years ago
- GPT2 for Chinese chitchat/用于中文闲聊的GPT2模型(实现了DialoGPT的MMI思想)☆3,009Oct 30, 2023Updated 2 years ago
- Codes and data for the ACL 2021-Findings paper: CoMAE: A Multi-factor Hierarchical Framework for Empathetic Response Generation☆39May 8, 2023Updated 2 years ago
- KdConv: A Chinese Multi-domain Dialogue Dataset Towards Multi-turn Knowledge-driven Conversation☆498May 8, 2023Updated 2 years ago
- A Large-Scale Chinese Cross-Domain Task-Oriented Dialogue Dataset☆717Jun 17, 2024Updated last year
- A Knowledge Grounded Conversation (KGC) Paper Reading List Maintained by Chuan Meng.☆260Sep 22, 2021Updated 4 years ago
- ☆442Mar 12, 2022Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆65Jun 9, 2022Updated 3 years ago
- This repo is for the paper: On the Safety of Conversational Models: Taxonomy, Dataset, and Benchmark☆24Aug 13, 2022Updated 3 years ago
- ☆15Nov 3, 2022Updated 3 years ago
- ☆101Oct 10, 2020Updated 5 years ago
- Finetune CPM-1☆73Mar 18, 2023Updated 3 years ago
- Dataset and Baseline for SMP-MCC2020☆23Jul 6, 2023Updated 2 years ago
- BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)☆8,285Oct 16, 2024Updated last year
- Difference-aware Knowledge Selection for Knowledge-grounded Conversation Generation☆31May 8, 2023Updated 2 years ago
- This repo contains our ACL 2017 paper data and source code☆729Sep 15, 2020Updated 5 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Large-scale Pre-training Corpus for Chinese 100G 中文预训练语料☆1,002Feb 6, 2026Updated 2 months ago
- CPT: A Pre-Trained Unbalanced Transformer for Both Chinese Language Understanding and Generation☆496Dec 30, 2022Updated 3 years ago
- PLATO dialog model with pre-trained parameters in pytorch version☆29May 20, 2022Updated 3 years ago
- ☆21Dec 18, 2020Updated 5 years ago
- Unilm for Chinese Chitchat Robot.基于Unilm模型的夸夸式闲聊机器人项目。☆157Jan 21, 2021Updated 5 years ago
- ☆310Apr 6, 2023Updated 3 years ago
- Repository containing code for the WWW 2021 paper on empathic rewriting☆65Sep 6, 2022Updated 3 years ago
- Code for paper "Stylized Dialogue Response Generation Using Stylized Unpaired Texts"☆31Aug 18, 2022Updated 3 years ago
- Fine-grained Post-training for Improving Retrieval-based Dialogue Systems - NAACL 2021☆95Jul 8, 2021Updated 4 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Tensorflow implementation for MRFN in Retrieval-based Chatbots☆49Mar 13, 2020Updated 6 years ago
- Code for ACL 2021 main conference paper "Conversations Are Not Flat: Modeling the Dynamic Information Flow across Dialogue Utterances".☆94Jun 30, 2021Updated 4 years ago
- Conversational Toolkit. An Open-Source Toolkit for Fast Development and Fair Evaluation of Text Generation☆129Aug 31, 2020Updated 5 years ago
- Chinese-LLaMA 1&2、Chinese-Falcon 基础模型;ChatFlow中文对话模型;中文OpenLLaMA模型;NLP预训练/指令微调数据集☆3,051Apr 14, 2024Updated 2 years ago
- ☆443Jul 1, 2022Updated 3 years ago
- Finetune CPM-1☆24Jun 20, 2021Updated 4 years ago
- Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo☆3,104May 9, 2024Updated last year