☆17Jun 12, 2020Updated 5 years ago
Alternatives and similar repositories for Common-NLP-Datasets
Users that are interested in Common-NLP-Datasets are comparing it to the libraries listed below
Sorting:
- Meedan's Open Source Arabic/English Translation Memory☆33Nov 4, 2009Updated 16 years ago
- COMET for African languages☆11Jan 24, 2025Updated last year
- 记录有用的Git repos☆12Jul 28, 2024Updated last year
- Coursera Corpus Mining and Multistage Fine-Tuning for Improving Lectures Translation☆15Aug 27, 2024Updated last year
- 基于电商导购机器人,自然语言理解(NLU),文本纠错,歧义词消歧☆12May 5, 2020Updated 5 years ago
- a QA bot on contents of given docs 用所给文档进行问答的聊天机器人☆12Apr 20, 2023Updated 2 years ago
- ☆11Mar 22, 2020Updated 6 years ago
- 语音合成VITS 纯中文微调☆12Mar 15, 2023Updated 3 years ago
- use chatGLM to perform text embedding☆45Apr 9, 2023Updated 2 years ago
- Bot that addresses typical questions about the COVID-19 virus to help you handle high volumes of questions from your customers, partners …☆12Dec 5, 2022Updated 3 years ago
- cpp inference for EmotiVoice☆16Jan 1, 2024Updated 2 years ago
- A Master Thesis Project on Video Keyword Extractor using Video Summarization techniques.☆11Oct 25, 2020Updated 5 years ago
- A collection of textual datasets in Hausa language and the corresponding translation in English language.☆16Mar 5, 2021Updated 5 years ago
- 用于生成文本纠错模型(如Gector)需要的大量数据。☆14Jan 5, 2023Updated 3 years ago
- 一个可以自己进行训练的中文聊天机器人, 根据自己的语料训练出自己想要的聊天机器人,可以用于智能客服、在线问答、智能聊天等场景。目前包含seq2seq、seqGAN版本和tf2.0版本。☆11Feb 10, 2021Updated 5 years ago
- 同花顺算法挑战平台:【9-10双月赛】跨领域迁移的文本语义匹配☆11Oct 28, 2021Updated 4 years ago
- A collection of utilities used in exploring data augmentation of low-resource parallel corpuses. …☆11Sep 6, 2017Updated 8 years ago
- A PySimpleGUI based text and code editor☆14Oct 6, 2019Updated 6 years ago
- Code and Data release for "Improving Multilingual Translation by Representation and Gradient Regularization" (Yang et al. EMNLP 2021), an…☆13Aug 12, 2024Updated last year
- This repository not only contains experience about parameter finetune, but also other in-practice experience such as model ensemble (boos…☆16Oct 29, 2017Updated 8 years ago
- Official Repo For the [AAAI'26 Oral] Paper “StyleTailor: Towards Personalized Fashion Styling via Hierarchical Negative Feedback”☆30Mar 1, 2026Updated 3 weeks ago
- 本项目由三个模块构成。意图识别:判断用户的意图是业务型还是闲聊型;模型检索:该部分构建一个语料库,当用户 发起新的query(通过意图识别判断为业务型对话)时,为用户匹配query检索的最佳response,使用HSWN进行召回(粗排), 然后构建句子的相似度,并利用Lig…☆12Feb 18, 2021Updated 5 years ago
- Code related to experimentation of different Text Data Augmentation Techniques☆14Oct 24, 2019Updated 6 years ago
- 基于Pytorch实现的一些经典自然语言处理模型中文短文本分类任务,包含TextCNN,TextRCNN,FastText,BERT,ROBERT以及ERNIE☆54Jun 29, 2020Updated 5 years ago
- 全球人工智能技术创新大赛-赛道三:小布助手对话短文本语义匹配☆12Apr 5, 2021Updated 4 years ago
- Find informative examples to efficiently (human)-evaluate NLG models.☆18Feb 27, 2026Updated 3 weeks ago
- 对抗训练在NLP中的应用☆14Nov 22, 2021Updated 4 years ago
- ☆20Apr 28, 2021Updated 4 years ago
- Hack and Tell @ Saarland University☆19Dec 11, 2017Updated 8 years ago
- 使用rasa构建任务型聊天机器人☆13Dec 8, 2022Updated 3 years ago
- Statistics on multilingual datasets☆17Jul 12, 2022Updated 3 years ago
- wrap cppjieba by swig.☆20Mar 15, 2018Updated 8 years ago
- 多轮中文聊天机器人,采用GPT2进行微调,清洗聊天数据110w+,采用语义相似度和文本jaccard相似度过滤回话。☆23Nov 13, 2021Updated 4 years ago
- Softcatalà neural translation models☆20Jan 17, 2026Updated 2 months ago
- Ensemble of 10 modified BERT Base models for prediction of best answers for queries on search engines.☆16Jan 1, 2019Updated 7 years ago
- ☆14Jul 12, 2022Updated 3 years ago
- Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.☆20Mar 12, 2026Updated last week
- Investigating multilingual language models (BERT) by using them for NER in German and English☆14Apr 30, 2019Updated 6 years ago
- Just another FastSpeech 2 but cleaner code :)☆29Jun 28, 2024Updated last year