☆17Jun 12, 2020Updated 5 years ago
Alternatives and similar repositories for Common-NLP-Datasets
Users that are interested in Common-NLP-Datasets are comparing it to the libraries listed below.
- Meedan's Open Source Arabic/English Translation Memory☆33Nov 4, 2009Updated 16 years ago
- Fast search index for SPLADE sparse retrieval models implemented in Python using Numpy and Numba☆38Oct 16, 2025Updated 6 months ago
- A parallel evaluation data set of SAP software documentation with document structure annotation☆15Jul 30, 2025Updated 9 months ago
- COMET for African languages☆11Jan 24, 2025Updated last year
- A simple semi-supervised approach for creating huggingface data script loaders and upload to the hub.☆11Jun 23, 2024Updated last year
- Coursera Corpus Mining and Multistage Fine-Tuning for Improving Lectures Translation☆15Aug 27, 2024Updated last year
- A mesh system for adapting multiple large language models.☆11Mar 20, 2024Updated 2 years ago
- Machine learning model library implementing Nix ideas for configuration management☆11Sep 12, 2020Updated 5 years ago
- Natural language understanding (NLU), text error correction, and ambiguous-word disambiguation for an e-commerce shopping-guide bot☆12May 5, 2020Updated 5 years ago
- [ICLR 2026] Many-for-Many: Unify the Training of Multiple Video and Image Generation and Manipulation Tasks☆30Feb 5, 2026Updated 2 months ago
- A QA chatbot that answers questions from the contents of given documents☆12Apr 20, 2023Updated 3 years ago
- ☆13Jul 28, 2024Updated last year
- ☆11Mar 22, 2020Updated 6 years ago
- VITS speech synthesis, fine-tuned on Chinese only☆12Mar 15, 2023Updated 3 years ago
- use chatGLM to perform text embedding☆45Apr 9, 2023Updated 3 years ago
- Bot that addresses typical questions about the COVID-19 virus to help you handle high volumes of questions from your customers, partners …☆12Dec 5, 2022Updated 3 years ago
- ☆11Sep 17, 2023Updated 2 years ago
- cpp inference for EmotiVoice☆16Jan 1, 2024Updated 2 years ago
- A Master Thesis Project on Video Keyword Extractor using Video Summarization techniques.☆11Oct 25, 2020Updated 5 years ago
- A collection of textual datasets in Hausa language and the corresponding translation in English language.☆16Mar 5, 2021Updated 5 years ago
- Generates the large volumes of data needed by text error correction models (such as Gector).☆14Jan 5, 2023Updated 3 years ago
- A Chinese chatbot you can train yourself: train the chatbot you want from your own corpus, for use in scenarios such as intelligent customer service, online Q&A, and smart chat. Currently includes seq2seq, seqGAN, and tf2.0 versions.☆11Feb 10, 2021Updated 5 years ago
- Tonghuashun (同花顺) Algorithm Challenge Platform: [September-October bimonthly competition] text semantic matching with cross-domain transfer☆11Oct 28, 2021Updated 4 years ago
- A collection of utilities used in exploring data augmentation of low-resource parallel corpora. …☆11Sep 6, 2017Updated 8 years ago
- Code and Data release for "Improving Multilingual Translation by Representation and Gradient Regularization" (Yang et al. EMNLP 2021), an…☆13Aug 12, 2024Updated last year
- A TensorFlow implementation of SimCSE, with baseline experiment comparisons☆13Jul 22, 2021Updated 4 years ago
- This repository not only contains experience about parameter finetune, but also other in-practice experience such as model ensemble (boos…☆16Oct 29, 2017Updated 8 years ago
- Code and data for "A Deep Generative Model for Code-Switched Text" accepted in IJCAI 2019☆16Nov 14, 2019Updated 6 years ago
- Long audio alignment using Kaldi☆23Apr 22, 2021Updated 5 years ago
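- This project consists of three modules. Intent recognition: determines whether the user's intent is business-related or chitchat. Model retrieval: this part builds a corpus, and when the user issues a new query (classified as a business dialogue by intent recognition), it matches the best response for the query, using HSWN for recall (coarse ranking), then computing sentence similarity, and using Lig…☆12Feb 18, 2021Updated 5 years ago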
- Code related to experimentation of different Text Data Augmentation Techniques☆14Oct 24, 2019Updated 6 years ago
- Chinese short-text classification with several classic NLP models implemented in PyTorch, including TextCNN, TextRCNN, FastText, BERT, ROBERT, and ERNIE☆54Jun 29, 2020Updated 5 years ago
- Global AI Technology Innovation Competition, Track 3: short-text semantic matching for Xiaobu Assistant (小布助手) dialogue☆11Apr 5, 2021Updated 5 years ago
- Find informative examples to efficiently (human)-evaluate NLG models.☆18Apr 22, 2026Updated last week
- Applications of adversarial training in NLP☆14Nov 22, 2021Updated 4 years ago
- This is the source code of "Temporally Coherent Completion of Dynamic Video", ACM Transactions on Graphics (TOG), 2016, from https://file…☆15Mar 7, 2019Updated 7 years ago
- [CVPR 2024] DiffAgent: Fast and Accurate Text-to-Image API Selection with Large Language Model☆19Apr 16, 2024Updated 2 years ago
- Performance benchmarking of TensorFlow and PyTorch.☆13Jun 30, 2021Updated 4 years ago
- Building a task-oriented chatbot with rasa☆13Dec 8, 2022Updated 3 years ago