☆17 · Jun 12, 2020 · Updated 5 years ago
Alternatives and similar repositories for Common-NLP-Datasets
Users that are interested in Common-NLP-Datasets are comparing it to the libraries listed below.
- Fast search index for SPLADE sparse retrieval models implemented in Python using NumPy and Numba ☆38 · Oct 16, 2025 · Updated 5 months ago
- A parallel evaluation data set of SAP software documentation with document structure annotation ☆14 · Jul 30, 2025 · Updated 8 months ago
- A simple semi-supervised approach for creating Hugging Face data loader scripts and uploading them to the Hub ☆11 · Jun 23, 2024 · Updated last year
- A record of useful Git repos ☆12 · Jul 28, 2024 · Updated last year
- A mesh system for adapting multiple large language models ☆11 · Mar 20, 2024 · Updated 2 years ago
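- Machine learning model library implementing Nix ideas for configuration management ☆11 · Sep 12, 2020 · Updated 5 years ago
- E-commerce shopping-guide chatbot: natural language understanding (NLU), text error correction, and ambiguous-word disambiguation ☆12 · May 5, 2020 · Updated 5 years ago
- Code for detecting language from text in Python using fastText ☆13 · May 25, 2020 · Updated 5 years ago
- Transformer implementation for NMT using PyTorch Lightning (Korean to English) ☆10 · Oct 19, 2020 · Updated 5 years ago
- Use ChatGLM to perform text embedding ☆45 · Apr 9, 2023 · Updated 3 years ago
- Implementation for the paper SideWindowFilter ☆10 · Nov 28, 2019 · Updated 6 years ago
- C++ inference for EmotiVoice ☆16 · Jan 1, 2024 · Updated 2 years ago
- A Chinese chatbot you can train yourself: train the chatbot you want on your own corpus, for use in scenarios such as intelligent customer service, online Q&A, and intelligent chat. Currently includes seq2seq, seqGAN, and tf2.0 versions. ☆11 · Feb 10, 2021 · Updated 5 years ago
- 同花顺 (Tonghuashun) Algorithm Challenge Platform: [September-October bimonthly contest] text semantic matching with cross-domain transfer ☆11 · Oct 28, 2021 · Updated 4 years ago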
- A collection of utilities used in exploring data augmentation of low-resource parallel corpora. … ☆11 · Sep 6, 2017 · Updated 8 years ago
- Chinese keyword extraction ☆14 · Aug 7, 2023 · Updated 2 years ago
- A PySimpleGUI-based text and code editor ☆14 · Oct 6, 2019 · Updated 6 years ago
- Code and data release for "Improving Multilingual Translation by Representation and Gradient Regularization" (Yang et al. EMNLP 2021), an… ☆13 · Aug 12, 2024 · Updated last year
- A TensorFlow implementation of SimCSE, with baseline experiment comparisons ☆13 · Jul 22, 2021 · Updated 4 years ago
- Official repo for the [AAAI'26 Oral] paper “StyleTailor: Towards Personalized Fashion Styling via Hierarchical Negative Feedback” ☆33 · Mar 1, 2026 · Updated last month
- This project consists of three modules. Intent recognition: determines whether the user's intent is business-related or chit-chat. Model retrieval: builds a corpus and, when a user issues a new query (classified as business dialogue by intent recognition), matches the best response for that query, using HSWN for recall (coarse ranking), then computes sentence similarity and uses Lig… ☆12 · Feb 18, 2021 · Updated 5 years ago
- [CVPR 2026] UnicEdit-10M and UnicBench project ☆40 · Mar 3, 2026 · Updated last month
- Code for experiments with different text data augmentation techniques ☆14 · Oct 24, 2019 · Updated 6 years ago
- Classic NLP models implemented in PyTorch for Chinese short-text classification, including TextCNN, TextRCNN, FastText, BERT, ROBERT, and ERNIE ☆54 · Jun 29, 2020 · Updated 5 years ago
- Global AI Technology Innovation Competition, Track 3: semantic matching of short dialogue texts for the Xiaobu (小布) assistant ☆11 · Apr 5, 2021 · Updated 5 years ago
- ☆20 · Apr 28, 2021 · Updated 4 years ago
- Hack and Tell @ Saarland University ☆19 · Dec 11, 2017 · Updated 8 years ago
- This is the source code of "Temporally Coherent Completion of Dynamic Video", ACM Transactions on Graphics (TOG), 2016, from https://file… ☆15 · Mar 7, 2019 · Updated 7 years ago
- [CVPR 2024] DiffAgent: Fast and Accurate Text-to-Image API Selection with Large Language Model ☆19 · Apr 16, 2024 · Updated last year
- Performance benchmarking of TensorFlow and PyTorch ☆13 · Jun 30, 2021 · Updated 4 years ago
- Building a task-oriented chatbot with Rasa ☆13 · Dec 8, 2022 · Updated 3 years ago
- Statistics on multilingual datasets ☆17 · Jul 12, 2022 · Updated 3 years ago
- Wrap cppjieba with SWIG ☆20 · Mar 15, 2018 · Updated 8 years ago
- Ensemble of 10 modified BERT Base models for predicting the best answers to search engine queries ☆16 · Jan 1, 2019 · Updated 7 years ago
- ☆14 · Jul 12, 2022 · Updated 3 years ago
- FewCLUE: few-shot learning evaluation benchmark, Chinese version ☆519 · Sep 21, 2022 · Updated 3 years ago
- Multilingual large voice generation model, providing full-stack inference, training, and deployment capabilities ☆20 · Mar 12, 2026 · Updated last month
- Investigating multilingual language models (BERT) by using them for NER in German and English ☆14 · Apr 30, 2019 · Updated 6 years ago
- Just another FastSpeech 2 but cleaner code :) ☆29 · Jun 28, 2024 · Updated last year