中文预处理语料
☆113Dec 18, 2018Updated 7 years ago
Alternatives and similar repositories for Chinese_from_dongxiexidian
Users that are interested in Chinese_from_dongxiexidian are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- lightMQ的客户端☆16Jan 26, 2019Updated 7 years ago
- mirror of dongxiexidian/Chinese☆306Dec 18, 2018Updated 7 years ago
- Personal open source user center☆30Jul 16, 2023Updated 2 years ago
- CopyNet (Copy Mechanism in Seq2Seq) implementation with TensorFlow 2☆10Nov 21, 2022Updated 3 years ago
- An collection of Chinese nlp corpus including basic Chinese syntatic wordset, semantic wordset, historic corpus and evaluate corpus. 中文自然…☆449Dec 16, 2018Updated 7 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- API_Translationg各大翻译网站API集合☆12Oct 20, 2018Updated 7 years ago
- 使用Bi-LSTM和crf来进行人名识别,数据集人民日报98年1月标注数据集,训练:验证:测试为3:1:1☆22Jul 25, 2018Updated 7 years ago
- datasets for NLP research☆24Nov 6, 2021Updated 4 years ago
- IdealWordCloudKit, A toolbox or kit for image-shape adjusted word cloud based on plain text, local file or web articles, 面向本地文件, 在线网页, 程序…☆41Jan 26, 2019Updated 7 years ago
- 自然语言处理相关实验实现 some experiment of natural language processing, Like text classification, named entity recognition, pos-tags, segment, key …☆54Nov 22, 2018Updated 7 years ago
- aliceCN☆14Jan 30, 2013Updated 13 years ago
- 端到端的基于知识图谱的问答系统,分为实体识别和关系分类两部,在BERT基础上做多任务联合训练。☆31Nov 22, 2019Updated 6 years ago
- 搜狗细胞词库到普通文本的转换提取工具。提取词汇表,用于深度学习做数据生成和字典特征☆26Dec 3, 2018Updated 7 years ago
- This is a corpus of Chinese abbreviation, including negative full forms.☆200Jul 17, 2021Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- 文本去重☆78May 23, 2024Updated last year
- 公司名语料库。机构名语料库。公司简称,缩写,品牌词,企业名。可用于中文分词、机构名实体识别。☆1,295Mar 27, 2024Updated 2 years ago
- 大规模中文语料☆44Nov 5, 2019Updated 6 years ago
- ☆10Jan 28, 2021Updated 5 years ago
- 用bert4keras加载CDial-GPT☆38Nov 20, 2020Updated 5 years ago
- Chinese Classic Poem Mining Project including corpus buiding by spyder and content analysis by nlp methods, 基于爬虫与nlp的中国古代诗词文本挖掘项目☆119Oct 7, 2018Updated 7 years ago
- parallel corpus dataset from the mnbvc project☆15Feb 11, 2026Updated 2 months ago
- Byte Cup 2018国际机器学习竞赛 23 名(水滴队)代码☆47Feb 22, 2019Updated 7 years ago
- 该项目可以根据用户给出的上文自动生成下文 该项目是本人的本科毕业设计。项目主要基于GPT-2 Chinese实现。本人的工作主要是用新的语料库进行了几次训练,得出来了一个还凑合的模型。该项目已经初步完成,不再进行进一步的更新。☆12Jun 9, 2020Updated 5 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- 中文公开聊天语料库☆4,182Apr 23, 2024Updated 2 years ago
- NLP Project + pytorch☆10Oct 17, 2020Updated 5 years ago
- Software for unsupervised word segmentation and language model learning using lattices☆45Aug 17, 2016Updated 9 years ago
- 完成分包拆包,使用ffmpeg完成音视频的RTMP推流☆14Dec 20, 2019Updated 6 years ago
- Some useful Chinese corpus datasets 中文语料小数据☆546Mar 29, 2020Updated 6 years ago
- 利用知识图谱嵌入模型研究阿尔茨海默病的药物重定位.☆11May 15, 2023Updated 2 years ago
- 使用word2vec, fasttext进行训练词向量☆11Jan 10, 2019Updated 7 years ago
- 2020-natural-language-processing-project☆10Dec 18, 2020Updated 5 years ago
- 根据文本和角色名字典,生成人物关系文件,利用Gephi可生成网络图☆14Aug 25, 2019Updated 6 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A demo of new approach to automatic text summarization using topic models and bipartite graphs.☆15Apr 23, 2013Updated 13 years ago
- self complement of Sentence Similarity compute based on cilin, hownet, simhash, wordvector,vsm models,基于同义词词林,知网,指纹,字词向量,向量空间模型的句子相似度计算。☆364Dec 15, 2018Updated 7 years ago
- Load Tensorflow pb file using Bert/TextCNNs, an ensemble model using Java.☆10Aug 20, 2021Updated 4 years ago
- 基于开源词语识别项目的高性能识别工具(可用于敏感词识别,关键词识别等)☆18Jun 17, 2022Updated 3 years ago
- 敏感词过滤的几种实现+某1w词敏感词库☆2,113Aug 20, 2021Updated 4 years ago
- 利用lstm和lstm/cnn进行答案问题匹配☆16Apr 21, 2018Updated 8 years ago
- ☆15Nov 19, 2018Updated 7 years ago