liuhuanyong / ChineseNLPCorpus
A collection of Chinese NLP corpora, including basic Chinese syntactic and semantic word sets, domain-specific synchronic and diachronic corpora, and evaluation corpora.
☆450 Dec 16, 2018 Updated 7 years ago
Alternatives and similar repositories for ChineseNLPCorpus
Users that are interested in ChineseNLPCorpus are comparing it to the libraries listed below
- A collection of Chinese embeddings covering five types of vectors: character (token), POS tag, pinyin, dependency relation, and word embeddings. ☆454 Dec 15, 2018 Updated 7 years ago
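- ChineseTextualInference, a Chinese textual inference project, including the translation and construction of an 880,000-pair Chinese textual entailment dataset and the construction of a deep-learning-based textual entailment classification model… ☆176 Dec 15, 2018 Updated 7 years ago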
- Baike schema crawler for Baidu Baike and Hudong Baike: a script that crawls the concept taxonomy of both encyclopedias. ☆38 Apr 25, 2018 Updated 7 years ago
- Large Scale Chinese Corpus for NLP ☆9,857 Feb 6, 2026 Updated last week
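- A self-implemented infobox extraction tool for Baike knowledge bases using online analysis: structured extraction of entry infoboxes from Hudong Baike, Baidu Baike, and Sogou Baike, plus fusion of the encyclopedia knowledge. ☆37 Mar 30, 2018 Updated 7 years ago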
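- Collects, organizes, and publishes Chinese NLP corpora and datasets, working with like-minded contributors to advance Chinese natural language processing. ☆6,468 Jan 29, 2019 Updated 7 years ago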
- Medical corpus. Corpus of medical institution names. Drug standard codes. ☆69 Mar 27, 2024 Updated last year
- HyponymyExtraction and Graph: hypernym-hyponym extraction and visualization for words, based on a knowledge concept schema, a Baike knowledge base, and structured extraction from online search. ☆171 Oct 6, 2018 Updated 7 years ago
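- A self-implemented Chinese NLP toolkit providing HMM- and CRF-based interfaces for word segmentation, POS tagging, and named entity recognition, plus a CRF-based dependency parsing interface. ☆55 Apr 14, 2018 Updated 7 years ago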
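- ChinesePersonRelationGraph, person relationship extraction based on NLP methods. A Chinese person-relationship knowledge graph project covering graph construction, knowledge-base-driven data back-labeling, and work based on distant supervision and bootstrappi… ☆931 Dec 15, 2018 Updated 7 years ago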
- AbstractKnowledgeGraph, a systematic knowledge graph that concentrates on abstract things, including abstract entities and actions. Current scale… ☆248 Aug 6, 2019 Updated 6 years ago
- Chinese NLP datasets, material for everyday experiments. Additions and pull requests are welcome. ☆4,575 Nov 21, 2023 Updated 2 years ago
- Annotation tools for Chinese natural language processing (NLP), working with like-minded contributors to advance Chinese NLP. ☆155 Jun 22, 2018 Updated 7 years ago
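- ChineseSemanticKB, a Chinese semantic knowledge base: 12 categories of commonly used semantic dictionaries at million-entry scale for Chinese processing, including 340,000 abstract-semantics entries, 340,000 antonym entries, and 430,000 synonym entries. Supports applications such as sentence expansion, rewriting, and event abstraction and generalization. ☆780 Mar 17, 2023 Updated 2 years ago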
- Sequential event experiment based on travel notes crawled from XieCheng: collection of 500,000 XieCheng travel notes and construction of a sequential event graph. ☆188 Dec 15, 2018 Updated 7 years ago
- Open Chinese Language Pre-trained Model Zoo ☆984 Mar 18, 2020 Updated 5 years ago
- Corpus of book titles. Also includes some movie and game titles. ☆72 Mar 27, 2024 Updated last year
- A collection of concepts and explicit expression patterns for Chinese compound event extraction, which later evolved into ComplexEventGraph. This project… ☆1,219 Dec 15, 2018 Updated 7 years ago
- 100+ Chinese Word Vectors: over a hundred sets of pre-trained Chinese word vectors ☆12,182 Oct 30, 2023 Updated 2 years ago
- Chinese Environment Emergency Corpus, Semantic Intelligence Laboratory, Shanghai University ☆46 Nov 3, 2015 Updated 10 years ago
- Language Understanding Evaluation benchmark for Chinese: datasets, baselines, pre-trained models, corpus and leaderboard ☆1,786 Feb 18, 2023 Updated 2 years ago
- Bayesian Visual Working Memory in Python. ☆13 Mar 28, 2020 Updated 5 years ago
- An interpretation of general education through information theory, systems theory, and cybernetics ☆12 Jan 16, 2019 Updated 7 years ago
- Text classification, sentence similarity judgment, and POS tagging using the BERT model ☆90 Jan 23, 2019 Updated 7 years ago
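- Text Content Grapher based on key information extraction by NLP methods. Given a document, it extracts the key information, structures it, and organizes it as a graph, presenting the semantic content of the article in graph form. ☆1,453 Oct 20, 2021 Updated 4 years ago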
- A curated list of resources for Chinese NLP ☆7,926 Jul 27, 2023 Updated 2 years ago
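- A personal summary after attending CCL2018 (China National Conference on Computational Linguistics), including a script for downloading the conference papers, downloads of the conference technical reports, and a few personal notes. ☆27 Oct 24, 2018 Updated 7 years ago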
- ChineseAntiword, a Chinese antonym lookup interface based on a dictionary crawled from online resources. ☆60 Aug 26, 2018 Updated 7 years ago
- Chinese Emergency Corpus, Semantic Intelligence Laboratory, Shanghai University ☆720 Sep 26, 2019 Updated 6 years ago
- BIBFRAME Datastore is a Linked-Data project for managing bibliographic records and operational data focused on libraries and other simila… ☆16 Sep 17, 2015 Updated 10 years ago
- Meta-analyses of language acquisition phenomena ☆13 May 28, 2019 Updated 6 years ago
- The bibfra.me vocabulary ☆13 Sep 20, 2022 Updated 3 years ago
- Syntax- and rule-based document sentiment analysis: a demo of document-level sentiment analysis based on dependency syntax rules ☆107 Jun 11, 2019 Updated 6 years ago
- Mines element recognition rules from the CEC corpus and uses them to automatically annotate raw news-report corpora ☆20 May 14, 2015 Updated 10 years ago
- A Chinese corpus annotated with part-of-speech tags ☆208 Aug 5, 2014 Updated 11 years ago
- A simple document topic analysis implementation based on traditional K-means and LDA that achieves decent results… ☆247 Dec 15, 2018 Updated 7 years ago
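- Jiagu, a deep-learning NLP toolkit: knowledge graph relation extraction, Chinese word segmentation, POS tagging, named entity recognition, sentiment analysis, new word discovery, keyword extraction, text summarization, text clustering ☆3,423 May 7, 2022 Updated 3 years ago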
- An experimental desktop client for using Claude Desktop's MCP with Novelcrafter codices. ☆10 Dec 3, 2024 Updated last year
- Consider is a parser for the ThinkGear protocol used by NeuroSky devices (MindSet, BrainBand and others). ☆16 Apr 3, 2012 Updated 13 years ago