liuhuanyong / ChineseEmbedding
Chinese embedding collection for natural language processing, including character (token), pinyin, word, POS-tag, and dependency embeddings: five types of vectors in total.
☆454 Dec 15, 2018 Updated 7 years ago
Alternatives and similar repositories for ChineseEmbedding
Users who are interested in ChineseEmbedding are comparing it to the libraries listed below.
- Self-implemented sentence similarity computation based on Cilin (the Tongyici Cilin synonym thesaurus), HowNet, SimHash fingerprints, character and word vectors, and vector space models (VSM). ☆365 Dec 15, 2018 Updated 7 years ago
- WordMultiSenseDisambiguation, Chinese multi-word-sense disambiguation based on online baike knowledge base and semantic embedding similarit… ☆131 Dec 15, 2018 Updated 7 years ago
- 100+ Chinese Word Vectors: over a hundred sets of pre-trained Chinese word vectors ☆12,182 Oct 30, 2023 Updated 2 years ago
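- ChineseTextualInference, a Chinese textual inference project including Chinese corpus building and an inference model. Covers the translation and construction of an 880,000-item Chinese textual entailment dataset and the construction of a deep-learning-based textual entailment classification model… ☆176 Dec 15, 2018 Updated 7 years ago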
- A collection of Chinese NLP corpora including basic Chinese syntactic word sets, semantic word sets, historical corpora, and evaluation corpora. Chinese natural… ☆450 Dec 16, 2018 Updated 7 years ago
- Baike schema crawler for Baidu Baike and Hudong Baike: a script for crawling their concept classification systems ☆38 Apr 25, 2018 Updated 7 years ago
- Self-implemented word embedding methods: word vector generation using four approaches, namely CBOW, skip-gram, word-document matrix, and word-word matrix ☆187 Apr 26, 2018 Updated 7 years ago
- AbstractKnowledgeGraph, a systematic knowledge graph that concentrates on abstract things, including abstract entities and actions. Abstract knowledge graph, current scale… ☆248 Aug 6, 2019 Updated 6 years ago
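- A word similarity computation method that combines the extended Tongyici Cilin thesaurus with HowNet, giving broader vocabulary coverage and more accurate results. ☆743 Feb 16, 2022 Updated 3 years ago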
- Chinese Named Entity Recognition with IDCNN/biLSTM+CRF, and Relation Extraction with biGRU+2ATT ☆2,266 Feb 1, 2024 Updated 2 years ago
- Open Language Pre-trained Model Zoo ☆1,004 Nov 18, 2021 Updated 4 years ago
- A deep learning framework for natural language processing based on PyTorch and torchtext. ☆836 Dec 14, 2020 Updated 5 years ago
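- Self-implemented Chinese NLP toolkit, providing HMM- and CRF-based interfaces for word segmentation, POS tagging, and named entity recognition, plus a CRF-based dependency parsing interface. ☆55 Apr 14, 2018 Updated 7 years ago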
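- ChinesePersonRelationGraph, person relationship extraction based on NLP methods. A Chinese person-relation knowledge graph project covering construction of the Chinese person-relation graph, knowledge-base-driven data back-labeling, and distant supervision and bootstrappi… ☆931 Dec 15, 2018 Updated 7 years ago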
- Chinese relation extraction based on distant supervision ☆386 May 13, 2021 Updated 4 years ago
- An experiment and demo-level tool for text information extraction (event-triples extraction), which can be a route to the event chain an… ☆933 Nov 26, 2022 Updated 3 years ago
- HyponymyExtraction and Graph: hypernym-hyponym extraction and visualization for words, based on a knowledge concept schema, a baike knowledge base, and structured extraction from online search ☆171 Oct 6, 2018 Updated 7 years ago
- Syntax- and rule-based document sentiment analysis: a demo of document-level sentiment analysis based on dependency syntax rules ☆107 Jun 11, 2019 Updated 6 years ago
- A LITE BERT FOR SELF-SUPERVISED LEARNING OF LANGUAGE REPRESENTATIONS, large-scale pre-trained Chinese ALBERT models ☆3,986 Nov 21, 2022 Updated 3 years ago
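- Natural language processing (NLP): Xiaojiang robot (a retrieval-based chit-chat chatbot), BERT sentence vectors and similarity (Sentence Similarity), XLNet sentence vectors and similarity (text xlnet embedding), text classification, entity extraction (ner,b… ☆1,539 Sep 23, 2021 Updated 4 years ago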
- An off-the-shelf tool for Chinese Keyphrase Extraction: quickly extracts key phrases from Chinese text using only 35 MB of memory. www.jionlp.com ☆556 Nov 21, 2023 Updated 2 years ago
- New word discovery using mutual information and left/right entropy, implemented in Python 3 ☆593 Aug 1, 2019 Updated 6 years ago
- TensorFlow versions of commonly used text matching models; the dataset is QA_corpus; continuously updated ☆674 Oct 12, 2019 Updated 6 years ago
- Text matching models including DSSM, ESIM, ABCNN, and BIMPM; the dataset is the official LCQMC data ☆470 May 8, 2022 Updated 3 years ago
- A deep learning framework for knowledge graphs based on PyTorch and torchtext. ☆614 Jul 7, 2020 Updated 5 years ago
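- Macadam is a natural language processing toolkit built on TensorFlow (Keras) and bert4keras, focused on text classification, sequence labeling, and relation extraction. Supports RANDOM, WORD2VEC, FASTTEXT, BERT, ALBERT, ROBERTA, NEZHA, XLNET, ELECTRA… ☆327 Mar 24, 2023 Updated 2 years ago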
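- Entity and Relation Extraction Based on TensorFlow and BERT. Pipeline-style entity and relation extraction based on TensorFlow and BERT; a solution to the information extraction task of the 2019 Language and Intelligence Challenge. Schema based Knowledge … ☆1,230 Jun 1, 2020 Updated 5 years ago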
- Self-implemented SpellCorrection: query correction based on pinyin similarity and edit distance. ☆84 May 20, 2022 Updated 3 years ago
- A simple document topic analysis implementation based on traditional K-means and LDA that achieves decent results. K-means and LDA model based multi-text… ☆247 Dec 15, 2018 Updated 7 years ago
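- Causality event extraction demo project including causal patterns and experiments on a large-scale corpus. An experimental causal event graph project based on a causal relation knowledge base; this project lists explicit causal expression… ☆428 Dec 15, 2018 Updated 7 years ago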
- Keyphrase or Keyword Extraction: a Chinese keyword extraction method based on pre-trained models (paper: SIFRank: A New Baseline for Unsupervised Keyphrase Extraction Based on Pre-trained La… ☆433 May 17, 2020 Updated 5 years ago
- A review of the top solutions from all NLP competitions; focused on NLP competitions only; continuously updated! ☆2,797 Aug 30, 2025 Updated 5 months ago
- An implementation of the EDA paper for Chinese corpora. An EDA data augmentation tool for Chinese text. NLP data augmentation. Paper reading notes. ☆1,386 May 31, 2022 Updated 3 years ago
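- A Chinese character feature extraction tool that extracts features such as pronunciation (initial, final, tone), glyph structure (components, radicals), and four-corner code, and can also supply them to a model as tensors ☆138 May 25, 2020 Updated 5 years ago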
- CCKS 2019 Chinese short-text entity linking competition: Technical Innovation Award solution ☆412 Mar 24, 2023 Updated 2 years ago
- Self-implemented baike knowledge base infobox extraction by online analysis: structured infobox extraction for entries from Hudong Baike, Baidu Baike, and Sogou Baike, with fusion of the encyclopedia knowledge ☆37 Mar 30, 2018 Updated 7 years ago
- A BERT-based semantic matching model implemented with a pre-trained Chinese model; the dataset is the official LCQMC data ☆198 Dec 19, 2019 Updated 6 years ago
- Large Scale Chinese Corpus for NLP ☆9,858 Feb 6, 2026 Updated last week
- Kashgari is a production-level NLP Transfer learning framework built on top of tf.keras for text-labeling and text-classification, includ… ☆2,387 Sep 3, 2024 Updated last year