Short-text clustering preprocessing module (Short text cluster)
☆281 Dec 28, 2019 Updated 6 years ago
Alternatives and similar repositories for TextCluster
Users that are interested in TextCluster are comparing it to the libraries listed below
- Text clustering (K-means, DBSCAN, LDA, Single-pass) ☆353 May 12, 2021 Updated 4 years ago
- ☆133 Jan 4, 2018 Updated 8 years ago
- An experiment and demo-level tool for text information extraction (event-triples extraction), which can be a route to the event chain an… ☆933 Nov 26, 2022 Updated 3 years ago
- An implementation of the EDA paper for Chinese corpora. An EDA data augmentation tool for Chinese text. NLP data augmentation. Paper reading notes. ☆1,386 May 31, 2022 Updated 3 years ago
- Open Language Pre-trained Model Zoo ☆1,005 Nov 18, 2021 Updated 4 years ago
- Generate sentence vectors with BERT in one line of code; BERT for text classification and text similarity computation ☆1,671 Oct 14, 2019 Updated 6 years ago
- An Open-source Neural Hierarchical Multi-label Text Classification Toolkit ☆1,919 Nov 18, 2025 Updated 3 months ago
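- SinglepassTextCluster, a text clustering tool based on the Single-pass clustering algorithm that uses TF-IDF vectors and doc2vec, which can be used for… ☆65 Sep 4, 2021 Updated 4 years ago
- Text mining and preprocessing tools (text cleaning, new word discovery, sentiment analysis, entity recognition and linking, keyword extraction, knowledge extraction, syntactic parsing, etc.), using unsupervised or weakly supervised methods ☆2,599 May 13, 2024 Updated last year
- bert_avg, bert_whitening, sbert, consert, simcse, esimcse: Chinese sentence vector representations ☆16 Apr 7, 2022 Updated 3 years ago
- Chinese text clustering ☆123 Jun 21, 2022 Updated 3 years ago
- Faster and more effective Chinese new word discovery ☆513 Mar 15, 2024 Updated last year
- Jiagu deep learning NLP toolkit: knowledge graph relation extraction, Chinese word segmentation, part-of-speech tagging, named entity recognition, sentiment analysis, new word discovery, keywords, text summarization, text clustering ☆3,422 May 7, 2022 Updated 3 years ago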
- A Chinese information extraction tool. ☆1,127 Jun 28, 2022 Updated 3 years ago
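- A chatbot for the finance and legal domains (with some chit-chat capability); its main modules include information extraction, NLU, NLG, and a knowledge graph, with a front-end display integrated through Django; RESTful interfaces for NLP and KG have been wrapped so far ☆1,294 Jun 13, 2021 Updated 4 years ago
- Code for a Chinese error detection module, using n-gram and Bi-LSTM ☆135 Mar 31, 2019 Updated 6 years ago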
- WordMultiSenseDisambiguation, Chinese multi-wordsense disambiguation based on online bake knowledge base and semantic embedding similarit… ☆131 Dec 15, 2018 Updated 7 years ago
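- Reinforcement Learning For Dialogue Systems: applications of reinforcement learning in dialogue systems; a summary of papers and open-source applications ☆28 Dec 27, 2019 Updated 6 years ago
- Natural language processing (NLP), Xiaojiang robot (retrieval-based chit-chat chatbot), BERT sentence vectors and similarity (Sentence Similarity), XLNet sentence vectors and similarity (text xlnet embedding), text classification, entity extraction (ner,b… ☆1,539 Sep 23, 2021 Updated 4 years ago
- Chinese synonyms: a toolkit for chatbots and intelligent question answering ☆5,104 Feb 1, 2026 Updated last month
- Chatbot for the music domain using methods including ES and a music KB. A question-answering experiment based on a knowledge base of 140,000 songs; features include lyric chaining, finding songs from known lyrics, and question answering over the song-singer-lyrics triangle of relations. ☆285 Oct 15, 2018 Updated 7 years ago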
- Documentation for Chatstack: A Full Pipeline UI for building Chinese NLU System ☆18 Sep 7, 2019 Updated 6 years ago
- An off-the-shelf tool for Chinese Keyphrase Extraction: a tool for quickly extracting key phrases from Chinese text, using only 35 MB of memory. www.jionlp.com ☆555 Nov 21, 2023 Updated 2 years ago
- Python 3 version of Time-NLP: conversion of Chinese time expressions ☆520 Dec 8, 2022 Updated 3 years ago
- Chinese text summarization toolkit; extractive Chinese text summarization. Extractive text summary of Lead3, keyword, textrank, text teaser, word significance, LDA, LSI, NMF. (gra… ☆419 Jun 17, 2024 Updated last year
- Collections of Chinese reading comprehension datasets ☆221 Dec 19, 2019 Updated 6 years ago
- a bert for retrieval and generation ☆860 Feb 26, 2021 Updated 5 years ago
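- Chinese word segmentation ☆3,211 Jan 16, 2025 Updated last year
- QAmatch (qa_match) / text matching / text classification / text embedding / text clustering / text retrieval (bow/ifidf/ngramtf-df/bert/albert/bm25/…/nn/gbdt/xgb/kmeans/dscan/faiss/….) ☆933 May 1, 2023 Updated 2 years ago
- SiameseSentenceSimilarity, a personal implementation of a similar-sentence detection model based on a Siamese BiLSTM model; training and test datasets are provided. ☆271 Dec 5, 2019 Updated 6 years ago
- A knowledge-graph-based question answering system; BERT is used for named entity recognition and sentence similarity; it has online and outline modes ☆1,473 Dec 16, 2021 Updated 4 years ago
- New word discovery using mutual information and left/right entropy, implemented in Python 3 ☆593 Aug 1, 2019 Updated 6 years ago
- A Chinese NLP Preprocessing & Parsing Package: accurate, efficient, and easy to use. www.jionlp.com ☆3,800 Nov 27, 2025 Updated 3 months ago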
- Text-Similarity Method in PyTorch ☆469 Dec 9, 2018 Updated 7 years ago
- Pre-Training with Whole Word Masking for Chinese BERT (Chinese BERT-wwm model series) ☆10,177 Jul 15, 2025 Updated 7 months ago
- One-click Chinese data augmentation package; NLP data augmentation, BERT data augmentation, EDA: pip install nlpcda ☆1,879 Mar 18, 2025 Updated 11 months ago
- NLP NER datasets video/music/book bio ☆90 Jan 3, 2021 Updated 5 years ago
- Clustering text with BERT ☆58 Jun 22, 2020 Updated 5 years ago
- Chinese text keyword extraction implemented in Python, using three methods: TF-IDF, TextRank, and Word2Vec word clustering. ☆1,149 Jan 16, 2018 Updated 8 years ago