RandyPen/TextCluster

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/RandyPen/TextCluster)

RandyPen / TextCluster

短文本聚类预处理模块 Short text cluster

☆281

Alternatives and similar repositories for TextCluster

Users that are interested in TextCluster are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

murray-z / text_clustering
View on GitHub
文本聚类（Kmeans、DBSCAN、LDA、Single-pass）
☆353May 12, 2021Updated 5 years ago
Edward1Chou / textClustering
View on GitHub
☆133Jan 4, 2018Updated 8 years ago
liuhuanyong / EventTriplesExtraction
View on GitHub
An experiment and demo-level tool for text information extraction (event-triples extraction), which can be a route to the event chain an…
☆928Nov 26, 2022Updated 3 years ago
liuhuanyong / SinglepassTextCluster
View on GitHub
SinglepassTextCluster, an TextCluster tools based on Singlepass cluster algorithm that use tfidf vector and doc2vec，which can be used for…
☆65Sep 4, 2021Updated 4 years ago
zhanlaoban / EDA_NLP_for_Chinese
View on GitHub
An implement of the paper of EDA for Chinese corpus.中文语料的EDA数据增强工具。NLP数据增强。论文阅读笔记。
☆1,383May 31, 2022Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
FesonX / cn-text-classifier
View on GitHub
中文文本聚类
☆124Jun 21, 2022Updated 4 years ago
terrifyzhao / bert-utils
View on GitHub
一行代码使用BERT生成句向量，BERT做文本分类、文本相似度计算
☆1,668Oct 14, 2019Updated 6 years ago
Tencent / NeuralNLP-NeuralClassifier
View on GitHub
An Open-source Neural Hierarchical Multi-label Text Classification Toolkit
☆1,921Nov 18, 2025Updated 8 months ago
ZhuiyiTechnology / pretrained-models
View on GitHub
Open Language Pre-trained Model Zoo
☆1,003Nov 18, 2021Updated 4 years ago
taishan1994 / chinese_sentence_embeddings
View on GitHub
bert_avg，bert_whitening，sbert，consert，simcse，esimcse 中文句向量表示
☆15Apr 7, 2022Updated 4 years ago
hgliyuhao / cluster
View on GitHub
Clustering text with Bert
☆58Jun 22, 2020Updated 6 years ago
blmoistawinde / HarvestText
View on GitHub
文本挖掘和预处理工具（文本清洗、新词发现、情感分析、实体识别链接、关键词抽取、知识抽取、句法分析等），无监督或弱监督方法
☆2,624May 13, 2024Updated 2 years ago
bojone / word-discovery
View on GitHub
速度更快、效果更好的中文新词发现
☆512Mar 15, 2024Updated 2 years ago
ownthink / Jiagu
View on GitHub
Jiagu深度学习自然语言处理工具知识图谱关系抽取中文分词词性标注命名实体识别情感分析新词发现关键词文本摘要文本聚类
☆3,427May 7, 2022Updated 4 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
yongzhuo / nlp_xiaojiang
View on GitHub
自然语言处理（nlp），小姜机器人（闲聊检索式chatbot），BERT句向量-相似度（Sentence Similarity），XLNET句向量-相似度（text xlnet embedding），文本分类（Text classification），实体提取（ner，b…
☆1,535Sep 23, 2021Updated 4 years ago
galesour / BTM
View on GitHub
BTM实现代码
☆101Aug 8, 2022Updated 3 years ago
zedom1 / Error-Detection
View on GitHub
Code for chinese error detection module, using n-gram and bi-lstm
☆136Mar 31, 2019Updated 7 years ago
murray-z / text_analysis_tools
View on GitHub
中文文本分析工具包（包括- 文本分类 - 文本聚类 - 文本相似性 - 关键词抽取 - 关键短语抽取 - 情感分析 - 文本纠错 - 文本摘要 - 主题关键词-同义词、近义词-事件三元组抽取）
☆734Oct 3, 2023Updated 2 years ago
chatopera / Synonyms
View on GitHub
中文近义词：聊天机器人，智能问答工具包
☆5,107Feb 1, 2026Updated 5 months ago
alvations / annotate-questionnaire
View on GitHub
Summary of Responses to Questionnaire on Annotation Platform https://forms.gle/iZk8kehkjAWmB8xe9
☆59Jun 30, 2020Updated 6 years ago
fighting41love / cocoNLP
View on GitHub
A Chinese information extraction tool.
☆1,129Jun 28, 2022Updated 4 years ago
zhanzecheng / Time_NLP
View on GitHub
Time-NLP的python3版本中文时间表达词转换
☆520Dec 8, 2022Updated 3 years ago
liuhuanyong / MusicLyricChatbot
View on GitHub
chatbot based on music region using method including es and music kb.基于14W歌曲知识库的问答尝试，功能包括歌词接龙，已知歌词找歌曲以及歌曲歌手歌词三角关系的问答。
☆293Oct 15, 2018Updated 7 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
dongrixinyu / chinese_keyphrase_extractor
View on GitHub
An off-the-shelf tool for Chinese Keyphrase Extraction 一个快速从中文里抽取关键短语的工具，仅占35M内存 www.jionlp.com
☆554Nov 21, 2023Updated 2 years ago
liuhuanyong / WordMultiSenseDisambiguation
View on GitHub
WordMultiSenseDisambiguation, chinese multi-wordsense disambiguation based on online bake knowledge base and semantic embedding similarit…
☆131Dec 15, 2018Updated 7 years ago
hankcs / pyhanlp
View on GitHub
中文分词
☆3,204Jan 16, 2025Updated last year
yongzhuo / nlg-yongzhuo
View on GitHub
中文文本摘要（text summarization）工具包, 抽取式中文文本摘要 Extractive text summary of Lead3、keyword、textrank、text teaser、word significance、LDA、LSI、NMF。（gra…
☆417Jun 17, 2024Updated 2 years ago
ymcui / Chinese-BERT-wwm
View on GitHub
Pre-Training with Whole Word Masking for Chinese BERT（中文BERT-wwm系列模型）
☆10,224Apr 19, 2026Updated 3 months ago
mpk001 / SentencePairMatch_doc2vec-word2vec
View on GitHub
基于Doc2vec和Word2vec的句子对匹配方法
☆23Jun 3, 2017Updated 9 years ago
charlesXu86 / Chatbot_CN
View on GitHub
基于金融-司法领域(兼有闲聊性质)的聊天机器人，其中的主要模块有信息抽取、NLU、NLG、知识图谱等，并且利用Django整合了前端展示,目前已经封装了nlp和kg的restful接口
☆1,291Jun 13, 2021Updated 5 years ago
rwalk / gsdmm
View on GitHub
GSDMM: Short text clustering
☆359Dec 28, 2022Updated 3 years ago
pengshuang / Text-Similarity
View on GitHub
Text-Similarity Method in Pytorch
☆468Dec 9, 2018Updated 7 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
chatstack-ai / Chatstack-Doc
View on GitHub
Documentation for Chatstack: A Full Pipeline UI for building Chinese NLU System
☆18Sep 7, 2019Updated 6 years ago
MachineLP / TextMatch
View on GitHub
QAmatch(qa_match)/文本匹配/文本分类/文本embedding/文本聚类/文本检索（bow/ifidf/ngramtf-df/bert/albert/bm25/…/nn/gbdt/xgb/kmeans/dscan/faiss/….）
☆931May 1, 2023Updated 3 years ago
abhinavthomas / textclusteringDBSCAN
View on GitHub
Performed document clustering using the DBSCAN clustering algorithm
☆14Oct 21, 2020Updated 5 years ago
AimeeLee77 / keyword_extraction
View on GitHub
利用Python实现中文文本关键词抽取，分别采用TF-IDF、TextRank、Word2Vec词聚类三种方法。
☆1,149Jan 16, 2018Updated 8 years ago
netrookiecn / Reinforcement-Learning-For-Dialogue-Systems
View on GitHub
Reinforcement Learning For Dialogue Systems 强化学习在对话系统中的应用论文或开源应用总结
☆28Dec 27, 2019Updated 6 years ago
WenRichard / KBQA-BERT
View on GitHub
基于知识图谱的问答系统，BERT做命名实体识别和句子相似度，分为online和outline模式
☆1,473Dec 16, 2021Updated 4 years ago
dongrixinyu / JioNLP
View on GitHub
中文 NLP 预处理、解析工具包，准确、高效、易用 A Chinese NLP Preprocessing & Parsing Package www.jionlp.com
☆3,855Jun 5, 2026Updated last month