SinglepassTextCluster, a text clustering tool based on the single-pass clustering algorithm. It has two built-in text vectorization methods, tfidf and doc2vec, and automatically outputs the number of clusters, the document set of each cluster, and the cluster sizes. It is intended for clustering tasks on your own real-time corpus.
☆65 · Sep 4, 2021 · Updated 4 years ago
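As a rough illustration of the single-pass idea described above, here is a minimal sketch. It is not the repository's code: the function names, the whitespace tokenization, the 0.3 similarity threshold, and computing idf over the whole batch up front are all assumptions made for the example, and only the tfidf variant is shown (no doc2vec).

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Build sparse tf-idf vectors (dicts) from whitespace-tokenized docs."""
    tokenized = [doc.lower().split() for doc in docs]
    n = len(tokenized)
    # Document frequency: in how many docs each term appears.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    vectors = []
    for tokens in tokenized:
        tf = Counter(tokens)
        # Smoothed idf, so a term found in every doc keeps a nonzero weight.
        vectors.append(
            {t: c * (math.log((1 + n) / (1 + df[t])) + 1) for t, c in tf.items()}
        )
    return vectors


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def single_pass(docs, threshold=0.3):
    """Visit each doc once: join the most similar cluster or open a new one."""
    clusters = []  # each cluster: {"centroid": dict, "members": [doc indices]}
    for i, vec in enumerate(tfidf_vectors(docs)):
        best, best_sim = None, 0.0
        for cluster in clusters:
            sim = cosine(vec, cluster["centroid"])
            if sim > best_sim:
                best, best_sim = cluster, sim
        if best is not None and best_sim >= threshold:
            best["members"].append(i)
            # Keep the centroid as a running sum; cosine ignores the scale.
            for t, w in vec.items():
                best["centroid"][t] = best["centroid"].get(t, 0.0) + w
        else:
            clusters.append({"centroid": dict(vec), "members": [i]})
    return clusters
```

Calling `single_pass(docs)` on a list of strings returns the clusters in creation order, so the number of clusters is `len(clusters)` and each cluster's size is `len(cluster["members"])`. The result depends on document order and on the threshold, which is typical of single-pass clustering.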
Alternatives and similar repositories for SinglepassTextCluster
Users that are interested in SinglepassTextCluster are comparing it to the libraries listed below
- DescriptionPairsExtraction, a program that extracts pairs of entities and their descriptions, based on Albert and back-annotation of structured data… ☆20 · Mar 7, 2022 · Updated 3 years ago
- schemakg, a knowledge graph for schema that seeks to cover a range of things as much as possible including entity schema and event schema… ☆32 · Apr 27, 2021 · Updated 4 years ago
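- Personal summary after attending CCL2018 (China National Conference on Computational Linguistics), including a script for downloading the conference papers, downloads of the conference's frontier-technology reports, and a few personal notes. ☆27 · Oct 24, 2018 · Updated 7 years ago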
- Text clustering (Kmeans, DBSCAN, LDA, Single-pass) ☆353 · May 12, 2021 · Updated 4 years ago
- Self-implemented WeiboIndexSpyder based on Selenium. Collects the Sina Weibo Index (Micro Index), including the overall index, the mobile index, and the PC index. ☆31 · May 29, 2018 · Updated 7 years ago
- IdealWordCloudKit, a toolkit for image-shaped word clouds built from plain text, local files, or web articles. Aimed at local files, online web pages, programs… ☆41 · Jan 26, 2019 · Updated 7 years ago
- Chinese as a foreign language. ☆14 · Nov 12, 2025 · Updated 3 months ago
- Code for Evaluating Explanations for Reading Comprehension with Realistic Counterfactuals. ☆18 · Apr 25, 2021 · Updated 4 years ago
- Future event prediction demo based on a causal event graph covering all industries, which can predict the benefits or bad effects in acc… ☆70 · Mar 29, 2019 · Updated 6 years ago
- Short-text clustering preprocessing module (Short text cluster) ☆281 · Dec 28, 2019 · Updated 6 years ago
- ☆44 · Jun 8, 2022 · Updated 3 years ago
- self labeling conditional variational auto encoder ☆19 · May 28, 2019 · Updated 6 years ago
- ☆18 · May 1, 2023 · Updated 2 years ago
- The code for EMNLP2022 paper "Improved grammatical error correction by ranking elementary edits" ☆20 · Dec 14, 2022 · Updated 3 years ago
- Trains a simple machine-learning model to decide whether a sentence is a question. ☆20 · May 9, 2022 · Updated 3 years ago
- Self-implemented SpellCorrection: query correction based on pinyin similarity and edit distance. ☆84 · May 20, 2022 · Updated 3 years ago
- Seq2seqAttGeneration, a basic implementation of text generation that uses a seq2seq attention model to generate poem series. This project… ☆18 · Jan 11, 2021 · Updated 5 years ago
- Service package for rasa_chinese ☆18 · Jun 17, 2021 · Updated 4 years ago
- Data augmentation demo for NLP ☆48 · Feb 28, 2020 · Updated 6 years ago
- Chinese subjectivity detection based on a subjectivity knowledge base: a method for rating sentence subjectivity using a Chinese subjectivity knowledge base. ☆59 · Sep 7, 2023 · Updated 2 years ago
- Multi-turn question answering mechanism based on a knowledge graph ☆25 · Sep 25, 2018 · Updated 7 years ago
- ☆220 · Dec 8, 2022 · Updated 3 years ago
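- An exploration of Eventline (ranking of important news organized by publication time). For the set of news reports on a given event topic, it uses the docrank algorithm to identify the importance of each report and uses the reports' publication times to pick out the important… ☆226 · Oct 7, 2018 · Updated 7 years ago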
- Implementations of various methods: text clustering, tfidf, lda, doc2vec+kmeans, and others ☆23 · Jan 17, 2020 · Updated 6 years ago
- ☆57 · Dec 18, 2022 · Updated 3 years ago
- ☆59 · Feb 28, 2019 · Updated 7 years ago
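- labelit, a labeling tool with active learning for classification tasks. It labels automatically based on active learning, learning while labeling to reduce the manual annotation workload. ☆31 · Dec 9, 2022 · Updated 3 years ago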
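- NER entity recognition model: fast, efficient, and simple, with one-click Docker deployment for serving the model. Recognizes address, person-name, and organization-name entities. ☆36 · Jul 26, 2023 · Updated 2 years ago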
- Chinese address segmentation and address completion ☆34 · Feb 15, 2019 · Updated 7 years ago
- Pytorch implementation of Supporting Clustering with Contrastive Learning, NAACL 2021 ☆306 · Oct 23, 2023 · Updated 2 years ago
- a bert for retrieval and generation ☆860 · Feb 26, 2021 · Updated 5 years ago
- Uses tensorflow to implement several NLP projects, e.g. classification, chatbot, ner, attention, QA, etc. ☆33 · Oct 15, 2020 · Updated 5 years ago
- Research code for "What to Pre-Train on? Efficient Intermediate Task Selection", EMNLP 2021 ☆37 · Dec 21, 2021 · Updated 4 years ago
- baike schema crawler for baidu baike and hudongbaike: a script that crawls the concept taxonomies of Baidu Baike and Hudong Baike. ☆38 · Apr 25, 2018 · Updated 7 years ago
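- Self-implemented info-box extraction for baike knowledge bases via online analysis: structured infobox extraction from entries on Hudong Baike, Baidu Baike, and Sogou Baike, with fusion of the encyclopedia knowledge. ☆37 · Mar 30, 2018 · Updated 7 years ago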
- Official implementation of the paper "ALTER: Augmentation for Large-Table-Based Reasoning" ☆15 · Aug 26, 2024 · Updated last year
- Official Implementation for "Platypose: Calibrated Zero-Shot Multi-Hypothesis 3D Human Motion Estimation" ☆14 · May 6, 2025 · Updated 9 months ago
- KuaiSearch PERKS ☆12 · Nov 16, 2021 · Updated 4 years ago
- basic algorithm ☆11 · Nov 28, 2020 · Updated 5 years ago