bojone / word-discoveryView external linksLinks
速度更快、效果更好的中文新词发现
☆513Mar 15, 2024Updated last year
Alternatives and similar repositories for word-discovery
Users that are interested in word-discovery are comparing it to the libraries listed below
Sorting:
- python3实现互信息和左右熵的新词发现☆593Aug 1, 2019Updated 6 years ago
- 新词发现 基于词频、凝聚系数和左右邻接信息熵☆122Mar 14, 2020Updated 5 years ago
- 自动构建中文词库:http://www.matrix67.com/blog/archives/5044☆655Dec 5, 2023Updated 2 years ago
- A LITE BERT FOR SELF-SUPERVISED LEARNING OF LANGUAGE REPRESENTATIONS, 海量中文预训练ALBERT模型☆3,986Nov 21, 2022Updated 3 years ago
- 开天-新词,中文新词发现工具,Chinese New Word Discovery Tool☆22Dec 5, 2019Updated 6 years ago
- 2019年百度的三元组抽取比赛,“科学空间队”源码☆768May 16, 2020Updated 5 years ago
- 专注于可解释的NLP技术 An NLP Toolset With A Focus on Explainable Inference☆622Feb 3, 2021Updated 5 years ago
- Open Language Pre-trained Model Zoo☆1,004Nov 18, 2021Updated 4 years ago
- a bert for retrieval and generation☆860Feb 26, 2021Updated 4 years ago
- Chinese word segmentation algorithm without corpus(无需语料库的中文分词)☆500Sep 3, 2020Updated 5 years ago
- 2019-SOTA简繁中文拼写检查工具:FASPell Chinese Spell Checker (Chinese Spell Check / 中文拼写检错 / 中文拼写纠错 / 中文拼写检查)☆1,224Sep 3, 2022Updated 3 years ago
- pycorrector is a toolkit for text error correction. 文本纠错,实现了Kenlm,T5,MacBERT,ChatGLM3,Qwen2.5等模型应用在纠错场景,开箱即用。☆6,368Jan 12, 2026Updated last month
- 新词发现算法(NewWordDetection)☆92Mar 22, 2021Updated 4 years ago
- Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo☆3,106May 9, 2024Updated last year
- 基于 最小熵原理的NLP工具包☆139Jan 14, 2022Updated 4 years ago
- 新词发现算法(NewWordDetection)☆63Sep 4, 2017Updated 8 years ago
- RoBERTa中文预训练模型: RoBERTa for Chinese☆2,773Jul 22, 2024Updated last year
- keras implement of transformers for humans☆5,420Nov 11, 2024Updated last year
- Keyphrase or Keyword Extraction 基于预训练模型的中文关键词抽取方法(论文SIFRank: A New Baseline for Unsupervised Keyphrase Extraction Based on Pre-trained La…☆433May 17, 2020Updated 5 years ago
- 大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP☆9,858Feb 6, 2026Updated last week
- 高质量中文预训练模型集合:最先进大模型、最快小模型、相似度专门模型☆816Jul 8, 2020Updated 5 years ago
- Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.☆3,155Jan 22, 2024Updated 2 years ago
- Pre-Training with Whole Word Masking for Chinese BERT(中文BERT-wwm系列模型)☆10,173Jul 15, 2025Updated 7 months ago
- Open Chinese Language Pre-trained Model Zoo☆984Mar 18, 2020Updated 5 years ago
- Language Understanding Evaluation benchmark for Chinese: datasets, baselines, pre-trained models,corpus and leaderboard☆1,786Feb 18, 2023Updated 2 years ago
- 文本挖掘和预处理工具(文本清洗、新词发现、情感分析、实体识别链接、关键词抽取、知识抽取、句法分析等),无监督或弱监督方法☆2,598May 13, 2024Updated last year
- Jiagu深度学习自然语言处理工具 知识图谱关系抽取 中文分词 词性标注 命名实体识 别 情感分析 新词发现 关键词 文本摘要 文本聚类☆3,422May 7, 2022Updated 3 years ago
- 自然语言处理工具Macropodus,基于Albert+BiLSTM+CRF深度学习网络架构,中文分词,词性标注,命名实体识别,新词发现,关键词,文本摘要,文本相似度,科学计算器,中文数字阿拉伯数字(罗马数字)转换,中文繁简转换,拼音转换。tookit(tool) of N…☆663Mar 24, 2023Updated 2 years ago
- ccks baidu entity link 实体链接 第一名☆843Dec 19, 2023Updated 2 years ago
- An off-the-shelf tool for Chinese Keyphrase Extraction 一个快速从中文里抽取关键短语的工具,仅占35M内存 www.jionlp.com☆556Nov 21, 2023Updated 2 years ago
- Tensorflow solution of NER task Using BiLSTM-CRF model with Google BERT Fine-tuning And private Server services☆4,899Feb 24, 2021Updated 4 years ago
- 中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard☆4,230Feb 6, 2026Updated last week
- a beautiful method for cluster or community detection☆52Oct 19, 2019Updated 6 years ago
- Pre-trained Chinese ELECTRA(中文ELECTRA预训练模型)☆1,439Jul 15, 2025Updated 7 months ago
- Pre-Trained Chinese XLNet(中文XLNet预训练模型)☆1,649Jul 15, 2025Updated 7 months ago
- keras implement of dgcnn for reading comprehension☆164Oct 14, 2019Updated 6 years ago
- 一键中文数据增强包 ; NLP数据增强、bert数据增强、EDA:pip install nlpcda☆1,879Mar 18, 2025Updated 10 months ago
- 中文自然语言处理数据集,平时做做实验的材料。欢迎补充提交合并。☆4,576Nov 21, 2023Updated 2 years ago
- ☆15Mar 19, 2017Updated 8 years ago