Faster, more effective Chinese new word discovery
☆513 · Mar 15, 2024 · Updated 2 years ago
Alternatives and similar repositories for word-discovery
Users that are interested in word-discovery are comparing it to the libraries listed below.
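- New word discovery using mutual information and left/right entropy, implemented in Python 3 ☆592 · Aug 1, 2019 · Updated 6 years ago
- New word discovery based on word frequency, cohesion coefficient, and left/right adjacency entropy ☆122 · Mar 14, 2020 · Updated 6 years ago
- 开天-新词 (Kaitian-Xinci), a Chinese New Word Discovery Tool ☆22 · Dec 5, 2019 · Updated 6 years ago
- Automatically build a Chinese lexicon: http://www.matrix67.com/blog/archives/5044 ☆656 · Dec 5, 2023 · Updated 2 years ago
- A LITE BERT FOR SELF-SUPERVISED LEARNING OF LANGUAGE REPRESENTATIONS, large-scale Chinese pre-trained ALBERT models ☆3,983 · Nov 21, 2022 · Updated 3 years ago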
- An NLP Toolset With A Focus on Explainable Inference ☆621 · Feb 3, 2021 · Updated 5 years ago
- a bert for retrieval and generation ☆859 · Feb 26, 2021 · Updated 5 years ago
- Open Language Pre-trained Model Zoo ☆1,005 · Nov 18, 2021 · Updated 4 years ago
- Source code of the "科学空间队" (Scientific Spaces team) for Baidu's 2019 triple extraction competition ☆769 · May 16, 2020 · Updated 5 years ago
- New word discovery algorithm (NewWordDetection) ☆63 · Sep 4, 2017 · Updated 8 years ago
- pycorrector is a toolkit for text error correction. It applies models such as Kenlm, T5, MacBERT, ChatGLM3, and Qwen2.5 to error correction and works out of the box. ☆6,405 · Jan 12, 2026 · Updated 2 months ago
- 2019 SOTA spell checker for Simplified and Traditional Chinese: FASPell Chinese Spell Checker (Chinese spelling error detection / correction / checking) ☆1,225 · Sep 3, 2022 · Updated 3 years ago
- Chinese word segmentation algorithm without corpus ☆500 · Sep 3, 2020 · Updated 5 years ago
- Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo ☆3,105 · May 9, 2024 · Updated last year
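- New word discovery algorithm (NewWordDetection) ☆92 · Mar 22, 2021 · Updated 5 years ago
- NLP toolkit based on the minimum entropy principle ☆139 · Jan 14, 2022 · Updated 4 years ago
- Chinese pre-trained RoBERTa models: RoBERTa for Chinese ☆2,775 · Jul 22, 2024 · Updated last year
- Large Scale Chinese Corpus for NLP ☆9,872 · Feb 6, 2026 · Updated last month
- Keras implementation of transformers for humans ☆5,424 · Nov 11, 2024 · Updated last year
- Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab. ☆3,157 · Jan 22, 2024 · Updated 2 years ago
- Pre-Training with Whole Word Masking for Chinese BERT (Chinese BERT-wwm model series) ☆10,179 · Jul 15, 2025 · Updated 8 months ago
- Macropodus, a natural language processing tool built on an Albert+BiLSTM+CRF deep learning architecture: Chinese word segmentation, part-of-speech tagging, named entity recognition, new word discovery, keywords, text summarization, text similarity, scientific calculator, conversion between Chinese numerals and Arabic (Roman) numerals, Traditional/Simplified Chinese conversion, pinyin conversion. toolkit (tool) of N… ☆663 · Mar 24, 2023 · Updated 3 years ago
- Collection of high-quality Chinese pre-trained models: state-of-the-art large models, fastest small models, and specialized similarity models ☆816 · Jul 8, 2020 · Updated 5 years ago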
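- New word discovery ☆66 · May 30, 2014 · Updated 11 years ago
- Keyphrase or Keyword Extraction: a Chinese keyword extraction method based on pre-trained models (paper: SIFRank: A New Baseline for Unsupervised Keyphrase Extraction Based on Pre-trained La… ☆432 · May 17, 2020 · Updated 5 years ago
- Jiagu deep learning NLP tool: knowledge graph relation extraction, Chinese word segmentation, part-of-speech tagging, named entity recognition, sentiment analysis, new word discovery, keywords, text summarization, text clustering ☆3,420 · May 7, 2022 · Updated 3 years ago
- Tensorflow solution of NER task Using BiLSTM-CRF model with Google BERT Fine-tuning And private Server services ☆4,903 · Feb 24, 2021 · Updated 5 years ago
- python | Efficient use of the statistical language model kenlm: new word discovery, word segmentation, intelligent error correction, and more ☆170 · Sep 27, 2019 · Updated 6 years ago
- Language Understanding Evaluation benchmark for Chinese: datasets, baselines, pre-trained models, corpus and leaderboard ☆1,786 · Feb 18, 2023 · Updated 3 years ago
- Automated Phrase Mining from Massive Text Corpora in Python. ☆174 · May 23, 2021 · Updated 4 years ago
- Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard ☆4,238 · Feb 6, 2026 · Updated last month
- Text mining and preprocessing tools (text cleaning, new word discovery, sentiment analysis, entity recognition and linking, keyword extraction, knowledge extraction, syntactic parsing, etc.), using unsupervised or weakly supervised methods ☆2,603 · May 13, 2024 · Updated last year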
- a beautiful method for cluster or community detection ☆52 · Oct 19, 2019 · Updated 6 years ago
- Pre-Trained Chinese XLNet ☆1,648 · Jul 15, 2025 · Updated 8 months ago
- Pre-trained Chinese ELECTRA ☆1,440 · Jul 15, 2025 · Updated 8 months ago
- ccks baidu entity link: entity linking, first place ☆842 · Dec 19, 2023 · Updated 2 years ago
- Domain-specific lexicon construction / Chinese new word discovery / domain lexicon discovery ☆31 · Jan 10, 2020 · Updated 6 years ago
- Open Chinese Language Pre-trained Model Zoo ☆984 · Mar 18, 2020 · Updated 6 years ago
- One-click Chinese data augmentation package; NLP data augmentation, BERT data augmentation, EDA: pip install nlpcda ☆1,880 · Mar 18, 2025 · Updated last year