python | 高效使用统计语言模型kenlm:新词发现、分词、智能纠错等
☆172Sep 27, 2019Updated 6 years ago
Alternatives and similar repositories for py-kenlm-model
Users that are interested in py-kenlm-model are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- kenlm语言模型,并提供python的rest服务☆30Aug 1, 2018Updated 7 years ago
- KenLM: Faster and Smaller Language Model Queries☆2,755Mar 30, 2025Updated last year
- 新词发现/新词挖掘/自由度/凝固度/python3☆10May 28, 2019Updated 6 years ago
- 根据维基百科历史编辑数据提取纠错语料。☆12Apr 6, 2022Updated 4 years ago
- 速度更快、效果更好的中文新词发现☆512Mar 15, 2024Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- pycorrector is a toolkit for text error correction. 文本纠错,实现了Kenlm,T5,MacBERT,ChatGLM3,Qwen2.5等模型应用在纠错场景,开箱即用。☆6,427Jan 12, 2026Updated 3 months ago
- SpellGCN☆251Feb 28, 2021Updated 5 years ago
- 基于NER的文本纠错☆15Dec 27, 2023Updated 2 years ago
- CTC+Beam_Search+kenlm 是用于以汉字为声学模型建模单元的解码系统☆48Jun 27, 2018Updated 7 years ago
- A No-Recurrence Sequence-to-Sequence Model for Speech Recognition☆378Jul 21, 2022Updated 3 years ago
- 关键词抽取技术☆18Sep 11, 2019Updated 6 years ago
- 2019-SOTA简繁中文拼写检查工具:FASPell Chinese Spell Checker (Chinese Spell Check / 中文拼写检错 / 中文拼写纠错 / 中文拼写检查)☆1,224Sep 3, 2022Updated 3 years ago
- reformer-pytorch中文版本,简单高效的生成模型。类似GPT2的效果☆16Jun 12, 2023Updated 2 years ago
- Code for paper "Cross-lingual Transfer for Text Classification with Dictionary-based Heterogeneous Graph", EMNLP 2021 - findings.☆13Dec 14, 2021Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Code for NeurIPS 2022 Spotlight paper " Non-Monotonic Latent Alignments for CTC-Based Non-Autoregressive Machine Translation"☆20Nov 16, 2022Updated 3 years ago
- 简洁易用版TinyBert:基于Bert进行知识蒸馏的预训练语言模型☆272Oct 24, 2020Updated 5 years ago
- 2018 高校算法大赛神策杯第五名解决方案☆18Oct 22, 2018Updated 7 years ago
- Mirror of SRILM☆59Aug 11, 2020Updated 5 years ago
- python3实现互信息和左右熵的新词发现☆592Aug 1, 2019Updated 6 years ago
- machine translation data process tools☆10Apr 29, 2024Updated last year
- Gradient accumulation on tf.estimator☆12Dec 15, 2020Updated 5 years ago
- spark,NLP,新词发现,自然语言处理☆23Mar 16, 2018Updated 8 years ago
- 李傲龍的博客☆82Jul 17, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Python implementation of CTC beam search decoder + agnostic LM scorer☆20Dec 16, 2020Updated 5 years ago
- 基于规则的文本纠错系统。☆121Jul 14, 2021Updated 4 years ago
- TensorRT☆11Sep 22, 2020Updated 5 years ago
- 📦 快速转化「中文数字」和「阿拉伯数字」~ (最新特性:分数,日期、温度等转化)☆758Dec 21, 2024Updated last year
- 闲聊机器人☆11Aug 12, 2020Updated 5 years ago
- 基于bert进行中文文本纠错☆242Jun 12, 2023Updated 2 years ago
- TextRank的简单实现☆10Nov 12, 2020Updated 5 years ago
- MuCGEC中文纠错数据集及文本纠错SOTA模型开源;Code & Data for our NAACL 2022 Paper "MuCGEC: a Multi-Reference Multi-Source Evaluation Dataset for Chinese Gr…☆566Jun 9, 2023Updated 2 years ago
- Training an n-gram based Language Model using KenLM toolkit for Deep Speech 2☆116May 20, 2019Updated 6 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Code for chinese error detection module, using n-gram and bi-lstm☆135Mar 31, 2019Updated 7 years ago
- 用python比较两个字符串差异,高亮差异部分☆27Jul 20, 2020Updated 5 years ago
- PromptCLUE, 全中文任务支持零样本学习模型☆665Jun 16, 2023Updated 2 years ago
- 中文新词发现算法PNW算法,可以识别任意长度的新词。☆16Jul 6, 2023Updated 2 years ago
- gensim-fast2vec改造、灵活使用大规模外部词向量(具备OOV查询能力)☆23Jun 3, 2019Updated 6 years ago
- A python module that convert chinese written string to read string. 一个python包:将中文书面字符串转换为口语字符串。☆124Oct 8, 2019Updated 6 years ago
- Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.☆3,157Jan 22, 2024Updated 2 years ago