📦 快速转化「中文数字」和「阿拉伯数字」~ (最新特性:分数,日期、温度等转化)
☆756Dec 21, 2024Updated last year
Alternatives and similar repositories for cn2an
Users that are interested in cn2an are comparing it to the libraries listed below
Sorting:
- 最好的汉字数字(中文数字)-阿拉伯数字转换工具。包含"点二八","负百分之四十"等众多汉语表达方法。NLP,机器人工程必备! The Best Tool of Chinese Number to Digits☆371Mar 26, 2023Updated 2 years ago
- Time-NLP的python3版本 中文时间表达词转换☆520Dec 8, 2022Updated 3 years ago
- Text Normalization & Inverse Text Normalization☆727Updated this week
- pycorrector is a toolkit for text error correction. 文本纠错,实现了Kenlm,T5,MacBERT,ChatGLM3,Qwen2.5等模型应用在纠错场景,开箱即用。☆6,385Jan 12, 2026Updated last month
- Chinese text normalization for speech processing☆721Mar 18, 2023Updated 2 years ago
- 🔥 专注于中文的「自然语言处理框架」:中文分词;平衡类别;数据集划分...☆12Nov 14, 2020Updated 5 years ago
- A Neural Grapheme-to-Phoneme Conversion Package for Mandarin Chinese Based on a New Open Benchmark Dataset☆361Dec 24, 2021Updated 4 years ago
- 一键中文数据增强包 ; NLP数据增强、bert数据增强、EDA:pip install nlpcda☆1,879Mar 18, 2025Updated 11 months ago
- 汉字转拼音(pypinyin)☆5,263Feb 15, 2026Updated 2 weeks ago
- Chinese Mandarin Grapheme-to-Phoneme Converter. 中文轉注音或拼音 (INTERSPEECH 2022)☆380Jun 21, 2025Updated 8 months ago
- 中文近义词:聊天机器人,智能问答工具包☆5,104Feb 1, 2026Updated last month
- g2pC: A Context-aware Grapheme-to-Phoneme Conversion module for Chinese☆243Jul 10, 2019Updated 6 years ago
- ☆76Mar 18, 2022Updated 3 years ago
- Pronunciation lexicon covering both English and Chinese languages for Automatic Speech Recognition.☆262Oct 11, 2019Updated 6 years ago
- DaCiDian is an open-sourced chinese mandarin lexicon for automatic speech recognition(ASR)☆301Jun 15, 2020Updated 5 years ago
- 基于“音形码”的中文字符串相似度计算方法☆227Jul 24, 2020Updated 5 years ago
- 速度更快、效果更好的中文新词发现☆513Mar 15, 2024Updated last year
- 中文 NLP 预处理、解析工具包,准确、高效、易用 A Chinese NLP Preprocessing & Parsing Package www.jionlp.com☆3,800Nov 27, 2025Updated 3 months ago
- Pre-Training with Whole Word Masking for Chinese BERT(中文BERT-wwm系列模型)☆10,177Jul 15, 2025Updated 7 months ago
- 大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP☆9,862Feb 6, 2026Updated 3 weeks ago
- A LITE BERT FOR SELF-SUPERVISED LEARNING OF LANGUAGE REPRESENTATIONS, 海量中文预训练ALBERT模型☆3,984Nov 21, 2022Updated 3 years ago
- self complement of Sentence Similarity compute based on cilin, hownet, simhash, wordvector,vsm models,基于同义词词林,知网,指纹,字词向量,向量空间模型的句子相似度计算。☆365Dec 15, 2018Updated 7 years ago
- 汉字字符特征提取器 (featurizer),提取汉字的特征(发音特征、字形特征)用做深度学习的特征 | A Chinese character feature extractor, which extracts the features of Chinese charac…☆299Dec 29, 2025Updated 2 months ago
- 百度开源的依存句法分析系统☆1,003Feb 5, 2023Updated 3 years ago
- python3实现互信息和左右熵的新词发现☆593Aug 1, 2019Updated 6 years ago
- Use C Api and Swig to Speed up jieba 高效的中文分词库☆640Aug 27, 2021Updated 4 years ago
- 中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard☆4,232Feb 6, 2026Updated 3 weeks ago
- RoBERTa中文预训练模型: RoBERTa for Chinese☆2,773Jul 22, 2024Updated last year
- 复盘所有NLP比赛的TOP方案,只关注NLP比赛,持续更新中!☆2,796Aug 30, 2025Updated 6 months ago
- ChineseSemanticKB,chinese semantic knowledge base, 面向中文处理的12类、百万规模的语义常用词典,包括34万抽象语义库、34万反义语义库、43万同义语义库等,可支持句子扩展、转写、事件抽象与泛化等多种应用场景。☆779Mar 17, 2023Updated 2 years ago
- Datasets, SOTA results of every fields of Chinese NLP☆1,812Apr 7, 2022Updated 3 years ago
- speech-aligner,是一个从“人声语音”及其“语言文本”,产生音素级别时间对齐标注的工具。speech-aligner, is a tool that generate phoneme-level alignment between human speech an…☆410Apr 8, 2020Updated 5 years ago
- 基于金融-司法领域(兼有闲聊性质)的聊天机器人,其中的主要模块有信息抽取、NLU、NLG、知识图谱等,并且利用Django整合了前端展示,目前已经封装了nlp和kg的restful接口☆1,294Jun 13, 2021Updated 4 years ago
- chinese speech pretrained models☆1,191Aug 23, 2024Updated last year
- Production First and Production Ready End-to-End Text-to-Speech Toolkit☆416Nov 20, 2025Updated 3 months ago
- A Large-Scale Chinese Cross-Domain Task-Oriented Dialogue Dataset☆711Jun 17, 2024Updated last year
- 中文自然语言处理数据集,平时做做实验的材料。欢迎补充提交合并。☆4,575Nov 21, 2023Updated 2 years ago
- TTS FrontEnd DataSet: Polyphone / Prosody / TextNormalization☆103Feb 5, 2024Updated 2 years ago
- Keyphrase or Keyword Extraction 基于预训练模型的中文关键词抽取方法(论文SIFRank: A New Baseline for Unsupervised Keyphrase Extraction Based on Pre-trained La…☆433May 17, 2020Updated 5 years ago