zachary822 / chinese-converter
Converts between traditional and simplified Chinese
☆30Updated 5 months ago
Alternatives and similar repositories for chinese-converter:
Users who are interested in chinese-converter are comparing it to the libraries listed below
- Python module that identifies Chinese text as being Simplified or Traditional☆89Updated 3 months ago
- ☆33Updated 8 months ago
- ☆44Updated 2 years ago
- Identification and conversion functions for Chinese text processing☆59Updated 3 months ago
- 渊 - A project for Classical Chinese☆97Updated 2 years ago
- ☆28Updated 3 months ago
- Multilingual sentence alignment using sentence embeddings☆108Updated 3 months ago
- Hong Kong Cantonese Corpus of transcribed speech (spontaneous speech, radio programmes and a monologue).☆52Updated 11 months ago
- repo for Tibetan corpora☆21Updated last year
- Convert Chinese text to Pinyin or Jyutping☆26Updated last year
- Mapping table of Traditional and Simplified Chinese characters☆40Updated last month
- Super-fast C++ implementation of longest common subsequence/substring☆66Updated last year
- Estimate the phonetic distance between Chinese words and get similar sounding candidate words.☆36Updated last year
- The pkuseg toolkit for multi-domain Chinese word segmentation☆57Updated 5 months ago
- Tokenizer, POS-tagger, and dependency parser with BERT/RoBERTa/DeBERTa/GPT models for Japanese and other languages☆51Updated last month
- Tokenizer, POS-tagger, and dependency parser for Classical Chinese☆65Updated 3 months ago
- An open-access corpus of conversational bilingual speech in Cantonese and English☆40Updated 2 years ago
- Workshop on Language Technologies for Historical and Ancient Language… (https://circse.github.io/LT4HALA/)☆33Updated 8 months ago
- Classical Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard☆47Updated last year
- Dictionary for Cantonese word segmentation☆34Updated 8 months ago
- A python wrapper for Stanford CoreNLP, simple and customizable.☆13Updated 3 years ago
- A CWN Python binding with graph structure☆27Updated last year
- A sentence segmentation library with wide language support optimized for speed and utility.☆58Updated 5 months ago
- The spoken L1 corpus represents present-day spoken Chinese (Putonghua) used in mainland China, which is designed as a comparable corpus t…☆18Updated 3 years ago
- This packages up data for the Open Multilingual Wordnet☆45Updated last week
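- Parallel corpus for translation between Classical Chinese and Modern Chinese☆100Updated 3 years ago
- A Python toolkit for word segmentation of ancient texts written in Traditional Chinese☆32Updated 3 years ago
- fastlangid, the only language identification package that supports Cantonese (zh-yue), Simplified (zh-hans) and Traditional Chinese (zh-ha…☆39Updated 2 years ago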
- Package for inference for punctuation, true-casing, and sentence boundary detection☆24Updated 8 months ago
- Hanzipy is a Chinese character and NLP module for Chinese language processing in Python. It is primarily written to help provide a frame…☆18Updated last year