tsroten/hanzidentifier

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/tsroten/hanzidentifier)

tsroten / hanzidentifier

Python module that identifies Chinese text as being Simplified or Traditional

☆105

Alternatives and similar repositories for hanzidentifier

Users that are interested in hanzidentifier are comparing it to the libraries listed below

Sorting:

lm-pub-quiz / lm-pub-quiz
View on GitHub
Evaluate language models using multiple choice items
☆13Mar 6, 2026Updated 2 weeks ago
mementoweb / py-memento-client
View on GitHub
A Memento Client Library in Python
☆27Mar 5, 2018Updated 8 years ago
nsu-ai-team / conv1d-text-vae
View on GitHub
A variational autoencoder for text processing using 1D convolutions and the FastText word embeddings
☆12Dec 11, 2022Updated 3 years ago
tlyu0419 / FindAndMeetKOLs
View on GitHub
Find and Meet KOLs on Facebook Taiwan president election Analisis by SNA and NLP
☆12Dec 29, 2020Updated 5 years ago
erlcssont29i / Expanded-knowledge-for-data-analysis
View on GitHub
☆11May 26, 2021Updated 4 years ago
Katsumata420 / wikihow_japanese
View on GitHub
☆35Dec 17, 2020Updated 5 years ago
KorAP / Koral
View on GitHub
Translation of query languages to serialized KoralQuery protocol
☆14Mar 9, 2026Updated last week
codeclou-archive / docker-nodejs-chrome-xvfb
View on GitHub
docker image to build node.js based projects and to able to run headless chrome
☆11Sep 2, 2019Updated 6 years ago
Makisuo / pglite-drizzle
View on GitHub
☆14Mar 9, 2025Updated last year
JasperVanDenBosch / fexpect
View on GitHub
extension for fabric to handle prompts through pexpect
☆44May 31, 2015Updated 10 years ago
lfoppiano / material-parsers
View on GitHub
Material parsers and other tools, scripts Initially developed for Grobid Superconductor
☆13Feb 21, 2025Updated last year
clayandgithub / rnn_cws
View on GitHub
chinese word segmentation based on rnn
☆13Oct 14, 2016Updated 9 years ago
rasensuihei / mcf
View on GitHub
Emacs major mode for editing Minecraft mcfunction.
☆20Apr 12, 2023Updated 2 years ago
purescript-deprecated / purescript-node-fs-aff
View on GitHub
Aff wrappers for purescript-node-fs
☆21Aug 4, 2023Updated 2 years ago
KorAP / Tokenizer-Evaluation
View on GitHub
Benchmark scripts for comparing different tokenizers and sentence segmenters of German
☆12Feb 27, 2023Updated 3 years ago
AOEChamp / NetflixSubLoader
View on GitHub
Load subtitles into Netflix
☆12Mar 6, 2021Updated 5 years ago
hizkifw / bong
View on GitHub
ChatGPT with access to the internet
☆26Jun 16, 2023Updated 2 years ago
yono / python-extractcontent
View on GitHub
extractcontent.rb の python 版
☆24Apr 10, 2017Updated 8 years ago
zzeng13 / DISC
View on GitHub
Automatic Idiomatic Expression Detection
☆13Sep 26, 2021Updated 4 years ago
pingtype / pingtype.github.io
View on GitHub
☆17Updated this week
dragnet-org / dragnet_data
View on GitHub
code and data used to build a training dataset for dragnet models
☆10Nov 29, 2020Updated 5 years ago
jonsafari / clustercat
View on GitHub
Fast Word Clustering Software
☆79Feb 8, 2025Updated last year
djstrong / PL-Wiktionary-To-Dictionary
View on GitHub
Parses Polish wiktionary and creates simple dictionaries of foreign languages (e.g. English) to Polish and vice versa.
☆16Jul 22, 2013Updated 12 years ago
linzino7 / matplotlibChinesefix
View on GitHub
☆13Jun 20, 2018Updated 7 years ago
alelapi / material-ui-player
View on GitHub
React/Material-UI Audio and Video Components
☆16Mar 13, 2026Updated last week
originell / smaz-py3
View on GitHub
Small string compression using smaz compression algorithm. Fast, because it's in C. Supports Python 3+
☆13Oct 18, 2025Updated 5 months ago
voidism / pywordseg
View on GitHub
Open Source State-of-the-art Chinese Word Segmentation System with BiLSTM and ELMo. https://arxiv.org/abs/1901.05816
☆46May 20, 2021Updated 4 years ago
lenakmeth / Wikinflection-Corpus
View on GitHub
The Wikinflection Corpus, from the paper "Wikinflection Corpus: A (Better) Multilingual, Morpheme-Annotated Inflectional Corpus" (Metheni…
☆12Dec 15, 2023Updated 2 years ago
overhangio / tutor-cairn
View on GitHub
Scalable, real-time analytics for Open edX
☆12Jan 16, 2026Updated 2 months ago
jacquerie / biorxiv-cli
View on GitHub
A Python wrapper for the bioRxiv API.
☆10Aug 18, 2021Updated 4 years ago
explosion / curated-tokenizers
View on GitHub
Lightweight piece tokenization library
☆12Apr 15, 2024Updated last year
Update-For-Integrated-Business-AI / CORU
View on GitHub
☆18Jul 7, 2025Updated 8 months ago
lemcoder / TinyComposer
View on GitHub
Kotlin Multiplatform app for creating and playing simple musical ideas
☆27Jan 9, 2025Updated last year
levelevel / AozoraTxt
View on GitHub
青空文庫のテキストファイル
☆14Feb 4, 2024Updated 2 years ago
rsennrich / SMORLemma
View on GitHub
SMOR (Stuttgart Morphology) with alternative lemmatization component
☆13Aug 10, 2023Updated 2 years ago
utunga / sentence_diff
View on GitHub
Difference English sentences via Liechtenstein distance, calculate word error rate, and list out word by word differences
☆10Apr 21, 2020Updated 5 years ago
zhenhua-wang / emacs.d
View on GitHub
My Emacs Config
☆14Mar 4, 2026Updated 2 weeks ago
hsensh / scalp-beauty-salon-website
View on GitHub
A test website created using Django Python for a university project.
☆10Jan 3, 2023Updated 3 years ago
Suzhou-Tongyuan / GaloisFieldNumbers.jl
View on GitHub
JuliaCN 2022 archived demo repo: How Julia beats MATLAB's C codes by 1000x
☆10May 25, 2023Updated 2 years ago