Python module that identifies Chinese text as being Simplified or Traditional
☆105Nov 20, 2024Updated last year
Alternatives and similar repositories for hanzidentifier
Users that are interested in hanzidentifier are comparing it to the libraries listed below
Sorting:
- Evaluate language models using multiple choice items☆13Mar 6, 2026Updated 2 weeks ago
- A Memento Client Library in Python☆27Mar 5, 2018Updated 8 years ago
- A variational autoencoder for text processing using 1D convolutions and the FastText word embeddings☆12Dec 11, 2022Updated 3 years ago
- Find and Meet KOLs on Facebook Taiwan president election Analisis by SNA and NLP☆12Dec 29, 2020Updated 5 years ago
- ☆11May 26, 2021Updated 4 years ago
- ☆35Dec 17, 2020Updated 5 years ago
- Translation of query languages to serialized KoralQuery protocol☆14Mar 9, 2026Updated last week
- docker image to build node.js based projects and to able to run headless chrome☆11Sep 2, 2019Updated 6 years ago
- ☆14Mar 9, 2025Updated last year
- extension for fabric to handle prompts through pexpect☆44May 31, 2015Updated 10 years ago
- Material parsers and other tools, scripts Initially developed for Grobid Superconductor☆13Feb 21, 2025Updated last year
- chinese word segmentation based on rnn☆13Oct 14, 2016Updated 9 years ago
- Emacs major mode for editing Minecraft mcfunction.☆20Apr 12, 2023Updated 2 years ago
- Aff wrappers for purescript-node-fs☆21Aug 4, 2023Updated 2 years ago
- Benchmark scripts for comparing different tokenizers and sentence segmenters of German☆12Feb 27, 2023Updated 3 years ago
- Load subtitles into Netflix☆12Mar 6, 2021Updated 5 years ago
- ChatGPT with access to the internet☆26Jun 16, 2023Updated 2 years ago
- extractcontent.rb の python 版☆24Apr 10, 2017Updated 8 years ago
- Automatic Idiomatic Expression Detection☆13Sep 26, 2021Updated 4 years ago
- ☆17Updated this week
- code and data used to build a training dataset for dragnet models☆10Nov 29, 2020Updated 5 years ago
- Fast Word Clustering Software☆79Feb 8, 2025Updated last year
- Parses Polish wiktionary and creates simple dictionaries of foreign languages (e.g. English) to Polish and vice versa.☆16Jul 22, 2013Updated 12 years ago
- ☆13Jun 20, 2018Updated 7 years ago
- React/Material-UI Audio and Video Components☆16Mar 13, 2026Updated last week
- Small string compression using smaz compression algorithm. Fast, because it's in C. Supports Python 3+☆13Oct 18, 2025Updated 5 months ago
- Open Source State-of-the-art Chinese Word Segmentation System with BiLSTM and ELMo. https://arxiv.org/abs/1901.05816☆46May 20, 2021Updated 4 years ago
- The Wikinflection Corpus, from the paper "Wikinflection Corpus: A (Better) Multilingual, Morpheme-Annotated Inflectional Corpus" (Metheni…☆12Dec 15, 2023Updated 2 years ago
- Scalable, real-time analytics for Open edX☆12Jan 16, 2026Updated 2 months ago
- A Python wrapper for the bioRxiv API.☆10Aug 18, 2021Updated 4 years ago
- Lightweight piece tokenization library☆12Apr 15, 2024Updated last year
- ☆18Jul 7, 2025Updated 8 months ago
- Kotlin Multiplatform app for creating and playing simple musical ideas☆27Jan 9, 2025Updated last year
- 青空文庫のテキストファイル☆14Feb 4, 2024Updated 2 years ago
- SMOR (Stuttgart Morphology) with alternative lemmatization component☆13Aug 10, 2023Updated 2 years ago
- Difference English sentences via Liechtenstein distance, calculate word error rate, and list out word by word differences☆10Apr 21, 2020Updated 5 years ago
- My Emacs Config☆14Mar 4, 2026Updated 2 weeks ago
- A test website created using Django Python for a university project.☆10Jan 3, 2023Updated 3 years ago
- JuliaCN 2022 archived demo repo: How Julia beats MATLAB's C codes by 1000x☆10May 25, 2023Updated 2 years ago