larsyencken / simsearch
Search-by-similarity for Japanese kanji
☆10 Updated 7 years ago
Alternatives and similar repositories for simsearch:
Users who are interested in simsearch are comparing it to the libraries listed below.
- Top 5000 Japanese family names, with readings, ordered by frequency. ☆16 Updated 7 years ago
- Rakuten MA (Python version) ☆22 Updated 7 years ago
- JMdict Japanese dictionary - lexicographic, etc. issues management ☆19 Updated 4 years ago
- Japanese data from the Google UDT 2.0. ☆37 Updated 3 months ago
- [LREC 2020] EtymDB, an Etymological DataBase (v2.1) ☆24 Updated 3 years ago
- Japanese Natural Language Processing Libraries ☆148 Updated 4 years ago
- A machine-readable human-generated file for finding kanji similar to a given kanji and tools to help expand and prune it. ☆14 Updated 6 years ago
- The Kyoto Text Analysis Toolkit for word segmentation and pronunciation estimation, etc. ☆205 Updated 4 years ago
- Bilingual sentence aligner ☆27 Updated last year
- Word2vec (word to vectors) approach for Japanese language using Gensim and Mecab. ☆86 Updated 3 years ago
- Tokenizer POS-tagger Lemmatizer and Dependency-parser for modern and contemporary Japanese ☆34 Updated 2 months ago
- Yet Another Japanese Dependency Structure Analyzer ☆111 Updated 4 years ago
- SDK for TEASPN, a framework and a protocol for integrated writing assistance environments ☆61 Updated 2 years ago
- natto-py combines the Python programming language with MeCab, the part-of-speech and morphological analyzer for the Japanese language. ☆93 Updated 8 months ago
- Aims to make JapaneseTokenizer as easy to use as possible ☆138 Updated 5 years ago
- Wiktionary parser tool for many language editions. ☆54 Updated 2 years ago
- Small example scripts for working with Japanese texts in Python ☆26 Updated 5 years ago
- ☆97 Updated 3 years ago
- FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (inclu… ☆61 Updated 9 months ago
- English HPSG parser ☆51 Updated 6 years ago
- CUI-based Tree Visualizer for Universal Dependencies and Immediate Catena Analysis ☆108 Updated 2 months ago
- The Community-enRiched Open WordNet (CROWN) ☆19 Updated 9 years ago
- A sample implementation of the TEASPN server ☆19 Updated 5 years ago
- The Language Learning Toolkit (LLTK) performs a variety of tasks useful for (human) language learning. ☆41 Updated 5 years ago
- Anki2 Add-On to look-up the pronunciation of Japanese expressions. ☆71 Updated 4 years ago
- Annodoc annotation documentation support system ☆34 Updated 4 years ago
- linguistics backend ☆41 Updated last year
- A Python parser of the JMdict file. ☆11 Updated 7 years ago
- A paraphrase database for Japanese text simplification ☆32 Updated 7 years ago
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings. ☆51 Updated 3 years ago