KoichiYasuoka / spaCy-Thai
Dependency parser on Thai language
☆26Updated 2 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for spaCy-Thai
- Parallel Universal Dependencies.☆14Updated 6 months ago
- Java library to tokenize Thai text into a list of TCCs☆18Updated 7 years ago
- Thai word segmentation using deep learning☆13Updated 5 years ago
- computer tools for thai language☆21Updated 7 years ago
- Reinforcement Learning Agents to Play The Board Game Santorini☆8Updated 5 years ago
- Tic Tac Toe using Minimax and Reinforcement Learning☆29Updated 8 years ago
- GrammarTagger — A Neural Multilingual Grammar Profiler for Language Learning☆27Updated 3 years ago
- BERT models for many languages created from Wikipedia texts☆34Updated 4 years ago
- Tokenizer POS-Tagger and Dependency-parser with BERT/RoBERTa/DeBERTa models for Japanese and other languages☆48Updated last month
- Word2vec (word to vectors) approach for Japanese language using Gensim and Mecab.☆85Updated 2 years ago
- Code for paper "Cross-lingual Transfer for Text Classification with Dictionary-based Heterogeneous Graph", EMNLP 2021 - findings.☆13Updated 2 years ago
- python package for unsupervised text segmentation.☆14Updated 8 years ago
- 🚀 A demonstration of hyperparameter optimization using Optuna for models implemented with AllenNLP.☆16Updated 3 years ago
- A small version of UniDic for easy pip installs.☆38Updated 4 years ago
- Japanese data from the Google UDT 2.0.☆36Updated 2 weeks ago
- Japanese BERT trained on Aozora Bunko and Wikipedia, pre-tokenized by MeCab with UniDic & SudachiPy☆40Updated 4 years ago
- AMI Meeting Parallel Corpus☆9Updated 3 years ago
- Trials of pre-trained BERT models for the medical domain in Japanese.☆12Updated 3 years ago
- Code and datasets of "Multilingual Extractive Reading Comprehension by Runtime Machine Translation"☆39Updated 5 years ago
- Tutorial on AllenNLP library with demo "which journal to submit paper?"☆32Updated 6 years ago
- ☆21Updated 2 weeks ago
- A library to compose and decompose Hangul syllables using Hangul jamo characters☆28Updated 2 years ago
- The Business Scene Dialogue corpus☆68Updated 3 years ago
- Scalable Attentive Sentence-Pair Modeling via Distilled Sentence Embedding (AAAI 2020) - PyTorch Implementation☆31Updated last year
- A repo for shared notebooks☆24Updated last year
- Code for Paper "Target-oriented Fine-tuning for Zero-Resource Named Entity Recognition"☆21Updated 2 years ago
- Japanese data from the Google UDT 2.0.☆28Updated last year
- An easy-to-use library to linguistically compare one sentence and its words to another, in the same language or a different one. For inst…☆20Updated 2 years ago
- An implementation of BERT using PyTorch's TransformerEncoder☆33Updated 4 years ago
- Resources related to EMNLP 2021 paper "FAME: Feature-Based Adversarial Meta-Embeddings for Robust Input Representations"☆13Updated 2 years ago
- These are lists for a variety of languages containing words that are distinctive to each language.☆34Updated 2 years ago