eubinecto / idiomatchLinks

An implementation of SpaCy(3.0)'s Matcher specifically designed for identifying English idioms.

☆46

Alternatives and similar repositories for idiomatch

Users that are interested in idiomatch are comparing it to the libraries listed below

Sorting:

Kyubyong / KoParadigm
KoParadigm: Korean Inflectional Paradigm Generator
☆56Updated 2 years ago
cynthia / kosentences
Large scale unannotated Korean corpus for unsupervised tasks. (e.g. Language modeling)
☆27Updated 5 years ago
warnikchow / raws
Real-time automatic word segmentation (for user-generated texts)
☆21Updated 2 years ago
karlstratos / koreannet
☆9Updated 7 years ago
songhyunje / kma
Korean morphological analyzer
☆26Updated 5 years ago
MrBananaHuman / KoGPT2ForParaphrasing
TEMP
☆34Updated 5 years ago
emorynlp / ud-korean
Universal Dependency Treebanks in Korean
☆37Updated 3 years ago
UniversalDependencies / UD_Korean-Kaist
Data from KAIST (a Korean treebank).
☆19Updated 3 weeks ago
lovit / korean_lemmatizer
한국어 용언 분석기 (원형 복원, 용언 형태소 분석)
☆42Updated 5 years ago
lovit / levenshtein_finder
Similar string search in Levenshtein distance
☆21Updated 4 years ago
jeongukjae / korean-wikipedia-corpus
문장단위로 분절된 한국어 위키피디아 코퍼스. Releases에서 다운로드 받거나 tfds-korean으로 사용해주세요.
☆24Updated last year
warnikchow / sae4k
Structured argument extraction for Korean
☆22Updated 3 years ago
machinereading / koreanframenet
☆14Updated 4 years ago
nawnoes / pytorch-meena
Implementation Google Meena for open domain conversation.
☆29Updated 3 years ago
j-min / korean-parallel-corpora
Korean Parallel Corpus
☆11Updated 10 years ago
tunib-ai / transformers
🚀 Implementation of easy-to-use 3D parallelism based on Huggingface Transformers & Microsoft DeepSpeed
☆31Updated 3 years ago
warnikchow / paraKQC
Parallel dataset of Korean Questions and Commands
☆61Updated 2 years ago
tunib-ai / artwork_captions
Machine Generated Captions for Best Artworks
☆22Updated 2 years ago
snunlp / KR-BERT-MEDIUM
Expanded KR-BERT by adding more training data
☆12Updated 4 years ago
koalanlp / python-support
Python wrapper for KoalaNLP (Korean NLP with Java/Scala)
☆31Updated 2 weeks ago
kakaobrain / kortok
The code and models for "An Empirical Study of Tokenization Strategies for Various Korean NLP Tasks" (AACL-IJCNLP 2020)
☆118Updated 4 years ago
openkorpos / model-mecab
MeCab model trained with OpenKorPos.
☆23Updated 3 years ago
kakaobrain / jejueo
Jejueo Datasets for Machine Translation and Speech Synthesis
☆79Updated 5 years ago
naver / covid19-nmt
Multi-lingual & multi-domain (specialisation for biomedical data) translation model
☆40Updated 4 years ago
inmoonlight / koco
Easy installer of kocohub dataset
☆24Updated 5 years ago
NoUnique / pymecab-ko
🐍 pymecab-ko. you can find original version here: https://bitbucket.org/eunjeon/mecab-ko, https://github.com/SamuraiT/mecab-python3
☆17Updated 10 months ago
Sunkyoung / Compare-tokenizer
Tokenizer 비교 실험
☆11Updated 3 years ago
lovit / petitions_archive
청와대 국민청원 데이터 아카이브
☆15Updated 4 years ago
tunib-ai / DKTC
Dataset of Korean Threatening Conversations
☆74Updated 2 years ago
SungjoonPark / KoreanWordVectors
Subword-level Word Vector Representations for Korean (ACL 2018)
☆107Updated 5 years ago