eubinecto / idiomatchLinks
An implementation of SpaCy(3.0)'s Matcher specifically designed for identifying English idioms.
☆46Updated 3 months ago
Alternatives and similar repositories for idiomatch
Users that are interested in idiomatch are comparing it to the libraries listed below
Sorting:
- KoParadigm: Korean Inflectional Paradigm Generator☆56Updated 2 years ago
- Large scale unannotated Korean corpus for unsupervised tasks. (e.g. Language modeling)☆27Updated 5 years ago
- Real-time automatic word segmentation (for user-generated texts)☆21Updated 2 years ago
- ☆9Updated 7 years ago
- Korean morphological analyzer☆26Updated 5 years ago
- TEMP☆34Updated 5 years ago
- Universal Dependency Treebanks in Korean☆37Updated 3 years ago
- Data from KAIST (a Korean treebank).☆19Updated 3 weeks ago
- 한국어 용언 분석기 (원형 복원, 용언 형태소 분석)☆42Updated 5 years ago
- Similar string search in Levenshtein distance☆21Updated 4 years ago
- 문장단위로 분절된 한국어 위키피디아 코퍼스. Releases에서 다운로드 받거나 tfds-korean으로 사용해주세요.☆24Updated last year
- Structured argument extraction for Korean☆22Updated 3 years ago
- ☆14Updated 4 years ago
- Implementation Google Meena for open domain conversation.☆29Updated 3 years ago
- Korean Parallel Corpus☆11Updated 10 years ago
- 🚀 Implementation of easy-to-use 3D parallelism based on Huggingface Transformers & Microsoft DeepSpeed☆31Updated 3 years ago
- Parallel dataset of Korean Questions and Commands☆61Updated 2 years ago
- Machine Generated Captions for Best Artworks☆22Updated 2 years ago
- Expanded KR-BERT by adding more training data☆12Updated 4 years ago
- Python wrapper for KoalaNLP (Korean NLP with Java/Scala)☆31Updated 2 weeks ago
- The code and models for "An Empirical Study of Tokenization Strategies for Various Korean NLP Tasks" (AACL-IJCNLP 2020)☆118Updated 4 years ago
- MeCab model trained with OpenKorPos.☆23Updated 3 years ago
- Jejueo Datasets for Machine Translation and Speech Synthesis☆79Updated 5 years ago
- Multi-lingual & multi-domain (specialisation for biomedical data) translation model☆40Updated 4 years ago
- Easy installer of kocohub dataset☆24Updated 5 years ago
- 🐍 pymecab-ko. you can find original version here: https://bitbucket.org/eunjeon/mecab-ko, https://github.com/SamuraiT/mecab-python3☆17Updated 10 months ago
- Tokenizer 비교 실험☆11Updated 3 years ago
- 청와대 국민청원 데이터 아카이브☆15Updated 4 years ago
- Dataset of Korean Threatening Conversations☆74Updated 2 years ago
- Subword-level Word Vector Representations for Korean (ACL 2018)☆107Updated 5 years ago