Kyubyong / name2nat
name2nat: a Python package for nationality prediction from a name
☆105Updated 4 years ago
Alternatives and similar repositories for name2nat:
Users that are interested in name2nat are comparing it to the libraries listed below
- KoParadigm: Korean Inflectional Paradigm Generator☆56Updated 2 years ago
- Package for controllable summarization☆78Updated 2 years ago
- Real-time automatic word segmentation (for user-generated texts)☆21Updated last year
- Multi-lingual & multi-domain (specialisation for biomedical data) translation model☆40Updated 4 years ago
- BERT models for many languages created from Wikipedia texts☆34Updated 4 years ago
- HateEval 2019 - Task 5☆15Updated 5 years ago
- Subword Language Model for Query Auto-Completion☆66Updated 5 years ago
- TEMP☆34Updated 4 years ago
- MeCab model trained with OpenKorPos.☆22Updated 2 years ago
- Similar string search in Levenshtein distance☆22Updated 3 years ago
- A python true casing utility that restores case information for texts☆88Updated 2 years ago
- A repository to bind mecab for Python 3.5+. Not using swig nor pybind. (Not Maintained Now)☆28Updated 3 years ago
- Data from KAIST (a Korean treebank).☆19Updated 2 months ago
- Universal Dependency Treebanks in Korean☆37Updated 3 years ago
- Preprocessing Library for Natural Language Processing☆161Updated 2 years ago
- ☆64Updated last year
- Large scale unannotated Korean corpus for unsupervised tasks. (e.g. Language modeling)☆27Updated 5 years ago
- Pre-trained Machine Translation Models of Korean from/to ECJ☆29Updated 5 years ago
- Röttger et al. (ACL 2021): "HateCheck: Functional Tests for Hate Speech Detection Models" - Data☆57Updated 3 years ago
- DeEpLearning models for MultIlingual haTespeech (DELIMIT): Benchmarking multilingual models across 9 languages and 16 datasets.☆108Updated last year
- 🐍 pymecab-ko. you can find original version here: https://bitbucket.org/eunjeon/mecab-ko, https://github.com/SamuraiT/mecab-python3☆16Updated 5 months ago
- Semantic Search using FAISS & ElasticSearch☆31Updated 4 years ago
- reference pytorch code for intent classification☆44Updated 3 months ago
- A python module for word inflections designed for use with spaCy.☆92Updated 4 years ago
- Simple extension of WikiExtractor(https://github.com/attardi/wikiextractor)☆16Updated 8 years ago
- Korean version of GoEmotions Dataset 😍😢😱☆54Updated last year
- A tutorial of pertaining Bert on your own dataset using google TPU☆44Updated 4 years ago
- A fully customisable language detection pipeline for spaCy☆92Updated 5 years ago
- This is a simple Python package for calculating a variety of lexical diversity indices☆69Updated last year
- Data collection, alignment and TAUS repository☆20Updated 7 years ago