monologg / ko_lm_dataformat
A utility for storing and reading files for Korean LM training 💾
☆36 · Updated last year
Alternatives and similar repositories for ko_lm_dataformat
Users who are interested in ko_lm_dataformat are comparing it to the libraries listed below.
- Korean Named Entity Corpus ☆25 · Updated 2 years ago
- Adversarial Test Dataset for Korean Multi-turn Response Selection ☆35 · Updated 3 years ago
- Adds noise to Korean text. ☆27 · Updated 2 years ago
- Korean T5 model ☆54 · Updated 3 years ago
- ☆20 · Updated 3 years ago
- An example of fine-tuning kogpt with oslo. ☆23 · Updated 2 years ago
- #Paired Question ☆24 · Updated 5 years ago
- Korean honorific correction ☆26 · Updated 2 years ago
- Character-level Korean ELECTRA Model (syllable-level Korean ELECTRA) ☆54 · Updated 2 years ago
- Kobart model on Huggingface transformers ☆64 · Updated 3 years ago
- ☆32 · Updated last year
- KLUE Benchmark 1st place (2021.12) solutions. (RE, MRC, NLI, STS, TC) ☆25 · Updated 3 years ago
- Korean lexical semantic analysis model ☆21 · Updated 3 years ago
- Korean Light Weight Language Model ☆30 · Updated 2 years ago
- Korean Wikipedia corpus segmented into sentences. Download it from Releases or use it via tfds-korean. ☆24 · Updated last year
- Bias, Hate classification with KoELECTRA 👿 ☆27 · Updated 2 years ago
- Training Transformers of Huggingface with KoNLPy ☆68 · Updated 4 years ago
- ☆18 · Updated 3 years ago
- The code and models for "An Empirical Study of Tokenization Strategies for Various Korean NLP Tasks" (AACL-IJCNLP 2020) ☆119 · Updated 4 years ago
- [HCLT 2022] Korean sentence text similarity dataset using naver shopping review ☆25 · Updated 2 years ago
- T5-base model for Korean ☆27 · Updated 4 years ago
- NamuwikiExtractor, for obtaining cleaned text from Namuwiki dumps ☆18 · Updated 3 years ago
- Easy Language Model Pretraining leveraging Huggingface's Transformers and Datasets ☆129 · Updated 2 years ago
- ☆26 · Updated 4 years ago
- Finetuning Pipeline ☆90 · Updated 3 years ago
- [Unofficial] Kakaotrans: Kakao translate API for python ☆16 · Updated 5 years ago
- NSMC, KorSTS ... fine-tunings ☆19 · Updated 3 years ago
- Korean Math Word Problems ☆59 · Updated 3 years ago
- [Findings of NAACL2022] A Dog Is Passing Over The Jet? A Text-Generation Dataset for Korean Commonsense Reasoning and Evaluation ☆27 · Updated 2 years ago
- ☆19 · Updated 2 years ago