Korean Parallel Corpus
☆147Feb 24, 2024Updated 2 years ago
Alternatives and similar repositories for korean-parallel-corpora
Users that are interested in korean-parallel-corpora are comparing it to the libraries listed below
Sorting:
- 2019 국어경진대회 한국어 의존구문 분석 대상(문체부 장관상)☆15Oct 26, 2022Updated 3 years ago
- ☆33Aug 30, 2023Updated 2 years ago
- #Paired Question☆24Jun 16, 2020Updated 5 years ago
- 한국어 개체명 정의 및 표지 표준화 기술보고서와 이를 기반으로 제작된 개체명 형태소 말뭉치☆94Jan 25, 2021Updated 5 years ago
- The code and models for "An Empirical Study of Tokenization Strategies for Various Korean NLP Tasks" (AACL-IJCNLP 2020)☆119Oct 8, 2020Updated 5 years ago
- 문장단위로 분절된 나무위키 데이터셋. Releases에서 다운로드 받거나, tfds-korean을 통해 다운로드 받으세요.☆19Jun 16, 2021Updated 4 years ago
- 국내 자연어 처리 기술을 연구 및 개발하는 스타트업 목록☆165May 10, 2020Updated 5 years ago
- KorNLI and KorSTS: New Benchmark Datasets for Korean Natural Language Understanding☆310Jul 9, 2023Updated 2 years ago
- NLP Shared tasks (NER, SRL) using NSML☆183Jan 3, 2019Updated 7 years ago
- Parallel dataset of Korean Questions and Commands☆60Mar 24, 2023Updated 2 years ago
- Universal Dependency Treebanks in Korean☆38Dec 19, 2021Updated 4 years ago
- TEMP☆34Apr 2, 2020Updated 5 years ago
- Korean English NMT(Neural Machine Translation) with Gluon☆61Feb 28, 2018Updated 8 years ago
- Korean BART☆464Jun 14, 2025Updated 8 months ago
- 한국어 악성댓글 데이터셋☆73Sep 26, 2020Updated 5 years ago
- Korean Parallel Corpus☆11Nov 27, 2014Updated 11 years ago
- 🤗 Pretrained BERT model & WordPiece tokenizer trained on Korean Comments 한국어 댓글로 프리트레이닝한 BERT 모델과 데이터셋☆495Nov 7, 2022Updated 3 years ago
- Korean sejong corpus download and simple analysis☆147May 9, 2019Updated 6 years ago
- A utility for storing and reading files for Korean LM training 💾☆35Oct 15, 2025Updated 4 months ago
- 📖 Korean NLU Benchmark☆587Jul 6, 2022Updated 3 years ago
- BERTScore for Korean☆80Feb 22, 2024Updated 2 years ago
- Korean HateSpeech Dataset☆394Jul 18, 2020Updated 5 years ago
- 네이버 뉴스 중 IT/과학 분야에서 50개를 선정해서 요약에 해당하는 문장을 태깅해둔 데이터셋입니다.☆39Nov 23, 2016Updated 9 years ago
- An integrated library for Korean language preprocessing.☆204Apr 23, 2023Updated 2 years ago
- ☆19Jan 17, 2021Updated 5 years ago
- KoBERT on 🤗 Huggingface Transformers 🤗 (with Bug Fixed)☆212Aug 21, 2024Updated last year
- Implementing nlp papers relevant to classification with PyTorch, gluonnlp☆230Dec 8, 2022Updated 3 years ago
- 개인적으로 수집한 한국어 NLP용 말뭉치 모음☆139Sep 15, 2020Updated 5 years ago
- Naver sentiment movie corpus☆598Mar 7, 2017Updated 8 years ago
- KoParadigm: Korean Inflectional Paradigm Generator☆57Nov 23, 2022Updated 3 years ago
- Simple extension of WikiExtractor(https://github.com/attardi/wikiextractor)☆16Dec 23, 2016Updated 9 years ago
- Data from KAIST (a Korean treebank).☆19Nov 12, 2025Updated 3 months ago
- KoRean based BERT pre-trained models (KR-BERT) for Tensorflow and PyTorch☆212Apr 24, 2024Updated last year
- ☆92Mar 3, 2022Updated 4 years ago
- Korean corpus repository☆743Oct 3, 2022Updated 3 years ago
- Wikitext format dataset of Namuwiki (Most famous Korean wikipedia)☆53Oct 25, 2020Updated 5 years ago
- Split Korean text into sentences using heuristic algorithm.☆215Dec 24, 2020Updated 5 years ago
- 11.5기의 beyondBERT의 토론 내용을 정리하는 repository입니다.☆57Jul 2, 2020Updated 5 years ago
- Sentence Embeddings using Siamese ETRI KoBERT☆163Aug 16, 2025Updated 6 months ago