Self-Supervised Document-to-Document Similarity Ranking via Contextualized Language Models and Hierarchical Inference
☆45Nov 28, 2022Updated 3 years ago
Alternatives and similar repositories for SDR
Users that are interested in SDR are comparing it to the libraries listed below
Sorting:
- Convenient Text-to-Text Training for Transformers☆19Dec 10, 2021Updated 4 years ago
- ↔️ Utilizing RBERT model structure for KLUE Relation Extraction task☆15Nov 15, 2022Updated 3 years ago
- Convert pretrained RoBerta models to various long-document transformer models☆11Apr 5, 2022Updated 3 years ago
- Korean Named Entity Corpus☆25May 12, 2023Updated 2 years ago
- KoRean based ELECTRA pre-trained models (KR-ELECTRA) for Tensorflow and PyTorch☆15Feb 13, 2022Updated 4 years ago
- 이동호, 이정훈, 김유리, 김형준, 박승면, 양유준, 신웅비 (Dong Ho Lee, Jung Hoon Lee, Yu Ri Kim, Hyung Jun Kim, Seung Myun Park, Yu Jun Yang, Woong Bi Shin)☆15Apr 16, 2020Updated 5 years ago
- 한국어 생성 모델의 상식 추론을 위한 KommonGen 데이터셋입니다.☆17Oct 5, 2021Updated 4 years ago
- Korean Nested Named Entity Corpus☆20May 13, 2023Updated 2 years ago
- ☆19Jul 6, 2023Updated 2 years ago
- A utility for storing and reading files for Korean LM training 💾☆35Oct 15, 2025Updated 4 months ago
- Official pytorch implementation of I2I translation with low resolution conditioning☆23Sep 2, 2021Updated 4 years ago
- [HCLT 2022] Korean sentence text similarity dataset using naver shopping review☆25Oct 20, 2022Updated 3 years ago
- ☆12Dec 9, 2025Updated 2 months ago
- A Pytorch-Lightning Implementation of Transformer Network☆11Oct 22, 2020Updated 5 years ago
- This is project for korean auto spacing☆12Aug 3, 2020Updated 5 years ago
- This repository contains the code for paper Prompting ELECTRA Few-Shot Learning with Discriminative Pre-Trained Models.☆48Jun 7, 2022Updated 3 years ago
- ☆11Oct 3, 2021Updated 4 years ago
- 모두의 말뭉치 데이터를 분석에 편리한 형태로 변환하는 기능을 제공합니다.☆11Mar 2, 2022Updated 3 years ago
- "자연어처리 알고리즘을 활용한 느린학습자 교육 컨텐츠 제작" 프로젝트 "애움길" 팀입니다. 데이터 수집(크롤링)/EDA/Preprocessing, 쉬운말 생성요약 AI 모델링(NLP - KoBERT, KoBART), 프로토타입 제작을 진행했습니다…☆13Mar 24, 2022Updated 3 years ago
- Code and data for "Heterogeneous Supervised Topic Models"☆11Jun 27, 2022Updated 3 years ago
- ☆14Dec 9, 2021Updated 4 years ago
- 2022년 대한민국 20대 대통령 선거 읍면동 개표 결과☆11Mar 11, 2022Updated 3 years ago
- ☆25Oct 28, 2020Updated 5 years ago
- Code and Hummingbird dataset for EMNLP 2021 paper "Does BERT Learn as Humans Perceive? Understanding Linguistic Styles through Lexica"☆15Apr 13, 2022Updated 3 years ago
- Plug-and-play Search Interfaces with Pyserini and Hugging Face☆32Aug 5, 2023Updated 2 years ago
- ELECTRA MODEL NLP☆13Apr 8, 2020Updated 5 years ago
- baikal.ai's pre-trained BERT models: descriptions and sample codes☆12Jun 24, 2021Updated 4 years ago
- Redesigned KoNLPy (Wrapper) for Usability and Portability with gRPC. [EXPERIMENTAL]☆13Mar 7, 2023Updated 2 years ago
- A web app for translating from one language to another.Almost all languages are available.App also generates an audio file of the transla…☆12Feb 19, 2026Updated last week
- ☆14Oct 4, 2024Updated last year
- SKT'22 AI Fellowship, 딥러닝 기반 흑백 이미지 컬러화 기술 개발☆13Jun 7, 2023Updated 2 years ago
- [Findings of NAACL2022] A Dog Is Passing Over The Jet? A Text-Generation Dataset for Korean Commonsense Reasoning and Evaluation☆11May 27, 2022Updated 3 years ago
- Komoran 3 in Python☆11Dec 10, 2018Updated 7 years ago
- CSS-LM: Contrastive Semi-supervised Fine-tuning of Pre-trained Language Models☆12Jul 1, 2023Updated 2 years ago
- Text generation using language models with multiple exit heads☆16Sep 18, 2025Updated 5 months ago
- exBERT on Transformers🤗☆10Jun 14, 2021Updated 4 years ago
- Hanja Understanding Evaluation Dataset☆15May 2, 2022Updated 3 years ago
- Wikitext format dataset of Namuwiki (Most famous Korean wikipedia)☆53Oct 25, 2020Updated 5 years ago
- KLUE Benchmark 1st place (2021.12) solutions. (RE, MRC, NLI, STS, TC)☆25Apr 11, 2022Updated 3 years ago