Self-Supervised Document-to-Document Similarity Ranking via Contextualized Language Models and Hierarchical Inference
☆44Nov 28, 2022Updated 3 years ago
Alternatives and similar repositories for SDR
Users that are interested in SDR are comparing it to the libraries listed below
Sorting:
- ↔️ Utilizing RBERT model structure for KLUE Relation Extraction task☆15Nov 15, 2022Updated 3 years ago
- Convenient Text-to-Text Training for Transformers☆19Dec 10, 2021Updated 4 years ago
- 이동호, 이정훈, 김유리, 김형준, 박승면, 양유준, 신웅비 (Dong Ho Lee, Jung Hoon Lee, Yu Ri Kim, Hyung Jun Kim, Seung Myun Park, Yu Jun Yang, Woong Bi Shin)☆15Apr 16, 2020Updated 5 years ago
- Korean Named Entity Corpus☆25May 12, 2023Updated 2 years ago
- Convert pretrained RoBerta models to various long-document transformer models☆11Apr 5, 2022Updated 3 years ago
- A utility for storing and reading files for Korean LM training 💾☆35Oct 15, 2025Updated 5 months ago
- KoRean based ELECTRA pre-trained models (KR-ELECTRA) for Tensorflow and PyTorch☆15Feb 13, 2022Updated 4 years ago
- ☆11Nov 23, 2021Updated 4 years ago
- A Pytorch-Lightning Implementation of Transformer Network☆11Oct 22, 2020Updated 5 years ago
- ELECTRA MODEL NLP☆13Apr 8, 2020Updated 5 years ago
- Korean Nested Named Entity Corpus☆20May 13, 2023Updated 2 years ago
- ☆11Oct 3, 2021Updated 4 years ago
- 한국어 생성 모델의 상식 추론을 위한 KommonGen 데이터셋입니다.☆17Oct 5, 2021Updated 4 years ago
- This repository contains the code for paper Prompting ELECTRA Few-Shot Learning with Discriminative Pre-Trained Models.☆48Jun 7, 2022Updated 3 years ago
- This is project for korean auto spacing☆12Aug 3, 2020Updated 5 years ago
- The Codebase for Causal Distillation for Language Models (NAACL '22)☆26May 1, 2022Updated 3 years ago
- NumPy로 구현한 딥러닝 라이브러리입니다. (자동 미분 지원)☆15May 4, 2021Updated 4 years ago
- NIA(National Information Society Agency) Hangul Dictionary☆34Oct 27, 2020Updated 5 years ago
- Plug-and-play Search Interfaces with Pyserini and Hugging Face☆32Aug 5, 2023Updated 2 years ago
- [Findings of NAACL2022] A Dog Is Passing Over The Jet? A Text-Generation Dataset for Korean Commonsense Reasoning and Evaluation☆11May 27, 2022Updated 3 years ago
- ☆25Oct 28, 2020Updated 5 years ago
- CCQA A New Web-Scale Question Answering Dataset for Model Pre-Training☆32Jul 20, 2022Updated 3 years ago
- KLUE Benchmark 1st place (2021.12) solutions. (RE, MRC, NLI, STS, TC)☆25Apr 11, 2022Updated 3 years ago
- I hope to this list will contribute good influence in Korean online services.☆63Feb 10, 2019Updated 7 years ago
- 모두의 말뭉치 데이터를 분석에 편리한 형태로 변환하는 기능을 제공합니다.☆11Mar 2, 2022Updated 4 years ago
- code for participation in ICDAR2021 Table Recognition track (Team Name: LTIAYN = Kaen Context)☆22Jun 16, 2021Updated 4 years ago
- ☆24Nov 22, 2022Updated 3 years ago
- ☆14Sep 10, 2021Updated 4 years ago
- exBERT on Transformers🤗☆10Jun 14, 2021Updated 4 years ago
- Dataset of Korean Threatening Conversations☆72Nov 1, 2022Updated 3 years ago
- ☆14Dec 9, 2021Updated 4 years ago
- CSS-LM: Contrastive Semi-supervised Fine-tuning of Pre-trained Language Models☆12Jul 1, 2023Updated 2 years ago
- Difference-based Contrastive Learning for Korean Sentence Embeddings☆23Mar 11, 2026Updated last week
- ☆19Jul 6, 2023Updated 2 years ago
- ELECTRA기반 한국어 대화체 언어모델☆53Aug 4, 2021Updated 4 years ago
- [HCLT 2022] Korean sentence text similarity dataset using naver shopping review☆25Oct 20, 2022Updated 3 years ago
- baikal.ai's pre-trained BERT models: descriptions and sample codes☆12Jun 24, 2021Updated 4 years ago
- Wikitext format dataset of Namuwiki (Most famous Korean wikipedia)☆53Oct 25, 2020Updated 5 years ago
- LAReQA is a challenging benchmark for evaluating language agnostic answer retrieval from a multilingual candidate pool. This repository c…☆14May 19, 2020Updated 5 years ago