etri-crossmodal / gbswt5
CharFormer (Tay et al., 2022; Gradient-based Subword Tokenizer + T5) model implementation for Huggingface Transformers
☆20 · Updated 10 months ago
Alternatives and similar repositories for gbswt5
Users who are interested in gbswt5 are comparing it to the libraries listed below.
- AVocaDo: Strategy for Adapting Vocabulary to Downstream Domain ☆23 · Updated 3 years ago
- Research and data on query-based document summarization for building an explainable open-domain question answering system ☆55 · Updated last year
- [Findings of NAACL 2022] A Dog Is Passing Over The Jet? A Text-Generation Dataset for Korean Commonsense Reasoning and Evaluation ☆27 · Updated 2 years ago
- KLUE Benchmark 1st place (2021.12) solutions. (RE, MRC, NLI, STS, TC) ☆25 · Updated 3 years ago
- Code for the paper "You Truly Understand What I Need: Intellectual and Friendly Dialogue Agents grounding Knowledge and Persona" which i… ☆23 · Updated 2 years ago
- Code and data for "KoDialogBench: Evaluating Conversational Understanding of Language Models with Korean Dialogue Benchmark" (LREC-COLING… ☆17 · Updated 4 months ago
- This repo implements "Dense Passage Retrieval for Open-Domain Question Answering" using a Korean dataset ☆75 · Updated 2 years ago
- KommonGen, a dataset for commonsense reasoning with Korean generative models. ☆20 · Updated 3 years ago
- Korean Commonsense Knowledge Graph ☆14 · Updated 2 years ago
- This repo is for Korean wiki table question answering datasets described in the paper of Korean-Specific Dataset for Table Question Answe… ☆92 · Updated 10 months ago
- Korean T5 model ☆54 · Updated 3 years ago
- KOLD: Korean Offensive Language Dataset ☆81 · Updated 2 years ago
- For the RLHF learning environment of Koreans ☆23 · Updated last year
- Source code and dataset for "Call for Customized Conversation: Customized Conversation Grounding Persona and Knowledge" ☆62 · Updated 2 years ago
- Official code and dataset repository of KoBBQ (TACL 2024) ☆18 · Updated last year
- Difference-based Contrastive Learning for Korean Sentence Embeddings ☆25 · Updated 2 years ago
- BPE-based Korean T5 model for a unified text-to-text framework ☆63 · Updated last year
- Code for "RADCoT: Retrieval-Augmented Distillation to Specialization Models for Generating Chain-of-Thoughts in Query Expansion", LREC-CO… ☆10 · Updated last year
- An example of fine-tuning kogpt with oslo. ☆23 · Updated 3 years ago
- CareCall for Seniors: Role Specified Open-Domain Dialogue dataset generated by leveraging LLMs (NAACL 2022). ☆60 · Updated 3 years ago
- This repository is forked from ParlAI. A Korean Wizard of Wikipedia task was added to this repo. This repository is going to be moved after EM… ☆16 · Updated 2 years ago
- Easy Language Model Pretraining leveraging Huggingface's Transformers and Datasets ☆130 · Updated 2 years ago
- BERTScore for Korean ☆81 · Updated last year
- CLIcK: A Benchmark Dataset of Cultural and Linguistic Intelligence in Korean ☆45 · Updated 8 months ago