etri-crossmodal / gbswt5Links
CharFormer(Tay et al., 2022; Gradient-based Subword Tokenizer + T5) model implementation for Huggingface Transformers
☆20Updated 8 months ago
Alternatives and similar repositories for gbswt5
Users that are interested in gbswt5 are comparing it to the libraries listed below
Sorting:
- [Findings of NAACL2022] A Dog Is Passing Over The Jet? A Text-Generation Dataset for Korean Commonsense Reasoning and Evaluation☆27Updated 2 years ago
- Korean Commonsense Knowledge Graph☆14Updated 2 years ago
- This repo Implements "Dense Passage Retrieval for Open-Domain Question Answering" using Korean Dataset☆75Updated 2 years ago
- KOLD: Korean Offensive Language Dataset☆80Updated 2 years ago
- Code for the paper "You Truly Understand What I Need : Intellectual and Friendly Dialogue Agents grounding Knowledge and Persona" which i…☆23Updated 2 years ago
- KLUE Benchmark 1st place (2021.12) solutions. (RE, MRC, NLI, STS, TC)☆25Updated 3 years ago
- 한국어 생성 모델의 상식 추론을 위한 KommonGen 데이터셋입니다.☆19Updated 3 years ago
- This repo is for Korean wiki table question answering datasets described in the paper of Korean-Specific Dataset for Table Question Answe…☆91Updated 8 months ago
- Korean Light Weight Language Model☆30Updated 2 years ago
- AVocaDo : Strategy for Adapting Vocabulary to Downstream Domain☆23Updated 3 years ago
- This repository forked from parlAI. Korean Wizard of Wikipedia task was added to this repo. This repository is going to be moved after EM…☆16Updated 2 years ago
- final-project-level3-nlp-02 created by GitHub Classroom☆11Updated 3 years ago
- BERTScore for Korean☆78Updated last year
- Code and data for "KoDialogBench: Evaluating Conversational Understanding of Language Models with Korean Dialogue Benchmark" (LREC-COLING…☆16Updated 2 months ago
- For the rlhf learning environment of Koreans☆23Updated last year
- 한국어 LLM 리더보드 및 모델 성능/안전성 관리☆22Updated last year
- CareCall for Seniors: Role Specified Open-Domain Dialogue dataset generated by leveraging LLMs (NAACL 2022).☆60Updated 3 years ago
- A Situational Conversation-Based English Education Platform☆21Updated 2 years ago
- ☆20Updated 2 years ago
- bpe based korean t5 model for text-to-text unified framework☆63Updated last year
- 한국어 T5 모델☆54Updated 3 years ago
- Dataset and code for paper: "Can LLM Generate Culturally Relevant Commonsense QA Data? Case Study in Indonesian and Sundanese".☆16Updated 7 months ago
- Finetuning Pipeline☆90Updated 3 years ago
- ☆92Updated 2 months ago
- Character-level Korean ELECTRA Model (음절 단위 한국어 ELECTRA)☆54Updated 2 years ago
- Source codes and dataset of Call for Customized Conversation: Customized Conversation Grounding Persona and Knowledge☆62Updated last year
- Simple Contrastive Learning of Korean Sentence Embeddings☆51Updated 2 years ago
- ☆12Updated 3 years ago
- ☆59Updated last year
- Easy Language Model Pretraining leveraging Huggingface's Transformers and Datasets☆129Updated 2 years ago