etri-crossmodal / gbswt5Links
CharFormer(Tay et al., 2022; Gradient-based Subword Tokenizer + T5) model implementation for Huggingface Transformers
☆20Updated 7 months ago
Alternatives and similar repositories for gbswt5
Users that are interested in gbswt5 are comparing it to the libraries listed below
Sorting:
- AVocaDo : Strategy for Adapting Vocabulary to Downstream Domain☆23Updated 3 years ago
- [Findings of NAACL2022] A Dog Is Passing Over The Jet? A Text-Generation Dataset for Korean Commonsense Reasoning and Evaluation☆27Updated 2 years ago
- This repo Implements "Dense Passage Retrieval for Open-Domain Question Answering" using Korean Dataset☆75Updated 2 years ago
- Korean Commonsense Knowledge Graph☆14Updated 2 years ago
- Code for the paper "You Truly Understand What I Need : Intellectual and Friendly Dialogue Agents grounding Knowledge and Persona" which i…☆23Updated 2 years ago
- KLUE Benchmark 1st place (2021.12) solutions. (RE, MRC, NLI, STS, TC)☆25Updated 3 years ago
- 한국어 생성 모델의 상식 추론을 위한 KommonGen 데이터셋입니다.☆19Updated 3 years ago
- ☆91Updated last month
- Code and data for "KoDialogBench: Evaluating Conversational Understanding of Language Models with Korean Dialogue Benchmark" (LREC-COLING…☆16Updated last month
- This repo is for Korean wiki table question answering datasets described in the paper of Korean-Specific Dataset for Table Question Answe…☆91Updated 7 months ago
- 한국어 T5 모델☆53Updated 3 years ago
- kogpt를 oslo로 파인튜닝하는 예제.☆23Updated 2 years ago
- KOLD: Korean Offensive Language Dataset☆80Updated 2 years ago
- A Situational Conversation-Based English Education Platform☆21Updated 2 years ago
- ☆17Updated last year
- ☆36Updated last year
- DSBA code study☆28Updated last year
- CareCall for Seniors: Role Specified Open-Domain Dialogue dataset generated by leveraging LLMs (NAACL 2022).☆60Updated 3 years ago
- For the rlhf learning environment of Koreans☆23Updated last year
- final-project-level3-nlp-02 created by GitHub Classroom☆11Updated 3 years ago
- BERTScore for Korean☆77Updated last year
- Official repository for "Reweighting Strategy based on Synthetic Data Identification for Sentence Similarity (COLING2022)"☆18Updated 2 years ago
- 한국어 LLM 리더보드 및 모델 성능/안전성 관리☆22Updated last year
- bpe based korean t5 model for text-to-text unified framework☆62Updated last year
- Source codes and dataset of Call for Customized Conversation: Customized Conversation Grounding Persona and Knowledge☆62Updated last year
- Hate speech detection corpus in Korean, shared with EMNLP 2023 paper☆15Updated last year
- 제 4회 북커톤 대회에 참여한 'Hey, Shakesby' 입니다.☆8Updated 2 years ago
- [누구인가? 누가 내 험담을 하였는가!] 악플수집 서비스 #악플 #멈춰☆9Updated 2 years ago
- KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models☆25Updated 9 months ago
- Official Code for SIGIR 2022 "A Multi-task Based Neural Model to Simulate Users in Goal Oriented Dialogue Systems". User Simulator genera…☆38Updated 2 years ago