rifkiaputri / id-csqaLinks
Dataset and code for paper: "Can LLM Generate Culturally Relevant Commonsense QA Data? Case Study in Indonesian and Sundanese".
☆16Updated 7 months ago
Alternatives and similar repositories for id-csqa
Users that are interested in id-csqa are comparing it to the libraries listed below
Sorting:
- KOLD: Korean Offensive Language Dataset☆80Updated 2 years ago
- This repo Implements "Dense Passage Retrieval for Open-Domain Question Answering" using Korean Dataset☆75Updated 2 years ago
- Code for the paper "You Truly Understand What I Need : Intellectual and Friendly Dialogue Agents grounding Knowledge and Persona" which i…☆23Updated 2 years ago
- ☆20Updated 2 years ago
- This repo is for Korean wiki table question answering datasets described in the paper of Korean-Specific Dataset for Table Question Answe…☆91Updated 8 months ago
- ☆17Updated last year
- AVocaDo : Strategy for Adapting Vocabulary to Downstream Domain☆23Updated 3 years ago
- 한국어 T5 모델☆54Updated 3 years ago
- Korean Commonsense Knowledge Graph☆14Updated 2 years ago
- CharFormer(Tay et al., 2022; Gradient-based Subword Tokenizer + T5) model implementation for Huggingface Transformers☆20Updated 8 months ago
- BERTScore for Korean☆78Updated last year
- A Situational Conversation-Based English Education Platform☆21Updated 2 years ago
- BLEnD: A Benchmark for LLMs on Everyday Knowledge in Diverse Cultures and Languages☆34Updated last month
- final-project-level3-nlp-02 created by GitHub Classroom☆11Updated 3 years ago
- This repository forked from parlAI. Korean Wizard of Wikipedia task was added to this repo. This repository is going to be moved after EM…☆16Updated 2 years ago
- The list of NLP paper and news I've checked. There might be short description of them (abstract) in Korean.☆26Updated this week
- [Findings of NAACL2022] A Dog Is Passing Over The Jet? A Text-Generation Dataset for Korean Commonsense Reasoning and Evaluation☆27Updated 2 years ago
- ☆44Updated 11 months ago
- CareCall for Seniors: Role Specified Open-Domain Dialogue dataset generated by leveraging LLMs (NAACL 2022).☆60Updated 3 years ago
- Character-level Korean ELECTRA Model (음절 단위 한국어 ELECTRA)☆54Updated 2 years ago
- ☆36Updated last year
- Official code and dataset repository of KoBBQ (TACL 2024)☆17Updated last year
- ☆66Updated 4 years ago
- Jiphyeonjeon Season 3☆39Updated 3 years ago
- The code and models for "An Empirical Study of Tokenization Strategies for Various Korean NLP Tasks" (AACL-IJCNLP 2020)☆118Updated 4 years ago
- ☆19Updated 10 months ago
- 자연어 처리 기반 [한글 서술형 수학문제 데이터셋] 공개 저장소입니다.☆13Updated 2 years ago
- ☆28Updated 2 years ago
- ☆59Updated last year
- For the rlhf learning environment of Koreans☆23Updated last year