rifkiaputri / id-csqa
Dataset and code for paper: "Can LLM Generate Culturally Relevant Commonsense QA Data? Case Study in Indonesian and Sundanese".
☆16Updated last month
Alternatives and similar repositories for id-csqa:
Users that are interested in id-csqa are comparing it to the libraries listed below
- KOLD: Korean Offensive Language Dataset☆80Updated 2 years ago
- ☆16Updated 10 months ago
- Code for the paper "You Truly Understand What I Need : Intellectual and Friendly Dialogue Agents grounding Knowledge and Persona" which i…☆23Updated last year
- ☆34Updated 10 months ago
- This repo Implements "Dense Passage Retrieval for Open-Domain Question Answering" using Korean Dataset☆74Updated 2 years ago
- AVocaDo : Strategy for Adapting Vocabulary to Downstream Domain☆23Updated 2 years ago
- A Situational Conversation-Based English Education Platform☆21Updated last year
- This repo is for Korean wiki table question answering datasets described in the paper of Korean-Specific Dataset for Table Question Answe…☆91Updated 2 months ago
- [Findings of NAACL2022] A Dog Is Passing Over The Jet? A Text-Generation Dataset for Korean Commonsense Reasoning and Evaluation☆28Updated 2 years ago
- ☆20Updated last year
- 한국어 T5 모델☆48Updated 3 years ago
- ☆28Updated 2 years ago
- ☆57Updated last year
- ☆66Updated 4 years ago
- final-project-level3-nlp-02 created by GitHub Classroom☆11Updated 3 years ago
- Korean Commonsense Knowledge Graph☆14Updated 2 years ago
- ☆20Updated last year
- ☆15Updated 2 years ago
- BLEnD: A Benchmark for LLMs on Everyday Knowledge in Diverse Cultures and Languages☆22Updated last month
- ☆87Updated 2 years ago
- 한국어 생성 모델의 상식 추론을 위한 KommonGen 데이터셋입니다.☆19Updated 3 years ago
- This repository forked from parlAI. Korean Wizard of Wikipedia task was added to this repo. This repository is going to be moved after EM…☆16Updated 2 years ago
- The list of NLP paper and news I've checked. There might be short description of them (abstract) in Korean.☆21Updated this week
- 자연어 처리 기반 [한글 서술형 수학문제 데이터셋] 공개 저장소입니다.☆13Updated last year
- CharFormer(Tay et al., 2022; Gradient-based Subword Tokenizer + T5) model implementation for Huggingface Transformers☆21Updated 3 months ago
- 한국어 생성 문서의 원소 사실 관계에 대한 설명 기술☆14Updated last month
- BERTScore for Korean☆73Updated 10 months ago
- 한국어 벤치마크 평가 코드 통합본(?)☆12Updated 2 months ago
- Jiphyeonjeon Season 3☆39Updated 2 years ago
- CareCall for Seniors: Role Specified Open-Domain Dialogue dataset generated by leveraging LLMs (NAACL 2022).☆60Updated 2 years ago