nlpai-lab / MIRAGELinks
MIRAGE is a light benchmark to evaluate RAG performance.
☆14Updated last month
Alternatives and similar repositories for MIRAGE
Users that are interested in MIRAGE are comparing it to the libraries listed below
Sorting:
- KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models☆25Updated 10 months ago
- An unofficial implementation of SOLAR-10.7B model and the newly proposed interlocked-DUS(iDUS) implementation and experiment details.☆12Updated last year
- [NAACL 2024] Official repository for "KTRL+F: Knowledge-Augmented In-Document Search"☆23Updated 8 months ago
- "Why do I feel offended?" - Korean Dataset for Offensive Language Identification (EACL2023 Findings)☆15Updated 2 years ago
- For the rlhf learning environment of Koreans☆23Updated last year
- huggingface에 있는 한국어 데이터 세트☆28Updated 8 months ago
- evolve llm training instruction, from english instruction to any language.☆118Updated last year
- Keep Me Updated! Memory Management in Long-term Conversations (Findings of EMNLP 2022)☆30Updated 2 years ago
- Difference-based Contrastive Learning for Korean Sentence Embeddings☆24Updated 2 years ago
- Hate speech detection corpus in Korean, shared with EMNLP 2023 paper☆15Updated last year
- IntructIR, a novel benchmark specifically designed to evaluate the instruction following ability in information retrieval models. Our foc…☆32Updated last year
- [EMNLP 2023] Official repository for Dialogue Chain-of-Thought Distillation (DONUT & DOCTOR)☆11Updated last year
- CLIcK: A Benchmark Dataset of Cultural and Linguistic Intelligence in Korean☆45Updated 6 months ago
- ☆10Updated 9 months ago
- All-in-one repository for Fine-tuning & Pretraining (Large) Language Models☆15Updated 2 years ago
- Don't Judge a Language Model by Its Last Layer: Contrastive Learning with Layer-Wise Attention Pooling☆9Updated 2 years ago
- CareCall for Seniors: Role Specified Open-Domain Dialogue dataset generated by leveraging LLMs (NAACL 2022).☆60Updated 3 years ago
- Code and data for "KoDialogBench: Evaluating Conversational Understanding of Language Models with Korean Dialogue Benchmark" (LREC-COLING…☆16Updated 2 months ago
- 🤖 Code for our EMNLP 2022 paper: "BotsTalk: Machine-sourced Framework for Automatic Curation of Large-scale Multi-skill Dialogue Dataset…☆16Updated 8 months ago
- bpe based korean t5 model for text-to-text unified framework☆63Updated last year
- Reward Model을 이용하여 언어모델의 답변을 평가하기☆29Updated last year
- NeuralWOZ: Learning to Collect Task-Oriented Dialogue via Model-based Simulation (ACL-IJCNLP 2021)☆36Updated 3 years ago
- ☆20Updated 11 months ago
- official repository for ListT5☆45Updated 4 months ago
- [COLING 2022] Mind the Gap! Injecting Commonsense Knowledge for Abstractive Dialogue Summarization☆25Updated last year
- [ACL 2024] LangBridge: Multilingual Reasoning Without Multilingual Supervision☆89Updated 7 months ago
- ☆29Updated last year
- Repository for "Scaling Evaluation-time Compute with Reasoning Models as Process Evaluators"☆12Updated 3 months ago
- Code for the paper "You Truly Understand What I Need : Intellectual and Friendly Dialogue Agents grounding Knowledge and Persona" which i…☆23Updated 2 years ago
- About, prompt-based few-shot learning, Text Generation with Prompting☆13Updated 2 years ago