yejinc00 / PREMIRLinks
[EMNLP 2025] The official implementation of "Zero-shot Multimodal Document Retrieval via Cross-Modal Question Generation"
☆15Updated 5 months ago
Alternatives and similar repositories for PREMIR
Users that are interested in PREMIR are comparing it to the libraries listed below
Sorting:
- The most modern LLM evaluation toolkit☆70Updated 3 months ago
- 한국어 벤치마크 평가 코드 통합본(?)☆20Updated last year
- MIRAGE is a light benchmark to evaluate RAG performance.☆33Updated 8 months ago
- K-HALU: Multiple Answer Korean Hallucination Benchmark for Large Language Models☆38Updated last month
- Paper Reproduction Google SCoRE(Training Language Models to Self-Correct via Reinforcement Learning)☆142Updated last year
- huggingface에 있는 한국어 데이터 세트☆35Updated last year
- The list of NLP paper and news I've checked. There might be short description of them (abstract) in Korean.☆37Updated this week
- 구글에서 발표한 Chain-of-Thought Reasoning without Prompting을 코드로 구현한 레포입니다.☆65Updated last year
- level2-nlp-generationfornlp-nlp-05-lv3 created by GitHub Classroom☆14Updated last year
- Developing a Korean LLM model : Hate Speech Filtering, Improving conversational skills, Finetuning with the RLHF method☆19Updated 8 months ago
- Official repository for KoMT-Bench built by LG AI Research☆71Updated last year
- This repository aims to develop CoT Steering based on CoT without Prompting. It focuses on enhancing the model’s latent reasoning capabil…☆115Updated 7 months ago
- ☆19Updated 2 years ago
- huggingface transformers tutorial, code, resources☆26Updated last year
- Evaluating Multimodal Generative AI with Korean Educational Standards, NAACL 2025.☆24Updated 8 months ago
- Official datasets and pytorch implementation repository of SQuARe and KoSBi (ACL 2023)☆248Updated 2 years ago
- Paper list and short/long summaries I've read for my research or interests☆23Updated last year
- ☆61Updated 4 months ago
- BERT score for text generation☆12Updated last year
- [ACL 2025] DICE-BENCH: Evaluating the Tool-Use Capabilities of Large Language Models in Multi-Round, Multi-Party Dialogues☆26Updated 7 months ago
- ☆114Updated 6 months ago
- Benchmark in Korean Context☆136Updated 2 years ago
- Forked repo from https://github.com/EleutherAI/lm-evaluation-harness/commit/1f66adc☆82Updated last year
- Automatically Update NLP Papers Daily using Github Actions (ref: https://github.com/Vincentqyw/cv-arxiv-daily)☆103Updated this week
- [ Text Analytics ] 법률 도메인 특화 한국어 기반 LLM 개발☆14Updated 4 months ago
- 한국어 의료 분야 특화 챗봇 프로젝트☆32Updated 2 years ago
- ☆69Updated last year
- This repo Implements "Dense Passage Retrieval for Open-Domain Question Answering" using Korean Dataset☆75Updated 3 years ago
- The Universe of Evaluation. All about the evaluation for LLMs.☆232Updated last year
- ☆105Updated 3 months ago