KUNLP / multihop-edit-eval
Official implementation of “Watch Your Step: A Fine-Grained Evaluation Framework for Multi-hop Knowledge Editing in Large Language Models” (CIKM 2025).
☆46Updated 2 months ago
Alternatives and similar repositories for multihop-edit-eval
Users who are interested in multihop-edit-eval are comparing it to the libraries listed below.
- ☆17Updated last year
- ☆25Updated last year
- ☆24Updated 2 years ago
- ☆18Updated last year
- ☆17Updated last year
- ☆27Updated 2 years ago
- ☆15Updated last month
- ☆17Updated last year
- ☆16Updated last year
- ☆16Updated last year
- ☆20Updated last year
- ☆16Updated 2 years ago
- Repository for NLPLAB_sLLM☆38Updated 8 months ago
- ☆20Updated last year
- Official Repository for "Revisiting the Impact of Pursuing Modularity for Code Generation"☆17Updated last year
- ☆15Updated 2 years ago
- code for the paper 'Multi-Task Learning for Knowledge Graph Completion with Pre-trained Language Models'☆16Updated 5 years ago
- The Korean version of the HumanEval benchmark, where the original docstrings are translated into Korean☆17Updated 7 months ago
- Code for "Never Too Late to Learn: Regularizing Gender bias in Coreference Resolution"☆43Updated last year
- ☆19Updated last year
- A list of NLP papers and news I've checked. Some entries may include a short description (abstract) in Korean.☆34Updated this week
- K-HALU: Multiple Answer Korean Hallucination Benchmark for Large Language Models☆37Updated last month
- SemEval2022 Task2: Multilingual Idiomaticity Detection and Sentence Embedding☆13Updated 3 years ago
- A guide for people who are new to Kaggle☆10Updated 5 years ago
- This repo implements "Dense Passage Retrieval for Open-Domain Question Answering" using a Korean dataset☆75Updated 3 years ago
- Official datasets and PyTorch implementation repository of SQuARe and KoSBi (ACL 2023)☆249Updated 2 years ago
- ☆12Updated 3 years ago
- ☆11Updated last year
- Research and data on query-based document summarization for building an explainable open-domain question answering system☆55Updated last year
- KOLD: Korean Offensive Language Dataset☆81Updated 3 years ago