KUNLP / multihop-edit-evalLinks
Official implementation of “Watch Your Step: A Fine-Grained Evaluation Framework for Multi-hop Knowledge Editing in Large Language Models” (CIKM 2025).
☆46Updated last month
Alternatives and similar repositories for multihop-edit-eval
Users that are interested in multihop-edit-eval are comparing it to the libraries listed below
Sorting:
- ☆25Updated last year
- ☆17Updated last year
- ☆17Updated last year
- ☆18Updated last year
- ☆24Updated 2 years ago
- ☆15Updated 3 weeks ago
- ☆17Updated last year
- ☆16Updated last year
- ☆27Updated 2 years ago
- ☆16Updated last year
- ☆20Updated last year
- ☆16Updated 2 years ago
- Official Repository for "Revisiting the Impact of Pursuing Modularity for Code Generation"☆17Updated last year
- ☆15Updated 2 years ago
- The Korean version of the HumanEval benchmark, where the original docstrings are translated into Korean☆17Updated 7 months ago
- SemEval2022 Task2: Multilingual Idiomaticity Detection and Sentence Embedding☆13Updated 3 years ago
- resository for NLPLAB_sLLM☆38Updated 7 months ago
- code for the paper 'Multi-Task Learning for Knowledge Graph Completion with Pre-trained Language Models'☆16Updated 4 years ago
- ☆20Updated last year
- ☆12Updated 3 years ago
- K-HALU: Multiple Answer Korean Hallucination Benchmark for Large Language Models☆37Updated last week
- ☆19Updated last year
- Official Repository for "Hyper-CL: Conditioning Sentence Representations with Hypernetworks"☆17Updated last year
- Code for "Never Too Late to Learn: Regularizing Gender bias in Coreference Resolution"☆43Updated last year
- Code for the paper "You Truly Understand What I Need : Intellectual and Friendly Dialogue Agents grounding Knowledge and Persona" which i…☆23Updated 2 years ago
- Official datasets and pytorch implementation repository of SQuARe and KoSBi (ACL 2023)☆248Updated 2 years ago
- KOLD: Korean Offensive Language Dataset☆82Updated 3 years ago
- The list of NLP paper and news I've checked. There might be short description of them (abstract) in Korean.☆33Updated this week
- A Situational Conversation-Based English Education Platform☆21Updated 2 years ago
- Official Repository for "BlendX: Complex Multi-intent Detection with Blended Patterns"☆27Updated 5 months ago