Babelscape / LLM-Oasis
This repository contains the resource introduced in the paper: "Truth or Mirage? Towards End-to-End Factuality Evaluation with LLM-Oasis". LLM-Oasis is a large-scale resource for end-to-end factuality evaluation obtained by extracting and falsifying information from Wikipedia.
☆23Updated 2 weeks ago
Alternatives and similar repositories for LLM-Oasis
Users who are interested in LLM-Oasis are comparing it to the repositories listed below:
- Code and data releases for the paper -- DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory☆52Updated 8 months ago
- Problem-Oriented Segmentation and Retrieval EMNLP 2024 Findings☆34Updated 11 months ago
- The first dense retrieval model that can be prompted like an LM☆89Updated 5 months ago
- A tool to assist in the interpretation of learned features in sparse autoencoders (in particular the four SAEs trained by Joseph Bloom o…☆19Updated last year
- Backtracing: Retrieving the Cause of the Query, EACL 2024 Long Paper, Findings.☆91Updated last year
- Code associated with the EMNLP 2024 Main paper: "Image, tell me your story!" Predicting the original meta-context of visual misinformatio…☆44Updated 2 months ago
- This is the repository for NAACL'25 paper "TART: An Open-Source Tool-Augmented Framework for Explainable Table-based Reasoning"☆55Updated 5 months ago
- ☆17Updated 6 months ago
- This repository contains the code for the paper: SirLLM: Streaming Infinite Retentive LLM☆60Updated last year
- Code for the EMNLP 2024 paper "Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps"☆136Updated 2 weeks ago
- [NeurIPS VLM workshop 2024] In-Context Ensemble Learning from Pseudo Labels Improves Video-Language Models for Low-Level Workflow Underst…☆23Updated 7 months ago
- Code repository for the paper "The Inherent Limits of Pretrained LLMs: The Unexpected Convergence of Instruction Tuning and In-Context Le…☆13Updated 9 months ago
- ☆19Updated 5 months ago
- ☆84Updated last year
- The official implementation of Cross-Task Experience Sharing (COPS)☆29Updated last year
- Leveraging Base Language Models for Few-Shot Synthetic Data Generation☆36Updated 2 weeks ago
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT)☆120Updated 8 months ago
- ☆52Updated 9 months ago
- Generate Python Package with Simple Prompts☆73Updated 11 months ago
- [ACL 2025] How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training☆45Updated 3 months ago
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).☆79Updated last year
- Source code for the collaborative reasoner research project at Meta FAIR.☆103Updated 6 months ago
- OLAPH: Improving Factuality in Biomedical Long-form Question Answering☆37Updated last year
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya☆117Updated 2 months ago
- This is the official code for the paper "Virus: Harmful Fine-tuning Attack for Large Language Models Bypassing Guardrail Moderation"☆51Updated 9 months ago
- RetroLLM: Empowering LLMs to Retrieve Fine-grained Evidence within Generation [ACL 2025]☆119Updated 9 months ago
- Official Repo for CRMArena and CRMArena-Pro☆119Updated 4 months ago
- ☆50Updated 5 months ago
- XmodelLM☆38Updated 11 months ago
- Code for the paper "HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts via HyperNetwork"☆33Updated last year