Babelscape / LLM-OasisLinks
This repository contains the resource introduced in the paper: "Truth or Mirage? Towards End-to-End Factuality Evaluation with LLM-Oasis". LLM-Oasis is a large-scale resource for end-to-end factuality evaluation obtained by extracting and falsifying information from Wikipedia.
☆24Updated last month
Alternatives and similar repositories for LLM-Oasis
Users that are interested in LLM-Oasis are comparing it to the libraries listed below
Sorting:
- Problem-Oriented Segmentation and Retrieval EMNLP 2024 Findings☆34Updated last year
- A tool to assist in the interpretation of learned features in sparse autoencoders (in particular the four SAE's trained by Joseph Bloom o…☆19Updated last year
- Code and data releases for the paper -- DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory☆56Updated 10 months ago
- Code associated with the EMNLP 2024 Main paper: "Image, tell me your story!" Predicting the original meta-context of visual misinformatio…☆44Updated last week
- The first dense retrieval model that can be prompted like an LM☆89Updated 7 months ago
- ☆55Updated 10 months ago
- Generate Python Package with Simple Prompts☆75Updated last year
- [NeurIPS VLM workshop 2024] In-Context Ensemble Learning from Pseudo Labels Improves Video-Language Models for Low-Level Workflow Underst…☆23Updated 8 months ago
- Backtracing: Retrieving the Cause of the Query, EACL 2024 Long Paper, Findings.☆92Updated last year
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya☆123Updated 4 months ago
- Code for the EMNLP 2024 paper "Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps"☆142Updated 2 months ago
- This is the repository for NAACL'25 paper "TART: An Open-Source Tool-Augmented Framework for Explainable Table-based Reasoning"☆56Updated 7 months ago
- Code repository for the paper "The Inherent Limits of Pretrained LLMs: The Unexpected Convergence of Instruction Tuning and In-Context Le…☆13Updated 10 months ago
- [NeurIPS XAIA & Springer] Code and notebooks to paper "A Fresh Look at Sanity Checks for Saliency Maps"☆25Updated last year
- ☆17Updated 8 months ago
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT)☆124Updated 10 months ago
- ☆84Updated last year
- ☆19Updated 6 months ago
- Source code for the collaborative reasoner research project at Meta FAIR.☆111Updated 7 months ago
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).☆79Updated last year
- [ACL 2025] RetroLLM: Empowering LLMs to Retrieve Fine-grained Evidence within Generation☆118Updated 10 months ago
- The official implementation of Cross-Task Experience Sharing (COPS)☆30Updated last year
- Official implementation of MetaTree: Learning a Decision Tree Algorithm with Transformers☆115Updated last year
- OLAPH: Improving Factuality in Biomedical Long-form Question Answering☆37Updated last year
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆68Updated 3 weeks ago
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models☆115Updated 8 months ago
- The official implementation of Preference Data Reward-Augmentation.☆18Updated 7 months ago
- Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models☆39Updated last year
- Open-source Python toolkit focused on deep learning with ordinal methodologies☆63Updated this week
- MIRIAD is a million-scale Medical Instruction and Retrieval Datatset☆133Updated 3 weeks ago