Babelscape / LLM-Oasis
This repository contains the resource introduced in the paper: "Truth or Mirage? Towards End-to-End Factuality Evaluation with LLM-Oasis". LLM-Oasis is a large-scale resource for end-to-end factuality evaluation obtained by extracting and falsifying information from Wikipedia.
☆25Updated 3 months ago
Alternatives and similar repositories for LLM-Oasis
Users who are interested in LLM-Oasis are comparing it to the repositories listed below.
- Problem-Oriented Segmentation and Retrieval EMNLP 2024 Findings☆34Updated last year
- A tool to assist in the interpretation of learned features in sparse autoencoders (in particular the four SAEs trained by Joseph Bloom o…☆19Updated last year
- This is the repository for NAACL'25 paper "TART: An Open-Source Tool-Augmented Framework for Explainable Table-based Reasoning"☆56Updated 9 months ago
- [NeurIPS VLM workshop 2024] In-Context Ensemble Learning from Pseudo Labels Improves Video-Language Models for Low-Level Workflow Underst…☆23Updated 10 months ago
- Backtracing: Retrieving the Cause of the Query, EACL 2024 Long Paper, Findings.☆92Updated last year
- Code and data releases for the paper -- DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory☆59Updated 11 months ago
- Code repository for the paper "The Inherent Limits of Pretrained LLMs: The Unexpected Convergence of Instruction Tuning and In-Context Le…☆13Updated last year
- The first dense retrieval model that can be prompted like an LM☆90Updated 9 months ago
- ☆57Updated last month
- Code for the EMNLP 2024 paper "Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps"☆142Updated 3 months ago
- ☆19Updated 8 months ago
- The official implementation of Cross-Task Experience Sharing (COPS)☆29Updated last year
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT)☆128Updated last year
- Code associated with the EMNLP 2024 Main paper: "Image, tell me your story!" Predicting the original meta-context of visual misinformatio…☆45Updated 2 months ago
- ☆87Updated last year
- [ACL 2025] RetroLLM: Empowering LLMs to Retrieve Fine-grained Evidence within Generation☆118Updated last year
- [NeurIPS XAIA & Springer] Code and notebooks to paper "A Fresh Look at Sanity Checks for Saliency Maps"☆25Updated last year
- ☆17Updated 10 months ago
- Generate Python Package with Simple Prompts☆75Updated last year
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).☆82Updated last year
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya☆125Updated 6 months ago
- Initialization using Update Approximation is a Silver Bullet for Extremely Efficient Low-Rank Fine-Tuning☆52Updated 3 months ago
- Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models☆41Updated last year
- Source code for the collaborative reasoner research project at Meta FAIR.☆112Updated 9 months ago
- AceParse: A Comprehensive Dataset with Diverse Structured Texts for Academic Literature Parsing☆44Updated last year
- Official implementation of MetaTree: Learning a Decision Tree Algorithm with Transformers☆114Updated last year
- The official implementation of Preference Data Reward-Augmentation.☆18Updated 9 months ago
- Code for ExploreTom☆90Updated 7 months ago
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖☆76Updated last year
- Official Repo for CRMArena and CRMArena-Pro☆132Updated this week