amazon-science / llm-hallucinations-factual-qaLinks
☆12Updated 5 months ago
Alternatives and similar repositories for llm-hallucinations-factual-qa
Users that are interested in llm-hallucinations-factual-qa are comparing it to the libraries listed below
Sorting:
- Aioli: A unified optimization framework for language model data mixing☆27Updated 6 months ago
- Official repo for the paper "Exploring Automated Distractor Generation for Math Multiple-choice Questions via Large Language Models" at N…☆8Updated 5 months ago
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆35Updated last year
- ☆26Updated last year
- This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Updated last year
- Astraios: Parameter-Efficient Instruction Tuning Code Language Models☆58Updated last year
- Minimum Description Length probing for neural network representations☆18Updated 5 months ago
- Tasks for describing differences between text distributions.☆16Updated 11 months ago
- ☆26Updated last year
- This is official project in our paper: Is Bigger and Deeper Always Better? Probing LLaMA Across Scales and Layers☆30Updated last year
- ☆22Updated last year
- ☆16Updated last year
- ☆28Updated last week
- Code for "Merging Text Transformers from Different Initializations"☆20Updated 5 months ago
- NeurIPS'24 - LLM Safety Landscape☆25Updated 4 months ago
- 📰 Computing the information content of trained neural networks☆21Updated 3 years ago
- [ICLR'25] "Attention in Large Language Models Yields Efficient Zero-Shot Re-Rankers"☆25Updated 3 months ago
- Lottery Ticket Adaptation☆39Updated 8 months ago
- An official implementation of "Catastrophic Failure of LLM Unlearning via Quantization" (ICLR 2025)☆27Updated 4 months ago
- efficient query encoding for dense retrieval☆11Updated 11 months ago
- Plancraft is a minecraft environment and agent suite to test planning capabilities in LLMs☆15Updated last week
- Code for paper "W-RAG: Weakly Supervised Dense Retrieval in RAG for Open-domain Question Answering"☆13Updated 2 months ago
- ☆27Updated 2 years ago
- Applies ROME and MEMIT on Mamba-S4 models☆14Updated last year
- Generating Summaries with Controllable Readability Levels (EMNLP 2023)☆13Updated last week
- Code for the paper: CodeTree: Agent-guided Tree Search for Code Generation with Large Language Models☆24Updated 3 months ago
- This repository includes a benchmark and code for the paper "Evaluating LLMs at Detecting Errors in LLM Responses".☆30Updated 11 months ago
- ☆45Updated 3 months ago
- Can We Trust Large Language Models?: A Benchmark for Responsible Large Language Models via Toxicity, Bias, and Value-alignment Evaluation☆24Updated last year
- ☆27Updated 2 weeks ago