AlexTMallen / adaptive-retrieval
☆160Updated last year
Related projects: ⓘ
- A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomic…☆271Updated 4 months ago
- Github repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models"☆54Updated 8 months ago
- ☆114Updated 2 weeks ago
- [ICML 2023] Code for our paper “Compositional Exemplars for In-context Learning”.☆91Updated last year
- Code for Editing Factual Knowledge in Language Models☆134Updated 2 years ago
- Code and model release for the paper "Task-aware Retrieval with Instructions" by Asai et al.☆159Updated 11 months ago
- ☆78Updated last year
- ☆259Updated 8 months ago
- Code and Data for "Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering"☆77Updated last month
- A framework for few-shot evaluation of autoregressive language models.☆98Updated last year
- Codes for papers on Large Language Models Personalization (LaMP)☆102Updated 5 months ago
- ☆164Updated last month
- Scalable training for dense retrieval models.☆268Updated last year
- Tk-Instruct is a Transformer model that is tuned to solve many NLP tasks by following instructions.☆177Updated last year
- What's In My Big Data (WIMBD) - a toolkit for analyzing large text datasets☆174Updated last week
- Token-level Reference-free Hallucination Detection☆92Updated last year
- contrastive decoding☆174Updated last year
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆118Updated 6 months ago
- ☆68Updated last year
- RARR: Researching and Revising What Language Models Say, Using Language Models☆41Updated last year
- ACL 2023: Evaluating Open-Domain Question Answering in the Era of Large Language Models☆38Updated 8 months ago
- Implementation of "The Power of Scale for Parameter-Efficient Prompt Tuning"☆157Updated 3 years ago
- ACL2023 - AlignScore, a metric for factual consistency evaluation.☆104Updated 6 months ago
- Multilingual Large Language Models Evaluation Benchmark☆91Updated 3 weeks ago
- Github repository for "RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models"☆102Updated 2 months ago
- ☆64Updated 7 months ago
- Inspecting and Editing Knowledge Representations in Language Models☆105Updated last year
- A Survey on Data Selection for Language Models☆148Updated 3 months ago
- Repository for EMNLP 2022 Paper: Towards a Unified Multi-Dimensional Evaluator for Text Generation☆187Updated 7 months ago
- Official Repo for ICLR 2024 paper MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback by Xingyao Wang*, Ziha…☆100Updated 3 months ago