jongjyh / TrFr
Truth Forest: Toward Multi-Scale Truthfulness in Large Language Models through Intervention without Tuning
☆46Updated last year
Alternatives and similar repositories for TrFr
Users that are interested in TrFr are comparing it to the libraries listed below
- [NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`.☆154Updated 2 years ago
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners"☆116Updated last week
- Public code repo for paper "SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales"☆109Updated last year
- ☆155Updated last year
- Dense X Retrieval: What Retrieval Granularity Should We Use?☆163Updated last year
- ☆128Updated last year
- Codebase accompanying the Summary of a Haystack paper.☆79Updated last year
- Functional Benchmarks and the Reasoning Gap☆89Updated last year
- The Official Repository for "Bring Your Own Data! Self-Supervised Evaluation for Large Language Models"☆107Updated 2 years ago
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆78Updated last year
- [ICLR 2024 Spotlight] FLASK: Fine-grained Language Model Evaluation based on Alignment Skill Sets☆218Updated last year
- [ICLR 2024 & NeurIPS 2023 WS] An Evaluator LM that is open-source, offers reproducible evaluation, and inexpensive to use. Specifically d…☆306Updated last year
- A comprehensive guide to LLM evaluation methods designed to assist in identifying the most suitable evaluation techniques for various use…☆145Updated 2 weeks ago
- ☆146Updated last year
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆135Updated last year
- Model, Code & Data for the EMNLP'23 paper "Making Large Language Models Better Data Creators"☆134Updated 2 years ago
- Mixing Language Models with Self-Verification and Meta-Verification☆109Updated 10 months ago
- Code accompanying "How I learned to start worrying about prompt formatting".☆112Updated 4 months ago
- Banishing LLM Hallucinations Requires Rethinking Generalization☆275Updated last year
- This is the repository for our paper "INTERS: Unlocking the Power of Large Language Models in Search with Instruction Tuning"☆204Updated 10 months ago
- Source code of the paper: RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering [F…☆68Updated last year
- The official evaluation suite and dynamic data release for MixEval.☆250Updated 11 months ago
- Official repository for "Scaling Retrieval-Based Language Models with a Trillion-Token Datastore".☆218Updated last week
- ☆297Updated last year
- MiniCheck: Efficient Fact-Checking of LLMs on Grounding Documents [EMNLP 2024]☆187Updated 2 months ago
- The code for the paper ROUTERBENCH: A Benchmark for Multi-LLM Routing System☆147Updated last year
- Evaluating LLMs with fewer examples☆164Updated last year
- Domain Adapted Language Modeling Toolkit - E2E RAG☆329Updated 11 months ago
- 🔧 Compare how Agent systems perform on several benchmarks. 📊🚀☆102Updated 2 months ago
- ☆43Updated last year