Liyan06 / MiniCheck
MiniCheck: Efficient Fact-Checking of LLMs on Grounding Documents [EMNLP 2024]
☆133Updated 2 months ago
Alternatives and similar repositories for MiniCheck:
Users that are interested in MiniCheck are comparing it to the libraries listed below
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners"☆104Updated 6 months ago
- Code accompanying "How I learned to start worrying about prompt formatting".☆102Updated 5 months ago
- Benchmarking LLMs with Challenging Tasks from Real Users☆218Updated 4 months ago
- Code for In-context Vectors: Making In Context Learning More Effective and Controllable Through Latent Space Steering☆167Updated last month
- ☆68Updated last year
- LOFT: A 1 Million+ Token Long-Context Benchmark☆180Updated this week
- Github repository for "RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models"☆160Updated 3 months ago
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]☆136Updated 4 months ago
- ☆307Updated 9 months ago
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)☆131Updated 4 months ago
- [EMNLP 2024 Findings] OneGen: Efficient One-Pass Unified Generation and Retrieval for LLMs.☆147Updated 4 months ago
- Codebase accompanying the Summary of a Haystack paper.☆75Updated 6 months ago
- ☆119Updated 5 months ago
- [NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`.☆151Updated last year
- ☆120Updated 9 months ago
- ☆142Updated 11 months ago
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M…☆207Updated 4 months ago
- Comprehensive benchmark for RAG☆144Updated 4 months ago
- Code for the EMNLP 2024 paper "Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps"☆119Updated 7 months ago
- Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".☆194Updated this week
- AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark☆133Updated 3 months ago
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…☆229Updated last month
- Retrieval Augmented Generation Generalized Evaluation Dataset☆52Updated 4 months ago
- Dense X Retrieval: What Retrieval Granularity Should We Use?☆152Updated last year
- [EMNLP 2023] The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning☆234Updated last year
- Spherical Merge Pytorch/HF format Language Models with minimal feature loss.☆117Updated last year
- Fact-Checking the Output of Generative Large Language Models in both Annotation and Evaluation.☆89Updated last year
- ☆114Updated 6 months ago
- awesome synthetic (text) datasets☆264Updated 4 months ago
- ☆160Updated 7 months ago