kmeng01 / romeLinks
Locating and editing factual associations in GPT (NeurIPS 2022)
☆715Updated last year
Alternatives and similar repositories for rome
Users that are interested in rome are comparing it to the libraries listed below
Sorting:
- Inference-Time Intervention: Eliciting Truthful Answers from a Language Model☆563Updated 11 months ago
- Mass-editing thousands of facts into a transformer memory (ICLR 2023)☆535Updated last year
- Representation Engineering: A Top-Down Approach to AI Transparency☆934Updated last year
- ☆244Updated last year
- This is the repository of HaluEval, a large-scale hallucination evaluation benchmark for Large Language Models.☆541Updated last year
- Tools for understanding how transformer predictions are built layer-by-layer☆555Updated 4 months ago
- TruthfulQA: Measuring How Models Imitate Human Falsehoods☆859Updated 11 months ago
- Official implementation for the paper "DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models"☆533Updated 11 months ago
- ☆249Updated 3 years ago
- RewardBench: the first evaluation tool for reward models.