hasaniqbal777 / OpenFactCheck
An Open-source Factuality Evaluation Demo for LLMs
☆27Updated 2 months ago
Alternatives and similar repositories for OpenFactCheck:
Users that are interested in OpenFactCheck are comparing it to the libraries listed below
- Code for "Enhancing In-context Learning via Linear Probe Calibration"☆35Updated 10 months ago
- Code for paper "Unraveling Cross-Modality Knowledge Conflicts in Large Vision-Language Models."☆41Updated 4 months ago
- ☆30Updated 10 months ago
- [EMNLP 2024] This is the code for our paper "BMRetriever: Tuning Large Language Models as Better Biomedical Text Retrievers".☆21Updated 5 months ago
- Source code and data used in the papers ViQuAE (Lerner et al., SIGIR'22), Multimodal ICT (Lerner et al., ECIR'23) and Cross-modal Retriev…☆31Updated 2 months ago
- ☆14Updated last month
- [CVPR 2025 🔥] ALM-Bench is a multilingual multi-modal diverse cultural benchmark for 100 languages across 19 categories. It assesses the…☆31Updated this week
- Code repository for the paper "Mission: Impossible Language Models."☆47Updated 2 weeks ago
- Official Pytorch implementation of "Interpreting and Editing Vision-Language Representations to Mitigate Hallucinations" (ICLR '25)☆60Updated last month
- LoFiT: Localized Fine-tuning on LLM Representations☆33Updated last month
- Restore safety in fine-tuned language models through task arithmetic☆27Updated 11 months ago
- ☆19Updated 4 months ago
- [NAACL'25] Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering☆49Updated 3 months ago
- Do Vision and Language Models Share Concepts? A Vector Space Alignment Study☆14Updated 3 months ago
- If CLIP Could Talk: Understanding Vision-Language Model Representations Through Their Preferred Concept Descriptions☆15Updated 11 months ago
- Public code repo for paper "Aligning LLMs with Individual Preferences via Interaction"☆22Updated 5 months ago
- ☆16Updated last year
- Holistic Coverage and Faithfulness Evaluation of Large Vision-Language Models (ACL-Findings 2024)☆15Updated 10 months ago
- ☆11Updated last year
- RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment☆14Updated 2 months ago
- Mosaic IT: Enhancing Instruction Tuning with Data Mosaics☆17Updated 3 weeks ago
- Public code repo for EMNLP 2024 Findings paper "MACAROON: Training Vision-Language Models To Be Your Engaged Partners"☆13Updated 5 months ago
- [EMNLP 2024] Official implementation of "Hierarchical Deconstruction of LLM Reasoning: A Graph-Based Framework for Analyzing Knowledge Ut…☆21Updated 3 months ago
- Investigating Cultural Alignment of Large Language Models☆11Updated 6 months ago
- ☆11Updated last year
- ☆47Updated last year
- [NeurIPS 2023 D&B Track] Code and data for paper "Revisiting Out-of-distribution Robustness in NLP: Benchmarks, Analysis, and LLMs Evalua…☆31Updated last year