JacksonWuxs / UsableXAI_LLM
Using Explanations as a Tool for Advanced LLMs
☆58Updated 5 months ago
Alternatives and similar repositories for UsableXAI_LLM:
Users that are interested in UsableXAI_LLM are comparing it to the libraries listed below
- [ICLR 2025] InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales☆72Updated 2 weeks ago
- EMNLP 2024: Model Editing Harms General Abilities of Large Language Models: Regularization to the Rescue☆35Updated 3 months ago
- Codebase for reproducing the experiments of the semantic uncertainty paper (paragraph-length experiments).☆53Updated 10 months ago
- ☆41Updated 2 weeks ago
- Code for paper: Are Large Language Models Post Hoc Explainers?☆30Updated 6 months ago
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"☆104Updated 5 months ago
- Repository for the paper "Cognitive Mirage: A Review of Hallucinations in Large Language Models"☆47Updated last year
- In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation (ICML 2024)☆50Updated 10 months ago
- Models, data, and codes for the paper: MetaAligner: Towards Generalizable Multi-Objective Alignment of Language Models☆18Updated 4 months ago
- A Survey of Hallucination in Large Foundation Models☆51Updated last year
- [FCS'24] LVLM Safety paper☆17Updated last month
- Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering☆55Updated 2 months ago
- 【ACL 2024】 SALAD benchmark & MD-Judge☆125Updated 2 months ago
- [ACL 2024] Shifting Attention to Relevance: Towards the Predictive Uncertainty Quantification of Free-Form Large Language Models☆45Updated 5 months ago
- ☆125Updated last year
- [ICLR'25] UniGen: A Unified Framework for Dataset Generation via Large Language Model☆39Updated 2 months ago
- Paper list for the survey "Combating Misinformation in the Age of LLMs: Opportunities and Challenges" and the initiative "LLMs Meet Misin…☆96Updated 3 months ago
- [ICLR 2024] Unveiling the Pitfalls of Knowledge Editing for Large Language Models☆22Updated 8 months ago
- JAILJUDGE: A comprehensive evaluation benchmark which includes a wide range of risk scenarios with complex malicious prompts (e.g., synth…☆34Updated 2 months ago
- [ACL 2024 Findings] This is the code for our paper "Knowledge-Infused Prompting: Assessing and Advancing Clinical Text Data Generation wi…☆38Updated 7 months ago
- Generating diverse counterfactual data for Natural Language Understanding tasks using Large Language Models (LLMs). The generator support…☆35Updated last year
- Data and code for the Corr2Cause paper (ICLR 2024)☆93Updated 10 months ago
- Implementation of PaCE: Parsimonious Concept Engineering for Large Language Models (NeurIPS 2024)☆32Updated 3 months ago
- [NeurIPS 2024] Knowledge Circuits in Pretrained Transformers☆126Updated this week
- ☆30Updated 4 months ago
- The dataset and code for the ICLR 2024 paper "Can LLM-Generated Misinformation Be Detected?"☆54Updated 3 months ago
- ☆46Updated last month
- Recent papers on (1) Psychology of LLMs; (2) Biases in LLMs.☆46Updated last year
- Code and Data for "MIRAI: Evaluating LLM Agents for Event Forecasting"☆58Updated 7 months ago
- [ACL'24] Chain of Thought (CoT) is significant in improving the reasoning abilities of large language models (LLMs). However, the correla…☆41Updated last week