The implement of paper:"ReDeEP: Detecting Hallucination in Retrieval-Augmented Generation via Mechanistic Interpretability"
☆64Jun 3, 2025Updated 11 months ago
Alternatives and similar repositories for ReDEeP-ICLR
Users that are interested in ReDEeP-ICLR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- RAG Hallucination Detecting By LRP.☆11Mar 31, 2025Updated last year
- Materials for "Multi-property Steering of Large Language Models with Dynamic Activation Composition"☆14Nov 22, 2024Updated last year
- Source code for the ACL'2025 paper titled "Unveiling privacy risks in llm agent memory"☆30Dec 2, 2025Updated 5 months ago
- ☆14Oct 17, 2024Updated last year
- ☆16May 17, 2025Updated 11 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- The official implementation of ACL'24 paper: Synergistic Interplay between Search and Large Language Models for Information Retrieval.☆36Jun 6, 2024Updated last year
- Hands-on repository for fine-tuning Large Language Models (LLMs) in the clinical domain with tutorials☆16Jan 9, 2026Updated 4 months ago
- DICE: Detecting In-distribution Data Contamination with LLM's Internal State☆11Sep 21, 2024Updated last year
- ☆57Mar 27, 2023Updated 3 years ago
- ICL backdoor attack☆17Nov 4, 2024Updated last year
- ☆59Nov 18, 2024Updated last year
- [NeurIPS 2025] Reasoning Models Better Express Their Confidence"☆23Nov 19, 2025Updated 5 months ago
- Github repository for "RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models"☆242Dec 2, 2024Updated last year
- Code for the paper "Multi-Field Adaptive Retrieval," a research project on a semi-structured document retrieval☆17Feb 13, 2026Updated 2 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [NAACL'25 Oral] Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering☆80Jan 16, 2026Updated 3 months ago
- ☆14Aug 9, 2024Updated last year
- [ACL 2024] REANO: Optimising Retrieval-Augmented Reader Models through Knowledge Graph Generation☆12Sep 4, 2024Updated last year
- Artifact for TOSEM Submission: GiantRepair☆13Jun 26, 2024Updated last year
- Official repository for Trustworthy Alignment of Retrieval-Augmented Large Language Models via Reinforcement Learning☆12Sep 2, 2024Updated last year
- The official implementation of the paper "Data Contamination Calibration for Black-box LLMs" (ACL 2024)☆16May 21, 2024Updated last year
- ☆119Feb 11, 2025Updated last year
- [AAAI-25] Official repository of "Comprehensive Multi-Modal Prototypes are Simple and Effective Classifiers for Vast-Vocabulary Object De…☆20Dec 27, 2024Updated last year
- Code for the paper "Firewalls to Secure Dynamic LLM Agentic Networks"☆30Jun 6, 2025Updated 11 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Code for our paper "Decomposing The Dark Matter of Sparse Autoencoders"☆23Feb 6, 2025Updated last year
- Llama中文社区,最好的中文Llama大模型,完全开源可商用☆12Aug 5, 2023Updated 2 years ago
- [ACL 2025] The official code for "AGrail: A Lifelong Agent Guardrail with Effective and Adaptive Safety Detection".☆40Aug 4, 2025Updated 9 months ago
- Codes for ACL2023 paper: Knowledgeable Parameter Efficient Tuning Network for Commonsense Question Answering.☆11Sep 23, 2023Updated 2 years ago
- ☆14Mar 19, 2020Updated 6 years ago
- DSN jailbreak Attack & Evaluation Ensemble☆17Feb 7, 2026Updated 3 months ago
- ☆23Jan 3, 2025Updated last year
- Limited automatic tabular ML pipelines for generic MEDS datasets.☆18Apr 14, 2026Updated 3 weeks ago
- R labs for the book OpenIntro Statistics (https://www.openintro.org/stat/)☆13Nov 17, 2016Updated 9 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Enhancing contextual understanding in large language models through contrastive decoding☆20May 3, 2024Updated 2 years ago
- ☆41Jul 6, 2025Updated 10 months ago
- Fine tuning of the Retrieval-Augmented Generation (RAG) with a custom knowledge source.☆13Feb 10, 2021Updated 5 years ago
- ☆16Apr 27, 2024Updated 2 years ago
- ☆18Sep 1, 2025Updated 8 months ago
- Code and results accompanying the paper "Refusal in Language Models Is Mediated by a Single Direction".☆383Jun 13, 2025Updated 10 months ago
- [SIGIR '25] This is the code repo for our SIGIR '25 paper: Enhancing the Patent Matching Capability of Large Language Models via Memory G…☆19Apr 22, 2025Updated last year