☆36 · Feb 11, 2025 · Updated last year
Alternatives and similar repositories for deception-detection
Users that are interested in deception-detection are comparing it to the libraries listed below
- A library for bridging Python and HTML/Javascript (via Svelte) for creating interactive visualizations ☆15 · Apr 15, 2024 · Updated last year
- A collection of different ways to implement accessing and modifying internal model activations for LLMs ☆20 · Oct 18, 2024 · Updated last year
- ☆27 · Oct 6, 2024 · Updated last year
- AlgZoo: uninterpreted models with fewer than 1,500 parameters ☆43 · Jan 19, 2026 · Updated last month
- ☆48 · Sep 29, 2024 · Updated last year
- ☆15 · Aug 19, 2025 · Updated 6 months ago
- ☆14 · Mar 15, 2025 · Updated 11 months ago
- BLEU Score in Rust ☆12 · Updated this week
- ☆13 · Apr 10, 2025 · Updated 10 months ago
- Tool to decrypt encrypted strings in AgentTesla ☆16 · Jan 24, 2022 · Updated 4 years ago
- Panda is a set of utilities used to research how PsExec encrypts its traffic. ☆12 · Apr 20, 2021 · Updated 4 years ago
- Accelerating Transfer Learning with Robust Neural Nets ☆11 · Oct 2, 2020 · Updated 5 years ago
- ACL24 ☆11 · Jun 7, 2024 · Updated last year
- [TMLR 25] An automated method for explaining complex neuron behaviors in deep vision models using large language models ☆10 · Feb 20, 2025 · Updated last year
- ☆20 · Feb 3, 2025 · Updated last year
- Evaluate interpretability methods on localizing and disentangling concepts in LLMs. ☆57 · Oct 30, 2025 · Updated 4 months ago
- LLM experiments done during SERI MATS - focusing on activation steering / interpreting activation spaces ☆101 · Sep 21, 2023 · Updated 2 years ago
- Python-based cloud node for local use ☆11 · Mar 7, 2018 · Updated 7 years ago
- Codes for our paper "AgentMonitor: A Plug-and-Play Framework for Predictive and Secure Multi-Agent Systems" ☆13 · Dec 13, 2024 · Updated last year
- ☆16 · Apr 26, 2023 · Updated 2 years ago
- An interpretability library for pytorch ☆13 · Dec 31, 2022 · Updated 3 years ago
- SysFlow collection probe ☆17 · Nov 11, 2025 · Updated 3 months ago
- [Arxiv 2025] Official code and datasets of paper: GNNs as Predictors of Agentic Workflow Performances ☆21 · Jan 15, 2026 · Updated last month
- Code for the arXiv preprint "Answer, Assemble, Ace: Understanding How Transformers Answer Multiple Choice Questions" ☆15 · Aug 2, 2025 · Updated 6 months ago
- ☆14 · Apr 4, 2019 · Updated 6 years ago
- A very hacky set of functions for getting plotly to do what I want when doing mech interp research, designed to be compatible with PyTorc… ☆11 · Jun 16, 2023 · Updated 2 years ago
- Support UEFI load ☆11 · Oct 1, 2015 · Updated 10 years ago
- Localization of Knowledge in Text-to-Image Models ☆12 · Oct 8, 2024 · Updated last year
- Transform dumped executable memory back into an identical match from disk. Use network or local database to de-locate relocated binaries… ☆12 · Jan 10, 2016 · Updated 10 years ago
- ☆12 · Sep 16, 2024 · Updated last year
- ☆12 · Aug 12, 2024 · Updated last year
- minimalistic AI library that resembles HF's transformers ☆13 · Dec 31, 2024 · Updated last year
- ☆13 · Nov 10, 2020 · Updated 5 years ago
- ☆14 · May 30, 2024 · Updated last year
- Accompanying codebase for neuroscope.io, a website for displaying max activating dataset examples for language model neurons ☆13 · Feb 13, 2023 · Updated 3 years ago
- Materials for the article "Dissecting Hyper V" ☆14 · Nov 5, 2014 · Updated 11 years ago
- Code for the API, workload execution, and agents underlying the LLMail-Inject Adaptive Prompt Injection Challenge ☆19 · Updated this week
- Trains Sparse Autoencoders based on outputs from language models ☆11 · Oct 7, 2024 · Updated last year
- ☆58 · Nov 19, 2024 · Updated last year