ArGintum / GPTID

Official code repository for the article "Intrinsic Dimension Estimation for Robust Detection of AI-Generated Texts"

☆29

Alternatives and similar repositories for GPTID

Users that are interested in GPTID are comparing it to the libraries listed below

ahans30 / goldfish-loss
[NeurIPS 2024] Goldfish Loss: Mitigating Memorization in Generative LLMs
☆89 Updated 7 months ago
HazyResearch / aioli
Aioli: A unified optimization framework for language model data mixing
☆27 Updated 5 months ago
AIRI-Institute / LLM-Microscope
☆51 Updated 3 months ago
LoryPack / LLM-LieDetector
Code for the ICLR 2024 paper "How to catch an AI liar: Lie detection in black-box LLMs by asking unrelated questions"
☆70 Updated last year
FusionBrainLab / LLM-Microscope
☆70 Updated 9 months ago
r-three / RAD
Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model
☆43 Updated last year
AIRI-Institute / Probing_framework
Framework for probing tasks
☆27 Updated last year
epfl-dlab / llm-latent-language
Repo accompanying our paper "Do Llamas Work in English? On the Latent Language of Multilingual Transformers".
☆78 Updated last year
ucl-dark / llm_debate
Code release for "Debating with More Persuasive LLMs Leads to More Truthful Answers"
☆109 Updated last year
WSNLP / uncertainty_transformers
☆35 Updated 3 years ago
yurakuratov / hidden_capacity
Cramming 1568 Tokens into a Single Vector and Back Again: Exploring the Limits of Embedding Space Capacity (ACL 2025)
☆21 Updated last week
UFO-101 / auto-circuit
A library for efficient patching and automatic circuit discovery.
☆67 Updated 2 months ago
aryamanarora / causalgym
CausalGym: Benchmarking causal interpretability methods on linguistic tasks
☆43 Updated 6 months ago
peterljq / Parsimonious-Concept-Engineering
PaCE: Parsimonious Concept Engineering for Large Language Models (NeurIPS 2024)
☆37 Updated 7 months ago
ApolloResearch / e2e_sae
Sparse Autoencoder Training Library
☆52 Updated last month
tml-epfl / llm-past-tense
Does Refusal Training in LLMs Generalize to the Past Tense? [ICLR 2025]
☆69 Updated 5 months ago
limenlp / safer-instruct
This is the official repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"
☆17 Updated last year
technion-cs-nlp / hallucination-mitigation
☆22 Updated 6 months ago
ZonglinY / MOOSE
[ACL 2024] <Large Language Models for Automated Open-domain Scientific Hypotheses Discovery>. It has also received the best poster award …
☆42 Updated 7 months ago
thestephencasper / explore_establish_exploit_llms
☆31 Updated last year
explanare / ravel
Evaluate interpretability methods on localizing and disentangling concepts in LLMs.
☆47 Updated 8 months ago
ruiqi-zhong / nlparam
Augmenting Statistical Models with Natural Language Parameters
☆27 Updated 9 months ago
Yu-Fangxu / FoR
[ICML 2025] Flow of Reasoning: Training LLMs for Divergent Reasoning with Minimal Examples
☆95 Updated 2 weeks ago
ltgoslo / bert-in-context
Official implementation of "BERTs are Generative In-Context Learners"
☆28 Updated 3 months ago
Lagooon / LeanSTaR
☆42 Updated 9 months ago
haotiansun14 / BBox-Adapter
Lightweight Adapting for Black-Box Large Language Models
☆22 Updated last year
hannamw / EAP-IG
☆37 Updated last month
jkallini / mission-impossible-language-models
Code repository for the paper "Mission: Impossible Language Models."
☆52 Updated last month
roeehendel / icl_task_vectors
☆95 Updated last year
GXimingLu / IPA
Codebase for Inference-Time Policy Adapters
☆24 Updated last year