alisawuffles / tokenizer-attack

Official implementation of "Data Mixture Inference: What do BPE tokenizers reveal about their training data?"

☆14

Alternatives and similar repositories for tokenizer-attack

Users who are interested in tokenizer-attack are comparing it to the repositories listed below.

BrachioLab / incontext_influences
In-context Example Selection with Influences
☆15Updated 2 years ago
shuoli90 / Rank-Calibration
This is the repo for constructing a comprehensive and rigorous evaluation framework for LLM calibration.
☆13Updated last year
sfeucht / footprints
https://footprints.baulab.info
☆17Updated 9 months ago
formll / resolving-scaling-law-discrepancies
☆20Updated last year
pratyushmaini / llm_dataset_inference
Official Repository for Dataset Inference for LLMs
☆35Updated 11 months ago
eth-lre / LLM_ICL
ACL24
☆10Updated last year
alon-albalak / FLAD
Few-shot Learning with Auxiliary Data
☆28Updated last year
yihuaihong / ConceptVectors
ConceptVectors Benchmark and Code for the paper "Intrinsic Evaluation of Unlearning Using Parametric Knowledge Traces"
☆36Updated 5 months ago
frankaging / Causal-Distill
The Codebase for Causal Distillation for Language Models (NAACL '22)
☆25Updated 3 years ago
tml-epfl / icl-alignment
Is In-Context Learning Sufficient for Instruction Following in LLMs? [ICLR 2025]
☆30Updated 5 months ago
kangmintong / C-RAG
[ICML 2024] Codes for C-RAG: Certified Generation Risks for Retrieval-Augmented Language Models
☆17Updated last year
mireshghallah / ft-memorization
☆13Updated 2 years ago
shadowkiller33 / Contrast-Instruction
☆19Updated last year
p-lambda / in-n-out
Code for the ICLR 2021 Paper "In-N-Out: Pre-Training and Self-Training using Auxiliary Information for Out-of-Distribution Robustness"
☆13Updated 3 years ago
pietrolesci / memorisation-profiles
This is the official implementation for our ACL 2024 paper: "Causal Estimation of Memorisation Profiles".
☆23Updated 3 months ago
cambridgeltl / zepo
Fairer Preferences Elicit Improved Human-Aligned Large Language Model Judgments (Zhou et al., EMNLP 2024)
☆13Updated 9 months ago
milesaturpin / cot-unfaithfulness
☆46Updated last year
Hritikbansal / jpo
☆13Updated 2 weeks ago
tatsu-lab / linguistic_calibration
Align your LM to express calibrated verbal statements of confidence in its long-form generations.
☆26Updated last year
nathanhu0 / CaMeLS
Codebase for Context-aware Meta-learned Loss Scaling (CaMeLS). https://arxiv.org/abs/2305.15076.
☆25Updated last year
AndreasMadsen / nlp-roar-interpretability
Measuring if attention is explanation with ROAR
☆22Updated 2 years ago
TristanThrush / perplexity-correlations
Simple and scalable tools for data-driven pretraining data selection.
☆24Updated last month
orionw / FollowIR
FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions
☆45Updated last year
CEBaBing / CEBaB
CEBaB: Estimating the Causal Effects of Real-World Concepts on NLP Model Behavior
☆12Updated 2 years ago
jmerullo / lm_vector_arithmetic
☆35Updated 2 years ago
uiuctml / fair-classification
Post-processing for fair classification
☆15Updated 2 weeks ago
JeremyAlain / imitation_learning_from_language_feedback
This repository contains some of the code used in the paper "Training Language Models with Language Feedback at Scale"
☆27Updated 2 years ago
ruyimarone / data-portraits
Documenting large text datasets 🖼️ 📚
☆12Updated 7 months ago
uiuctml / MergeBench
MergeBench: A Benchmark for Merging Domain-Specialized LLMs
☆16Updated 2 months ago
aaronmueller / MIB
Landing page for MIB: A Mechanistic Interpretability Benchmark
☆16Updated last week