alisawuffles / tokenizer-attackLinks
Official implementation of "Data Mixture Inference: What do BPE tokenizers reveal about their training data?"
☆14Updated 2 months ago
Alternatives and similar repositories for tokenizer-attack
Users that are interested in tokenizer-attack are comparing it to the libraries listed below
Sorting:
- In-context Example Selection with Influences☆15Updated 2 years ago
- This is the repo for constructing a comprehensive and rigorous evaluation framework for LLM calibration.☆13Updated last year
- https://footprints.baulab.info☆17Updated 9 months ago
- ☆20Updated last year
- Official Repository for Dataset Inference for LLMs☆35Updated 11 months ago
- ACL24☆10Updated last year
- Few-shot Learning with Auxiliary Data☆28Updated last year
- ConceptVectors Benchmark and Code for the paper "Intrinsic Evaluation of Unlearning Using Parametric Knowledge Traces"☆36Updated 5 months ago
- The Codebase for Causal Distillation for Language Models (NAACL '22)☆25Updated 3 years ago
- Is In-Context Learning Sufficient for Instruction Following in LLMs? [ICLR 2025]☆30Updated 5 months ago
- [ICML 2024] Codes for C-RAG: Certified Generation Risks for Retrieval-Augmented Language Models☆17Updated last year
- ☆13Updated 2 years ago
- ☆19Updated last year
- Code for the ICLR 2021 Paper "In-N-Out: Pre-Training and Self-Training using Auxiliary Information for Out-of-Distribution Robustness"☆13Updated 3 years ago
- This is the official implementation for our ACL 2024 paper: "Causal Estimation of Memorisation Profiles".☆23Updated 3 months ago
- Fairer Preferences Elicit Improved Human-Aligned Large Language Model Judgments (Zhou et al., EMNLP 2024)☆13Updated 9 months ago
- ☆46Updated last year
- ☆13Updated 2 weeks ago
- Align your LM to express calibrated verbal statements of confidence in its long-form generations.☆26Updated last year
- Codebase for Context-aware Meta-learned Loss Scaling (CaMeLS). https://arxiv.org/abs/2305.15076.☆25Updated last year
- Measuring if attention is explanation with ROAR☆22Updated 2 years ago
- Simple and scalable tools for data-driven pretraining data selection.☆24Updated last month
- FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions☆45Updated last year
- CEBaB: Estimating the Causal Effects of Real-World Concepts on NLP Model Behavior☆12Updated 2 years ago
- ☆35Updated 2 years ago
- Post-processing for fair classification☆15Updated 2 weeks ago
- This repository contains some of the code used in the paper "Training Language Models with Langauge Feedback at Scale"☆27Updated 2 years ago
- Documenting large text datasets 🖼️ 📚☆12Updated 7 months ago
- MergeBench: A Benchmark for Merging Domain-Specialized LLMs☆16Updated 2 months ago
- Landing page for MIB: A Mechanistic Interpretability Benchmark☆16Updated last week