smartspot2 / edapiLinks
Python integration with the Ed API
☆18Updated 5 months ago
Alternatives and similar repositories for edapi
Users that are interested in edapi are comparing it to the libraries listed below
Sorting:
- [EMNLP'24] Evaluating LLM performance and sensitivity when there is a "task-switch". Code for "LLM Task Interference: An Initial Study on…☆14Updated 8 months ago
- `dattri` is a PyTorch library for developing, benchmarking, and deploying efficient data attribution algorithms.☆78Updated last month
- ☆51Updated 2 years ago
- Providing the answer to "How to do patching on all available SAEs on GPT-2?". It is an official repository of the implementation of the p…☆11Updated 5 months ago
- ☆22Updated 5 months ago
- ☆23Updated last year
- ☆18Updated 7 months ago
- Align your LM to express calibrated verbal statements of confidence in its long-form generations.☆26Updated last year
- ☆12Updated 2 months ago
- Sparse Autoencoder Training Library☆53Updated 2 months ago
- Algebraic value editing in pretrained language models☆65Updated last year
- ☆79Updated 4 months ago
- Code and Data Repo for the CoNLL Paper -- Future Lens: Anticipating Subsequent Tokens from a Single Hidden State☆18Updated last year
- Repository for "Propagating Knowledge Updates to LMs Through Distillation" (NeurIPS 2023).☆26Updated 10 months ago
- Official repository for our NeurIPS 2023 paper "Paraphrasing evades detectors of AI-generated text, but retrieval is an effective defense…☆174Updated last year
- UnQovering Stereotyping Biases via Underspecified Questions - EMNLP 2020 (Findings)☆22Updated 4 years ago
- Evaluate interpretability methods on localizing and disentangling concepts in LLMs.☆49Updated 9 months ago
- A curated list of papers related to constrained decoding of LLM, along with their relevant code and resources.☆235Updated this week
- [NeurIPS 2023 D&B Track] Code and data for paper "Revisiting Out-of-distribution Robustness in NLP: Benchmarks, Analysis, and LLMs Evalua…☆33Updated 2 years ago
- Forcing Diffuse Distributions out of Language Models☆16Updated 10 months ago
- General-purpose activation steering library☆84Updated 2 months ago
- Code for the paper "Aligning LLM Agents by Learning Latent Preference from User Edits".☆41Updated 7 months ago
- AI Logging for Interpretability and Explainability🔬☆124Updated last year
- ☆10Updated last month
- Sparse probing paper full code.☆58Updated last year
- The repository for paper <Evaluating Open-QA Evaluation>☆25Updated last year
- A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity.☆74Updated 4 months ago
- ☆29Updated last year
- ☆87Updated 11 months ago
- ☆36Updated 11 months ago