chrisvdweth / selene
An open, large-scale, interactive textbook.
☆56 Updated this week
Alternatives and similar repositories for selene
Users who are interested in selene are comparing it to the repositories listed below.
- ☆156 Updated 2 years ago
- ☆59 Updated 2 years ago
- Papers on fairness in NLP ☆451 Updated last year
- ☆63 Updated last year
- Resources for cultural NLP research ☆112 Updated 2 months ago
- A resource repository for representation engineering in large language models ☆143 Updated last year
- A survey and reflection on the latest research breakthroughs in LLM-generated Text detection, including data, detectors, metrics, current… ☆237 Updated 11 months ago
- Must-read Papers on Gender Bias. ☆40 Updated 3 years ago
- A reading list for papers on causality for natural language processing (NLP) ☆677 Updated 6 months ago
- Materials for EACL2024 tutorial: Transformer-specific Interpretability ☆61 Updated last year
- ACL 2022: An Empirical Survey of the Effectiveness of Debiasing Techniques for Pre-trained Language Models. ☆153 Updated 4 months ago
- ☆116 Updated last year
- [NeurIPS D&B '25] The one-stop repository for large language model (LLM) unlearning. Supports TOFU, MUSE, WMDP, and many unlearning metho… ☆448 Updated 2 weeks ago
- Aligning AI With Shared Human Values (ICLR 2021) ☆305 Updated 2 years ago
- A Python Data Valuation Package ☆30 Updated 2 years ago
- [ICLR 2025] General-purpose activation steering library ☆130 Updated 3 months ago
- ☆165 Updated last year
- ☆26 Updated 3 weeks ago
- ☆57 Updated 2 years ago
- ☆223 Updated last year
- A resource repository for machine unlearning in large language models ☆513 Updated 5 months ago
- [NeurIPS 2023 Spotlight] In-Context Impersonation Reveals Large Language Models' Strengths and Biases ☆22 Updated last year
- ☆90 Updated 3 years ago
- DataInf: Efficiently Estimating Data Influence in LoRA-tuned LLMs and Diffusion Models (ICLR 2024) ☆78 Updated last year
- A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity. ☆84 Updated 9 months ago
- Python package for measuring memorization in LLMs. ☆175 Updated 5 months ago
- Official repository for our NeurIPS 2023 paper "Paraphrasing evades detectors of AI-generated text, but retrieval is an effective defense… ☆181 Updated 2 years ago
- This repository contains the data and code introduced in the paper "CrowS-Pairs: A Challenge Dataset for Measuring Social Biases in Maske… ☆127 Updated last year
- ICLR 2025 Workshop & CHI 2025 SIG: "Bidirectional Human-AI Alignment" ☆47 Updated last year
- A reading list of up-to-date papers on NLP for Social Good. ☆304 Updated 2 years ago