chrisvdweth / seleneLinks
An open, large-scale, interactive textbook.
☆44Updated this week
Alternatives and similar repositories for selene
Users that are interested in selene are comparing it to the libraries listed below
Sorting:
- Papers on fairness in NLP☆448Updated last year
- ☆146Updated last year
- ☆46Updated last year
- ☆62Updated last year
- A reading list of up-to-date papers on NLP for Social Good.☆303Updated last year
- ACL 2022: An Empirical Survey of the Effectiveness of Debiasing Techniques for Pre-trained Language Models.☆144Updated 2 weeks ago
- Aligning AI With Shared Human Values (ICLR 2021)☆297Updated 2 years ago
- ☆226Updated last year
- `dattri` is a PyTorch library for developing, benchmarking, and deploying efficient data attribution algorithms.☆84Updated 2 months ago
- A reading list for papers on causality for natural language processing (NLP)☆665Updated 3 months ago
- This repository contains the data and code introduced in the paper "CrowS-Pairs: A Challenge Dataset for Measuring Social Biases in Maske…☆123Updated last year
- ☆31Updated 4 years ago
- The one-stop repository for large language model (LLM) unlearning. Supports TOFU, MUSE, WMDP, and many unlearning methods. All features: …☆358Updated last month
- A resource repository for machine unlearning in large language models☆477Updated last month
- Training data extraction on GPT-2☆191Updated 2 years ago
- A resource repository for representation engineering in large language models☆132Updated 9 months ago
- ☆89Updated 3 years ago
- ☆295Updated 3 weeks ago
- ☆114Updated last year
- ☆18Updated 4 years ago
- A repo for open resources & information for people to succeed in PhD in CS & career in AI / NLP☆942Updated 11 months ago
- DataInf: Efficiently Estimating Data Influence in LoRA-tuned LLMs and Diffusion Models (ICLR 2024)☆74Updated 11 months ago
- ☆48Updated last month
- Materials for EACL2024 tutorial: Transformer-specific Interpretability☆60Updated last year
- Python package for measuring memorization in LLMs.☆165Updated last month
- A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity.☆77Updated 5 months ago
- Resources for cultural NLP research☆103Updated 4 months ago
- This repository collects all relevant resources about interpretability in LLMs☆370Updated 10 months ago
- ☆97Updated 3 years ago
- A comprehensive open-source guide that demystifies how U.S. universities evaluate and admit students into Computer Science PhD programs.☆151Updated last week