sophiegroenwold / AAVE_SAE_dataset
Dataset accompanying the paper "Investigating African-American Vernacular English in Transformer-Based Text Generation."
☆10Updated 2 years ago
Alternatives and similar repositories for AAVE_SAE_dataset:
Users that are interested in AAVE_SAE_dataset are comparing it to the libraries listed below
- ☆91Updated 9 months ago
- The corresponding code for our paper: "Exploring the Challenges of Open Domain Multi-Document Summarization". Do not hesitate to open an …☆32Updated last year
- Multi-LexSum is an abstractive summarization dataset for US Civil Rights Lawsuits☆19Updated 2 years ago
- Factored Cognition Primer: How to write compositional language model programs☆48Updated 2 years ago
- ☆23Updated 6 months ago
- Pre-train Static Word Embeddings☆48Updated last week
- A collection of datasets for language model pretraining including scripts for downloading, preprocesssing, and sampling.☆56Updated 7 months ago
- Repo for the paper "Detecting Logical Fallacies: From Quiz to Climate Change News" (2021)☆73Updated last year
- MultiCite code and data. Models are available on Huggingface.☆31Updated 2 years ago
- Source code and data for Like a Good Nearest Neighbor☆28Updated 2 months ago
- ☆93Updated 2 months ago
- Semantically Structured Sentence Embeddings☆65Updated 4 months ago
- Master thesis: Exploring bias in German NLG (GPT-3 & GerPT-2). Applies regard classification and bias mitigation triggers.☆15Updated 5 months ago
- ☆22Updated last year
- Experiments on including metadata such as URLs, timestamps, website descriptions and HTML tags during pretraining.☆31Updated last year
- https://arxiv.org/abs/2404.10917☆14Updated 9 months ago
- ☆32Updated 5 months ago
- Toolkit for building prompt templates for language models☆12Updated 2 years ago
- NAACL 2024. Code & Dataset for "🌁 Bridging the Novice-Expert Gap via Models of Decision-Making: A Case Study on Remediating Math Mistake…☆34Updated 7 months ago
- ☆24Updated 3 months ago
- Data and code for the paper "Inducing Positive Perspectives with Text Reframing"☆57Updated last year
- ☆26Updated 2 weeks ago
- Codes and Datasets for our ACL 2023 paper on cognitive reframing of negative thoughts☆58Updated last year
- A dataset for pretraining language models targeted for legal tasks.☆127Updated 2 years ago
- ☆44Updated 3 months ago
- Plug-and-play Search Interfaces with Pyserini and Hugging Face☆31Updated last year
- ☆34Updated 5 months ago
- Code and Data for "Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering"☆83Updated 7 months ago
- ☆38Updated 3 months ago