sophiegroenwold / AAVE_SAE_dataset
Dataset accompanying the paper "Investigating African-American Vernacular English in Transformer-Based Text Generation."
☆10 Updated 3 years ago
Alternatives and similar repositories for AAVE_SAE_dataset
Users who are interested in AAVE_SAE_dataset are comparing it to the repositories listed below.
- Multi-LexSum is an abstractive summarization dataset for US Civil Rights Lawsuits ☆21 Updated 3 years ago
- Semantically Structured Sentence Embeddings ☆71 Updated last year
- Multidocument Summarization for Literature Review Shared Task 2022 ☆30 Updated 3 years ago
- The corresponding code for our paper: "Exploring the Challenges of Open Domain Multi-Document Summarization". Do not hesitate to open an … ☆33 Updated 2 years ago
- Detecting Bias and ensuring Fairness in AI solutions ☆102 Updated 2 years ago
- ☆100 Updated last year
- 🌾 Universal, customizable and deployable fine-grained evaluation for text generation. ☆24 Updated 2 years ago
- MultiCite code and data. Models are available on Huggingface. ☆32 Updated 3 years ago
- Repository for Zheng and Guha et al., 2021, "When Does Pretraining Help? Assessing Self-Supervised Learning for Law and the CaseHOLD Data… ☆95 Updated 2 years ago
- Code & data for EMNLP 2020 paper "MOCHA: A Dataset for Training and Evaluating Reading Comprehension Metrics". ☆16 Updated 3 years ago
- ☆65 Updated 2 years ago
- Edu-ConvoKit: An Open-Source Framework for Education Conversation Data ☆107 Updated 9 months ago
- StAtutory Reasoning Assessment ☆15 Updated 3 years ago
- Factored Cognition Primer: How to write compositional language model programs ☆50 Updated 2 years ago
- Repo for the paper "Detecting Logical Fallacies: From Quiz to Climate Change News" (2021) ☆85 Updated 2 years ago
- Ranger helps you see the forest among the trees - Ranger is an effect-size meta analysis library creating beautiful forest plots! ☆11 Updated 2 years ago
- A system for prompted weak supervision. Alfred is a powerful tool that leverages large language models to accelerate data annotation. ☆57 Updated 9 months ago
- Apps built using Inspired Cognition's Critique. ☆57 Updated 2 years ago
- A curated list of awesome datasets with human label variation (un-aggregated labels) in Natural Language Processing and Computer Vision, … ☆98 Updated last year
- ☆44 Updated last year
- Multi-task model for named-entity recognition, relation extraction, entity mention detection and coreference resolution. ☆45 Updated last year
- Starbucks: Improved Training for 2D Matryoshka Embeddings ☆22 Updated 7 months ago
- SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset. ☆47 Updated 2 years ago
- ☆26 Updated 11 months ago
- A corpus and code for understanding norms and subjectivity. 🤖 ☆53 Updated last year
- Data for evaluating gender bias in coreference resolution systems. ☆81 Updated 6 years ago
- ☆10 Updated last year
- https://arxiv.org/abs/2404.10917 ☆14 Updated 10 months ago
- Source code and data for Like a Good Nearest Neighbor ☆30 Updated last year
- Experiments on including metadata such as URLs, timestamps, website descriptions and HTML tags during pretraining. ☆31 Updated 2 years ago