sophiegroenwold / AAVE_SAE_datasetLinks
Dataset accompanying the paper "Investigating African-American Vernacular English in Transformer-Based Text Generation."
☆10Updated 3 years ago
Alternatives and similar repositories for AAVE_SAE_dataset
Users that are interested in AAVE_SAE_dataset are comparing it to the libraries listed below
Sorting:
- Repository for Zheng and Guha et al., 2021, "When Does Pretraining Help? Assessing Self-Supervised Learning for Law and the CaseHOLD Data…☆93Updated 2 years ago
- Multi-LexSum is an abstractive summarization dataset for US Civil Rights Lawsuits☆21Updated 3 years ago
- MultiCite code and data. Models are available on Huggingface.☆32Updated 3 years ago
- The corresponding code for our paper: "Exploring the Challenges of Open Domain Multi-Document Summarization". Do not hesitate to open an …☆32Updated 2 years ago
- Semantically Structured Sentence Embeddings☆69Updated last year
- Multidocument Summarization for Literature Review Shared Task 2022☆30Updated 3 years ago
- A dataset for pretraining language models targeted for legal tasks.☆140Updated 3 years ago
- Detecting Bias and ensuring Fairness in AI solutions☆102Updated 2 years ago
- Mining Legal Arguments in Court Decisions - Data and software☆73Updated 2 years ago
- ☆39Updated last year
- ☆37Updated 4 months ago
- The AI Knowledge Editor☆186Updated 3 years ago
- ☆65Updated 2 years ago
- Repo for the paper "Detecting Logical Fallacies: From Quiz to Climate Change News" (2021)☆84Updated 2 years ago
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.☆111Updated last year
- Experiments on including metadata such as URLs, timestamps, website descriptions and HTML tags during pretraining.☆31Updated 2 years ago
- A collection of datasets for language model pretraining including scripts for downloading, preprocesssing, and sampling.☆64Updated last year
- A spaCy custom component that extracts and normalizes temporal expressions☆56Updated 2 years ago
- LexGLUE: A Benchmark Dataset for Legal Language Understanding in English☆230Updated 4 months ago
- A repository with several curated datasets of counter-narratives to fight online hate speech.☆93Updated 4 months ago
- ☆100Updated last year
- Multi-task model for named-entity recognition, relation extraction, entity mention detection and coreference resolution.☆45Updated last year
- ☆176Updated last year
- Calculate Krippendorff's Alpha on any DataFrame☆42Updated 2 years ago
- Adversarial Training on Transformer Networks to discover check-worthy factual claims☆83Updated 2 years ago
- Human-free quality estimation of document summaries☆97Updated 2 weeks ago
- Repo for Aspire - A scientific document similarity model based on matching fine-grained aspects of scientific papers.☆54Updated 2 years ago
- ☆21Updated 2 months ago
- Neighborhood Contrastive Learning for Scientific Document Representations with Citation Embeddings (EMNLP 2022 paper)☆74Updated 3 years ago
- StAtutory Reasoning Assessment☆15Updated 3 years ago