dhfbk / variationistLinks
Variationist: Exploring Multifaceted Variation and Bias in Written Language Data (ACL 2024 demo track)
☆10Updated last year
Alternatives and similar repositories for variationist
Users that are interested in variationist are comparing it to the libraries listed below
Sorting:
- Utility for behavioral and representational analyses of Language Models☆167Updated last month
- A module to compute textual lexical richness (aka lexical diversity).☆110Updated 2 years ago
- SIGMORPHON 2022 Shared Task on Morpheme Segmentation☆28Updated 2 years ago
- Python version for Doug Biber's Multidimensional Analysis (MDA)☆38Updated 8 months ago
- Dutch coreference resolution & dialogue analysis using deterministic rules☆22Updated 2 years ago
- Python Multilingual Ucrel Semantic Analysis System☆32Updated this week
- CD20200004 from 01/01/2021 to 31/12/2023 - LIG UGA - Python Notebook and Models for the MT Lab @ ALPS 2022☆13Updated last year
- A curated list of awesome datasets with human label variation (un-aggregated labels) in Natural Language Processing and Computer Vision, …☆93Updated last year
- Repository for Vajjala & Lucic (2018)☆65Updated last year
- Fast computation of Krippendorff's alpha agreement measure in Python.☆152Updated this week
- Find informative examples to efficiently (human)-evaluate NLG models.☆16Updated 3 weeks ago
- This is a simple Python package for calculating a variety of lexical diversity indices☆81Updated 2 years ago
- 🖋 Resource and Tool for Writing System Identification -- LREC 2024☆20Updated last year
- ☆49Updated last year
- A survey of corpora for Germanic low-resource languages and dialects☆25Updated 11 months ago
- This repository provides details and links to the ACL anthology corpus/collection including .bib, .pdf and grobid extractions of the pdfs☆184Updated 2 years ago
- Split bib files for anthology bibliography for overleaf☆11Updated last year
- A collection of text simplification datasets and other resources☆50Updated last year
- TSAR2022 Shared Task on Lexical Simplification - Datasets and Evaluation scripts☆10Updated 3 years ago
- This is the data associated with the PERSUADE Corpus 2.0 version☆46Updated 11 months ago
- This repository provides the source code used to automatically generate the book summarization datasets described in the paper titled "Ec…☆11Updated 6 months ago
- Data and evaluation code for the paper WikiNEuRal: Combined Neural and Knowledge-based Silver Data Creation for Multilingual NER (EMNLP 2…☆69Updated 2 years ago
- AfroLID, a powerful neural toolkit for African languages identification which covers 517 African languages.☆32Updated 7 months ago
- Repo for Aspire - A scientific document similarity model based on matching fine-grained aspects of scientific papers.☆54Updated 2 years ago
- This repository contains a demonstrative implementation for pooling-based models, e.g., DeepPyramidion complementing our paper "Sparsifyi…☆14Updated 3 years ago
- Linguistic and stylistic complexity measures for (literary) texts☆84Updated last year
- MFTE (Multi Feature Tagger of English) Python is the Python version based on Le Foll's MFTE written in Perl. It is extended to include se…☆29Updated 5 months ago
- A accurate multilingual word aligner based on LaBSE☆23Updated 2 years ago
- An initiative to collect and distribute resources for co-reference resolution in a unified standard.☆25Updated last year
- A repository with several curated datasets of counter-narratives to fight online hate speech.☆93Updated 3 months ago