scrosseye / CLEAR-CorpusLinks

Repository for the CommonLit Ease of Readability Corpus

☆24

Alternatives and similar repositories for CLEAR-Corpus

Users that are interested in CLEAR-Corpus are comparing it to the libraries listed below

Sorting:

marcoguerini / CONAN
A repository with several curated datasets of counter-narratives to fight online hate speech.
☆93Updated 4 months ago
mainlp / awesome-human-label-variation
A curated list of awesome datasets with human label variation (un-aggregated labels) in Natural Language Processing and Computer Vision, …
☆94Updated last year
LSYS / LexicalRichness
A module to compute textual lexical richness (aka lexical diversity).
☆110Updated 2 years ago
scrosseye / persuade_corpus_2.0
This is the data associated with the PERSUADE Corpus 2.0 version
☆46Updated last year
kristopherkyle / lexical_diversity
This is a simple Python package for calculating a variety of lexical diversity indices
☆81Updated 2 years ago
kanishkamisra / minicons
Utility for behavioral and representational analyses of Language Models
☆170Updated last month
ipavlopoulos / toxic_spans
Detect toxic spans in toxic texts
☆71Updated 2 years ago
nishkalavallabhi / OneStopEnglishCorpus
Repository for Vajjala & Lucic (2018)
☆66Updated last year
zhijing-jin / NLP4SocialGood_Papers
A reading list of up-to-date papers on NLP for Social Good.
☆304Updated 2 years ago
mit-ccc / TweebankNLP
[LREC 2022] An off-the-shelf pre-trained Tweet NLP Toolkit (NER, tokenization, lemmatization, POS tagging, dependency parsing) + Tweeban…
☆105Updated last year
DS3Lab / multilingual-gaze
Code for "Multilingual language models predict human reading behavior"
☆12Updated 3 years ago
ssharoff / biberpy
Python version for Doug Biber's Multidimensional Analysis (MDA)
☆38Updated last week
maartensap / riveter-nlp
Package to extract connotation frames
☆91Updated last year
glnmario / cwr4lsc
Contextualised Word Representations for Lexical Semantic Change Analysis
☆32Updated 5 years ago
cardiffnlp / xlm-t
Repository for XLM-T, a framework for evaluating multilingual language models on Twitter data
☆158Updated 2 years ago
chaojiang06 / wiki-auto
Neural CRF Model for Sentence Alignment in Text Simplification
☆68Updated 10 months ago
kristopherkyle / corpus_toolkit
A simple toolkit for conducting analyses using corpus methods
☆26Updated 4 years ago
DFKI-NLP / thermostat
Collection of NLP model explanations and accompanying analysis tools
☆144Updated 2 years ago
cardiffnlp / timelms
TimeLMs: Diachronic Language Models from Twitter
☆111Updated last year
HannahKirk / Hatemoji
Testing and training detection models for emoji-based hate speech.
☆24Updated 3 years ago
BramVanroy / spacy_conll
Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do…
☆80Updated last year
sf-wa-326 / phrase-bert-topic-model
☆88Updated 3 years ago
rstodden / TS_annotation_tool
Annotation Tool for Text Simplification Corpora
☆17Updated 2 years ago
wwbp / empathic_reactions
☆41Updated 5 years ago
uclanlp / gn_glove
Learning Gender-Neutral Word Embeddings
☆47Updated 6 years ago
aymeam / Datasets-for-Hate-Speech-Detection
Datasets for Hate Speech Detection
☆133Updated 2 years ago
berzak / celer
☆20Updated 3 years ago
uds-lsv / lexicon-of-abusive-words
This repository contains all new resources that were created for the NAACL-2018 paper "Inducing a Lexicon of Abusive Words -- A Feature-B…
☆29Updated 6 years ago
kristopherkyle / TAASSC
Tool for the Automatic Analysis of Syntactic Sophistication and Complexity
☆28Updated 2 years ago
shauryr / ACL-anthology-corpus
This repository provides details and links to the ACL anthology corpus/collection including .bib, .pdf and grobid extractions of the pdfs
☆185Updated 2 years ago