ziqizhang / dataLinks

Datasets shared by research

☆8

Alternatives and similar repositories for data

Users that are interested in data are comparing it to the libraries listed below

Sorting:

mrinaldhar / en-hi-codemixed-corpus
Repository for the English-Hindi Codemixed to Monolingual English Parallel Corpus
☆13Updated 6 years ago
sjtuprog / fox-news-comments
annotated hateful speech
☆24Updated 6 years ago
karins / CoherenceFramework
Entity and syntax experiments for assessing coherence
☆27Updated 6 years ago
jing-qian / A-Benchmark-Dataset-for-Learning-to-Intervene-in-Online-Hate-Speech
☆68Updated 3 years ago
mcdm / CommitmentBank
Materials related to our Sinn und Bedeutung 23 paper
☆39Updated 5 years ago
zhongpeixiang / SemEval2019-Task3-EmotionDetection
☆14Updated 6 years ago
hate-alert / DE-LIMIT
DeEpLearning models for MultIlingual haTespeech (DELIMIT): Benchmarking multilingual models across 9 languages and 16 datasets.
☆109Updated 2 years ago
uds-lsv / lexicon-of-abusive-words
This repository contains all new resources that were created for the NAACL-2018 paper "Inducing a Lexicon of Abusive Words -- A Feature-B…
☆29Updated 6 years ago
Oneplus / Tweebank
A collection of English tweets annotated in Universal Dependencies.
☆39Updated 3 years ago
sfu-natlang / trofi-metaphor-data
Metaphor dataset: literal versus non-literal uses of words
☆14Updated 9 years ago
ENCASEH2020 / hatespeech-twitter
☆54Updated 3 years ago
FredericGodin / TwitterEmbeddings
Twitter word embeddings generated using Word2Vec and FastText.
☆48Updated 5 years ago
danlou / LMMS
Language Modelling Makes Sense - WSD (and more) with Contextual Embeddings
☆95Updated 2 years ago
sileod / Discovery
Mining Discourse Markers for Unsupervised Sentence Representation Learning
☆60Updated 2 years ago
EducationalTestingService / metaphor
Metaphor classification for verbs and content words
☆65Updated last year
shyamupa / wikidump_preprocessing
Extracting useful metadata from Wikipedia dumps in any language.
☆27Updated 5 years ago
jacobeisenstein / language-change-tutorial
Tutorial on computational models of language change
☆115Updated 6 years ago
NorskRegnesentral / weak-supervision-for-NER
Framework to learn Named Entity Recognition models without labelled data using weak supervision.
☆124Updated 4 years ago
hltcoe / PredPatt
PredPatt: Predicate-Argument Extraction from Universal Dependencies
☆112Updated 4 years ago
hate-alert / Hate-Speech-Reading-List
This repository contains papers and resources pertaining to Hate speech research.
☆45Updated 4 years ago
cbaziotis / ntua-slp-semeval2018
Deep-learning models of NTUA-SLP team submitted in SemEval 2018 tasks 1, 2 and 3.
☆85Updated 3 years ago
decompositional-semantics-initiative / decomp
The Universal Decompositional Semantics (UDS) dataset and the Decomp toolkit
☆57Updated 2 years ago
HKUST-KnowComp / MLMA_hate_speech
Dataset and code of our EMNLP 2019 paper "Multilingual and Multi-Aspect Hate Speech Analysis"
☆56Updated 7 months ago
uzaymacar / comparatively-finetuning-bert
Comparatively fine-tuning pretrained BERT models on downstream, text classification tasks with different architectural configurations in …
☆123Updated 5 years ago
afshinrahimi / mmner
Massively Multilingual Transfer for NER
☆86Updated 3 years ago
dirkhovy / MACE
Multi-Annotator Competence Estimation tool
☆63Updated 6 years ago
manueltonneau / covid-berts
BERT models pretrained on the CORD-19 Kaggle dataset
☆15Updated 5 years ago
ziqizhang / semrerank
Implements SemRe-Rank: improving automatic term extraction by incorporating semantic relatedness with personalised pagerank
☆16Updated 7 years ago
sshaar / clef2020-factchecking-task1
Contains data, format checker, scorer and baselines for the CLEF2020-CheckThat! Task 1.
☆20Updated 2 years ago
tsproisl / textcomplexity
Linguistic and stylistic complexity measures for (literary) texts
☆82Updated last year