ziqizhang / dataLinks
Datasets shared by research
☆8Updated 7 years ago
Alternatives and similar repositories for data
Users that are interested in data are comparing it to the libraries listed below
Sorting:
- Repository for the English-Hindi Codemixed to Monolingual English Parallel Corpus☆13Updated 6 years ago
- annotated hateful speech☆24Updated 6 years ago
- Entity and syntax experiments for assessing coherence☆27Updated 6 years ago
- ☆68Updated 3 years ago
- Materials related to our Sinn und Bedeutung 23 paper☆39Updated 5 years ago
- ☆14Updated 6 years ago
- DeEpLearning models for MultIlingual haTespeech (DELIMIT): Benchmarking multilingual models across 9 languages and 16 datasets.☆109Updated 2 years ago
- This repository contains all new resources that were created for the NAACL-2018 paper "Inducing a Lexicon of Abusive Words -- A Feature-B…☆29Updated 6 years ago
- A collection of English tweets annotated in Universal Dependencies.☆39Updated 3 years ago
- Metaphor dataset: literal versus non-literal uses of words☆14Updated 9 years ago
- ☆54Updated 3 years ago
- Twitter word embeddings generated using Word2Vec and FastText.☆48Updated 5 years ago
- Language Modelling Makes Sense - WSD (and more) with Contextual Embeddings☆95Updated 2 years ago
- Mining Discourse Markers for Unsupervised Sentence Representation Learning☆60Updated 2 years ago
- Metaphor classification for verbs and content words☆65Updated last year
- Extracting useful metadata from Wikipedia dumps in any language.☆27Updated 5 years ago
- Tutorial on computational models of language change☆115Updated 6 years ago
- Framework to learn Named Entity Recognition models without labelled data using weak supervision.☆124Updated 4 years ago
- PredPatt: Predicate-Argument Extraction from Universal Dependencies☆112Updated 4 years ago
- This repository contains papers and resources pertaining to Hate speech research.☆45Updated 4 years ago
- Deep-learning models of NTUA-SLP team submitted in SemEval 2018 tasks 1, 2 and 3.☆85Updated 3 years ago
- The Universal Decompositional Semantics (UDS) dataset and the Decomp toolkit☆57Updated 2 years ago
- Dataset and code of our EMNLP 2019 paper "Multilingual and Multi-Aspect Hate Speech Analysis"☆56Updated 7 months ago
- Comparatively fine-tuning pretrained BERT models on downstream, text classification tasks with different architectural configurations in …☆123Updated 5 years ago
- Massively Multilingual Transfer for NER☆86Updated 3 years ago
- Multi-Annotator Competence Estimation tool☆63Updated 6 years ago
- BERT models pretrained on the CORD-19 Kaggle dataset☆15Updated 5 years ago
- Implements SemRe-Rank: improving automatic term extraction by incorporating semantic relatedness with personalised pagerank☆16Updated 7 years ago
- Contains data, format checker, scorer and baselines for the CLEF2020-CheckThat! Task 1.☆20Updated 2 years ago
- Linguistic and stylistic complexity measures for (literary) texts☆82Updated last year