ziqizhang / data
Datasets shared by research
☆9Updated 6 years ago
Alternatives and similar repositories for data:
Users that are interested in data are comparing it to the libraries listed below
- ☆54Updated 3 years ago
- annotated hateful speech☆25Updated 6 years ago
- Cyber Hate detection And tracking on Social mEdia☆31Updated 2 years ago
- public repository of the interdisciplinary working group 'Hatespeech' of the research training group UCSM☆17Updated 6 years ago
- Massively Multilingual Transfer for NER☆86Updated 3 years ago
- Mining Discourse Markers for Unsupervised Sentence Representation Learning☆60Updated last year
- Corpus and annotations for the CL-Aff Shared Task from the University of Pennsylvania☆19Updated 3 years ago
- Sentence specificity prediction☆25Updated 6 years ago
- Metaphor dataset: literal versus non-literal uses of words☆14Updated 9 years ago
- Twitter word embeddings generated using Word2Vec and FastText.☆49Updated 5 years ago
- Training Temporal Word Embeddings with a Compass☆64Updated 2 years ago
- ☆17Updated 3 years ago
- Implements SemRe-Rank: improving automatic term extraction by incorporating semantic relatedness with personalised pagerank☆16Updated 7 years ago
- A Neural Model for User Geolocation and Lexical Dialectology☆16Updated 6 years ago
- Metaphor classification for verbs and content words☆65Updated last year
- Entity and syntax experiments for assessing coherence☆27Updated 6 years ago
- Decoding platform for machine translation research☆55Updated 5 years ago
- Unsupervised method for extracting quotation-speaker pairs from large news corpora.☆29Updated 6 years ago
- DeEpLearning models for MultIlingual haTespeech (DELIMIT): Benchmarking multilingual models across 9 languages and 16 datasets.☆109Updated last year
- ☆24Updated 6 years ago
- This repository contains all new resources that were created for the NAACL-2018 paper "Inducing a Lexicon of Abusive Words -- A Feature-B…☆29Updated 6 years ago
- A dataset of atomic wikipedia edits containing insertions and deletions of a contiguous chunk of text in a sentence. This dataset contai…☆106Updated 5 years ago
- Cross-Lingual Alignment of Contextual Word Embeddings☆99Updated 5 years ago
- The Universal Decompositional Semantics (UDS) dataset and the Decomp toolkit☆57Updated last year
- Repository for the English-Hindi Codemixed to Monolingual English Parallel Corpus☆13Updated 6 years ago
- ☆11Updated 5 years ago
- Guidelines.☆96Updated 8 months ago
- ☆83Updated 4 years ago
- CrowdTruth framework for crowdsourcing ground truth for training & evaluation of AI systems☆59Updated last year
- Harassment Lexicon and Corpus☆30Updated 6 years ago