OFAI / million-post-corpus
Annotated data set consisting of user comments posted to a German-language newspaper website
☆17Updated 6 years ago
Alternatives and similar repositories for million-post-corpus:
Users that are interested in million-post-corpus are comparing it to the libraries listed below
- Language Model and Text Classification for German Language using Deep Learning☆18Updated 6 years ago
- Code and data for ACL2016 article "Which argument is more convincing? Analyzing and predicting convincingness of Web arguments using bidi…☆28Updated 8 years ago
- ☆103Updated 6 years ago
- German lemmatization with IWNLP as extension for spaCy☆24Updated last year
- SMOR (Stuttgart Morphology) with alternative lemmatization component☆12Updated last year
- The Broad Twitter Corpus, an NER dataset in English stratified for time, location, social media genre, socioeconomic factors (COLING 2016…☆66Updated 2 years ago
- C++ implementation of Generalised Brown clustering and python scripts for feature generation☆41Updated 8 years ago
- GC4LM: A Colossal (Biased) language model for German☆13Updated 3 years ago
- Coreference resolution for German☆16Updated 7 years ago
- Sentence specificity prediction☆25Updated 6 years ago
- Workshop on Noisy User-generated Text (W-NUT)☆30Updated this week
- Corpus of Attribution-Annotated news articles covering the campaigns during the year leading up to the 2016 US Presidential election.☆20Updated 6 years ago
- ☆43Updated 9 years ago
- CoNLL 2018 Shared Task Team UDPipe-Future☆39Updated 4 years ago
- Keras implementation of ontology aware token embeddings☆48Updated 6 years ago
- ☆16Updated 7 years ago
- ☆16Updated 5 years ago
- Mining Argument Structures with Expressive Inference (Linear and LSTM Engines)☆64Updated 7 years ago
- CONLL-U to Pandas DataFrame☆31Updated 7 years ago
- Python interface for converting Penn Treebank trees to Stanford Dependencies and Universal Depenencies☆70Updated 5 years ago
- A Dependency Parser for Tweets☆79Updated 5 years ago
- Dict2vec is a framework to learn word embeddings using lexical dictionaries.☆114Updated 4 years ago
- Processing the MPQA Corpus☆27Updated 6 years ago
- A framework to identify relations between ideas in temporal text corpora.☆28Updated 6 years ago
- KenLM extension for spaCy 2.0.☆16Updated 7 years ago
- Scripts and tools for doing unsupervised acceptability prediction.☆15Updated last year
- The Potsdam Twitter Sentiment Corpus☆17Updated 5 years ago
- Repository for rstWeb, a browser based annotation interface for Rhetorical Structure Theory☆42Updated 3 months ago
- The Arborator software is aimed at collaboratively annotating dependency corpora.☆25Updated 5 years ago
- ☆15Updated 6 years ago