OFAI / million-post-corpusLinks
Annotated data set consisting of user comments posted to a German-language newspaper website
☆17Updated 7 years ago
Alternatives and similar repositories for million-post-corpus
Users that are interested in million-post-corpus are comparing it to the libraries listed below
Sorting:
- ☆105Updated 7 years ago
- C++ implementation of Generalised Brown clustering and python scripts for feature generation☆41Updated 9 years ago
- This repository contains all manually labeled data from the GermEval-2018 shared task.☆29Updated 7 years ago
- Fast supervised sentence boundary detection using the averaged perceptron☆91Updated 6 years ago
- ☆54Updated 3 years ago
- A Dependency Parser for Tweets☆78Updated 6 years ago
- Utility scripts in Python☆37Updated 5 months ago
- Fast Word Clustering Software☆79Updated 9 months ago
- The Broad Twitter Corpus, an NER dataset in English stratified for time, location, social media genre, socioeconomic factors (COLING 2016…☆68Updated 3 years ago
- Bidirectional Long-Short Term Memory tagger (bi-LSTM) (in DyNet) -- hierarchical (with word and character embeddings)☆123Updated 2 years ago
- Workshop on Noisy User-generated Text (W-NUT)☆30Updated 6 months ago
- A way to do annotations for NER. TALEN: Tool for Annotation of Low-resource ENtities☆118Updated 4 months ago
- CONLL-U to Pandas DataFrame☆31Updated 8 years ago
- Tokenizer for Twitter and Reddit data☆46Updated 6 years ago
- Python interface for converting Penn Treebank trees to Stanford Dependencies and Universal Depenencies☆69Updated 6 years ago
- Multi-Annotator Competence Estimation tool☆64Updated 6 years ago
- Repository for the word embeddings experiments described in "Evaluating Unsupervised Dutch Word Embeddings as a Linguistic Resource", pre…☆84Updated 4 years ago
- CrowdTruth framework for crowdsourcing ground truth for training & evaluation of AI systems☆62Updated last year
- Doing things with embeddings☆66Updated 3 years ago
- ☆32Updated 4 years ago
- Incremental learning of word embeddings with context informativeness.☆94Updated 2 years ago
- PredPatt: Predicate-Argument Extraction from Universal Dependencies☆110Updated 4 years ago
- spaCy + UDPipe☆163Updated 3 years ago
- KenLM extension for spaCy 2.0.☆16Updated 7 years ago
- ☆44Updated 10 years ago
- numeric fused-head identification and resolution☆33Updated 6 years ago
- LexNET: Integrated Path-based and Distributional Method for Lexical Semantic Relation Classification☆62Updated 7 years ago
- Getting started with AllenNLP and PyTorch by training a tweet classifier☆66Updated 8 years ago
- Code for morphological transformations☆29Updated 8 years ago
- Word embedding approach based on a dynamic log-linear model☆55Updated 8 years ago