OFAI / million-post-corpusLinks
Annotated data set consisting of user comments posted to a German-language newspaper website
☆17Updated 7 years ago
Alternatives and similar repositories for million-post-corpus
Users that are interested in million-post-corpus are comparing it to the libraries listed below
Sorting:
- The Broad Twitter Corpus, an NER dataset in English stratified for time, location, social media genre, socioeconomic factors (COLING 2016…☆68Updated 3 years ago
- ☆104Updated 6 years ago
- ☆54Updated 3 years ago
- Implementation of a simple frame identification approach (SimpleFrameId) described in the paper "Out-of-domain FrameNet Semantic Role Lab…☆15Updated 8 years ago
- CONLL-U to Pandas DataFrame☆31Updated 7 years ago
- ☆59Updated 10 years ago
- numeric fused-head identification and resolution☆33Updated 5 years ago
- Multi-Annotator Competence Estimation tool☆63Updated 6 years ago
- Sume is an implementation of the concept-based ILP model for summarization.☆37Updated 7 years ago
- Repository for the word embeddings experiments described in "Evaluating Unsupervised Dutch Word Embeddings as a Linguistic Resource", pre…☆84Updated 4 years ago
- KenLM extension for spaCy 2.0.☆16Updated 7 years ago
- Workshop on Noisy User-generated Text (W-NUT)☆30Updated 5 months ago
- Keras implementation of ontology aware token embeddings☆49Updated 6 years ago
- Code and data for the WSDM '19 paper "Crosslingual Document Embedding as Reduced-Rank Ridge Regression (Cr5)"☆30Updated 6 years ago
- Regex like pattern tree matching but on sentence's tree instead of Strings☆42Updated 7 years ago
- C++ implementation of Generalised Brown clustering and python scripts for feature generation☆41Updated 9 years ago
- A Dependency Parser for Tweets☆78Updated 6 years ago
- ☆64Updated 2 years ago
- LexNET: Integrated Path-based and Distributional Method for Lexical Semantic Relation Classification☆62Updated 6 years ago
- Jupyter extension to visualize dependency structures☆28Updated 7 years ago
- spaCy + UDPipe☆163Updated 3 years ago
- A natural language processing tool for automatically detecting quotations in text.☆15Updated 3 years ago
- Word embedding approach based on a dynamic log-linear model☆55Updated 8 years ago
- This repository contains all manually labeled data from the GermEval-2018 shared task.☆29Updated 7 years ago
- PredPatt: Predicate-Argument Extraction from Universal Dependencies☆111Updated 4 years ago
- Mining Argument Structures with Expressive Inference (Linear and LSTM Engines)☆67Updated 8 years ago
- Hierarchical word clustering, following "Brown clustering" (Brown et al., 1992)☆70Updated 10 years ago
- Utility scripts in Python☆37Updated 3 months ago
- A minimal, pure Python library to interface with CoNLL-U format files.☆152Updated last week
- A simple neural truecaser written in pytorch and allennlp.☆33Updated last year