benjaminvdb / DBRDLinks
110k Dutch Book Reviews Dataset for Sentiment Analysis
☆29Updated 2 years ago
Alternatives and similar repositories for DBRD
Users that are interested in DBRD are comparing it to the libraries listed below
Sorting:
- A neural parsing pipeline for segmentation, morphological tagging, dependency parsing and lemmatization with pre-trained models for more …☆115Updated last year
- spaCy + UDPipe☆163Updated 3 years ago
- A minimal, pure Python library to interface with CoNLL-U format files.☆153Updated last week
- This is a simple Python package for calculating a variety of lexical diversity indices☆82Updated 2 years ago
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do…☆81Updated last year
- A tokenizer and sentence splitter for German and English web and social media texts.☆150Updated last year
- Linguistic and stylistic complexity measures for (literary) texts☆84Updated last year
- A Word Sense Disambiguation system integrating implicit and explicit external knowledge.☆69Updated 4 years ago
- ☆65Updated 3 months ago
- ☆64Updated 2 years ago
- German Morphological Analyzer☆51Updated 4 years ago
- An easy-to-use API for analyzing INCEpTION annotation projects.☆17Updated 2 years ago
- DBMDZ BERT, DistilBERT, ELECTRA, GPT-2 and ConvBERT models☆156Updated 3 years ago
- Shared BERT model for 4 languages of Bulgarian, Czech, Polish and Russian. Slavic NER model.☆78Updated 3 years ago
- Repository for the Georgetown University Multilayer Corpus (GUM)☆102Updated last month
- Dutch coreference resolution & dialogue analysis using deterministic rules☆23Updated 2 years ago
- BERTje is a Dutch pre-trained BERT model developed at the University of Groningen. (EMNLP Findings 2020) "What’s so special about BERT’s …☆141Updated 2 years ago
- A natural language processing tool for automatically detecting quotations in text.☆15Updated 3 years ago
- Language independent truecaser in Python.☆160Updated 4 years ago
- Repository for the word embeddings experiments described in "Evaluating Unsupervised Dutch Word Embeddings as a Linguistic Resource", pre…☆84Updated 4 years ago
- A CoNLL-U parser that takes a CoNLL-U formatted string and turns it into a nested python dictionary.☆319Updated last week
- UIMA CAS processing library written in Python☆90Updated last month
- Alignment and annotation for comparable documents.☆22Updated 7 years ago
- ☆50Updated last year
- Python framework for processing Universal Dependencies data☆57Updated 3 weeks ago
- Text tokenization and sentence segmentation (segtok v2)☆208Updated 3 years ago
- STREUSLE: a corpus with comprehensive lexical semantic annotation (multiword expressions, supersenses)☆68Updated 3 weeks ago
- Experiments with Zalando's flair library☆34Updated 2 years ago
- A Dutch RoBERTa-based language model☆207Updated last year
- CONLL-U to Pandas DataFrame☆31Updated 8 years ago