benjaminvdb / DBRD
110k Dutch Book Reviews Dataset for Sentiment Analysis
☆30Updated last year
Related projects ⓘ
Alternatives and complementary repositories for DBRD
- Dutch coreference resolution & dialogue analysis using deterministic rules☆21Updated last year
- Python bindings to the dutch NLP tool Frog (pos tagger, lemmatiser, NER tagger, morphological analysis, shallow parser, dependency parser…☆47Updated 3 weeks ago
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do…☆76Updated 4 months ago
- CONLL-U to Pandas DataFrame☆31Updated 7 years ago
- Linguistic and stylistic complexity measures for (literary) texts☆77Updated 9 months ago
- German Morphological Analyzer☆47Updated 3 years ago
- Repository for the word embeddings experiments described in "Evaluating Unsupervised Dutch Word Embeddings as a Linguistic Resource", pre…☆82Updated 3 years ago
- Alignment and annotation for comparable documents.☆22Updated 6 years ago
- ☆25Updated 4 years ago
- A Word Sense Disambiguation system integrating implicit and explicit external knowledge.☆67Updated 3 years ago
- Training Temporal Word Embeddings with a Compass☆64Updated last year
- ☆64Updated last year
- CD20200004 from 01/01/2021 to 31/12/2023 - LIG UGA - Python Notebook and Models for the MT Lab @ ALPS 2022☆14Updated 7 months ago
- A Dutch RoBERTa-based language model☆197Updated 7 months ago
- Automatic extraction of edited sentences from text edition histories.☆81Updated 2 years ago
- Experiments with Zalando's flair library☆34Updated last year
- Language Models for Zalando's flair library☆62Updated 4 years ago
- Python framework for processing Universal Dependencies data☆57Updated this week
- UIMA CAS processing library written in Python☆85Updated 6 months ago
- spaCy + UDPipe☆161Updated 2 years ago
- A neural parsing pipeline for segmentation, morphological tagging, dependency parsing and lemmatization with pre-trained models for more …☆112Updated 6 months ago
- Scripts and tools for doing unsupervised acceptability prediction.☆15Updated last year
- BERT and ELECTRA models trained on Europeana Newspapers☆36Updated 2 years ago
- linguistic converter / merging tool for multi-level annotated corpora. graph-based (using Python and NetworkX).☆50Updated last year
- A minimal, pure Python library to interface with CoNLL-U format files.☆149Updated last year
- UFSAC is a resource containing all WordNet Sense Annotated Corpora, and a Java library for manipulating them☆37Updated 2 years ago
- KenLM extension for spaCy 2.0.☆16Updated 6 years ago
- Anonymization of legal cases (Fr) based on Flair embeddings☆87Updated 3 years ago
- Language Modelling Makes Sense - WSD (and more) with Contextual Embeddings☆95Updated last year
- Language Tool style grammar handling with spaCy 2.0☆42Updated 6 years ago