emanlapponi / norlem-norwegian-lemmatizer
A lemmatizer for Norwegian that uses lexical and contextual information from the Norwegian Dependency Treebank (NDT) and lexical information from Ordbanken.
☆7Updated 8 years ago
Alternatives and similar repositories for norlem-norwegian-lemmatizer:
Users that are interested in norlem-norwegian-lemmatizer are comparing it to the libraries listed below
- Morphosyntactic tagger for Norwegian bokmål and nynorsk☆30Updated last year
- Tools for Norwegian NLP based on the Norwegian Dependency Treebank.☆17Updated 7 years ago
- A trend viewer written in Python/JavaScript☆21Updated 5 months ago
- Norwegian Named Entities annotations on top of NDT (Norwegian Dependency Treebank)☆69Updated 7 months ago
- Take a MALLET to disciplinary history☆98Updated 2 years ago
- Norwegian Review Corpus☆47Updated 7 months ago
- A tool for automatic spelling normalization☆20Updated 4 years ago
- A Python module for interfacing with the Treetagger by Helmut Schmid.☆75Updated 3 years ago
- Named-Entity Recognition for Norwegian Bokmål and Nynorsk☆12Updated 5 years ago
- ☆16Updated 10 years ago
- ☆33Updated 3 years ago
- A part-of-speech tagger with support for domain adaptation and external resources.☆22Updated 2 years ago
- CONLL-U to Pandas DataFrame☆31Updated 7 years ago
- An implementation of latent Dirichlet allocation in javascript☆184Updated 2 years ago
- Python implementation of the Zeta score for contrastive text analysis☆14Updated 3 years ago
- 2016 Presidential Campaign Speeches☆15Updated 8 years ago
- Netherlands eScience Center - Shifting Concepts Through Time project☆26Updated 3 years ago
- A tool for text normalisation via character-level machine translation☆13Updated 4 years ago
- Python package for stylometry☆63Updated 4 years ago
- Quickly extract multi-word phrases from a corpus☆191Updated 4 years ago
- spaCy + UDPipe☆161Updated 2 years ago
- Multi Tier Annotation Search☆26Updated 3 years ago
- This repo provides a python module to work with Open Dutch WordNet. It was created using python 3.4.☆65Updated 3 years ago
- An R package for analysis of dramatic texts☆15Updated 2 years ago
- Topic Words in Context (TWiC) is a highly-interactive, browser-based visualization for MALLET topic models☆51Updated 7 years ago
- CRF-based Morphological Tagging and Lemmatization☆36Updated 5 years ago
- A command-line program to download text corpora.☆34Updated 7 years ago
- Detect and align similar passages☆100Updated 2 months ago
- A fully-fledge PyTorch package for Morphological Analysis, tailored to morphologically rich and historical languages.☆23Updated last year
- Python framework for processing Universal Dependencies data☆56Updated this week