noklesta / The-Oslo-Bergen-Tagger
Morphosyntactic tagger for Norwegian bokmål and nynorsk
☆30 Updated last year
Related projects:
- A lemmatizer for Norwegian that uses lexical and contextual information from the Norwegian Dependency Treebank (NDT) and lexical informat… ☆7 Updated 7 years ago
- This repository makes available the Talk of Norway (ToN) dataset, a collection of Norwegian parliament speeches from 1998 to 2016. Every … ☆29 Updated last year
- ☆12 Updated this week
- Norwegian Named Entities annotations on top of NDT (Norwegian Dependency Treebank) ☆67 Updated last week
- Named-Entity Recognition for Norwegian Bokmål and Nynorsk ☆11 Updated 5 years ago
- Text-Induced Corpus Clean-up ☆20 Updated last year
- CRF-based Morphological Tagging and Lemmatization ☆34 Updated 4 years ago
- Tools for Norwegian NLP based on the Norwegian Dependency Treebank. ☆17 Updated 7 years ago
- Automatically exported from code.google.com/p/hunpos ☆11 Updated 6 years ago
- The curation repository for the data behind Concepticon. ☆32 Updated this week
- A tool for automatic spelling normalization ☆20 Updated 3 years ago
- Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic pr… ☆65 Updated last week
- An LL parser for extracting information from Wiki text, particularly Wiktionary. ☆48 Updated last year
- Use spaCy for NLP and output to the FoLiA XML format. ☆12 Updated 6 months ago
- Python bindings to the Dutch NLP tool Frog (pos tagger, lemmatiser, NER tagger, morphological analysis, shallow parser, dependency parser… ☆47 Updated last week
- Frog is an integration of memory-based natural language processing (NLP) modules developed for Dutch. All NLP modules are based on Timbl,… ☆73 Updated last week
- FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (inclu… ☆60 Updated 4 months ago
- A command-line program to download text corpora. ☆33 Updated 7 years ago
- ☆17 Updated 9 years ago
- Norwegian Review Corpus ☆45 Updated 3 weeks ago
- ☆27 Updated 7 years ago
- Lexicons for the Multilingual UCREL Semantic Analysis System ☆38 Updated last year
- Netherlands eScience Center - Shifting Concepts Through Time project ☆26 Updated 2 years ago
- Collections of English historical texts and data relating to them ☆16 Updated 3 years ago
- A part-of-speech tagger with support for domain adaptation and external resources. ☆22 Updated last year
- IXA pipes Part of Speech tagger and Lemmatizer (http://ixa2.si.ehu.es/ixa-pipes) ☆17 Updated last year
- Open morphology for Finnish ☆84 Updated last month
- Python framework for processing Universal Dependencies data ☆55 Updated last week
- Morphologically informed POS tagging for German ☆25 Updated 3 years ago
- A tool for analyzing the word histories of a text. ☆34 Updated last month