noklesta / The-Oslo-Bergen-Tagger
Morphosyntactic tagger for Norwegian bokmål and nynorsk
☆30Updated last year
Alternatives and similar repositories for The-Oslo-Bergen-Tagger:
Users that are interested in The-Oslo-Bergen-Tagger are comparing it to the libraries listed below
- A lemmatizer for Norwegian that uses lexical and contextual information from the Norwegian Dependency Treebank (NDT) and lexical informat…☆7Updated 8 years ago
- A trend viewer written in Python/JavaScript☆21Updated 4 months ago
- CRF-based Morphological Tagging and Lemmatization☆36Updated 5 years ago
- A tool for automatic spelling normalization☆20Updated 4 years ago
- Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic pr…☆68Updated last month
- Netherlands eScience Center - Shifting Concepts Through Time project☆26Updated 3 years ago
- Python bindings to the dutch NLP tool Frog (pos tagger, lemmatiser, NER tagger, morphological analysis, shallow parser, dependency parser…☆49Updated last week
- ☆16Updated 10 years ago
- Pikes is a Knowledge Extraction Suite☆23Updated last year
- Norwegian Named Entities annotations on top of NDT (Norwegian Dependency Treebank)☆69Updated 6 months ago
- Named Entities Recognition Annotator Tool for Europeana Newspapers☆60Updated 7 years ago
- FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (inclu…☆63Updated 10 months ago
- Supervised learning of morphology☆28Updated 8 years ago
- An LL parser for extracting information from Wiki text, particularly Wiktionary.☆48Updated last year
- A set of workflows for corpus building through OCR, post-correction and normalisation☆48Updated 2 years ago
- Text-Induced Corpus Clean-up☆20Updated last year
- Tools for Norwegian NLP based on the Norwegian Dependency Treebank.☆17Updated 7 years ago
- eXternally configurable REference and Non Named Entity Recognizer☆17Updated 9 months ago
- A tool for analyzing the word histories of a text.☆34Updated 4 months ago
- Named-Entity Recognition for Norwegian Bokmål and Nynorsk☆12Updated 5 years ago
- This repository makes available the Talk of Norway (ToN) dataset, a collection of Norwegian parliament speeches from 1998 to 2016. Every …☆31Updated last year
- An intelligent reading agent that understands text and translates it into Wikidata statements.☆115Updated 8 years ago
- finite-state toolkit, EM and Bayesian (Gibbs sampling) training for FST and context-free derivation forests☆41Updated 2 years ago
- Specification of NAF, the NLP annotation format☆21Updated 4 years ago
- spaCy-to-naf converter☆21Updated 9 months ago
- Automatically exported from code.google.com/p/hunpos☆12Updated 6 years ago
- FoLiA Linguistic Annotation Tool -- Flat is a web-based linguistic annotation environment based around the FoLiA format (http://proycon.g…☆112Updated 2 months ago
- ☆30Updated 8 years ago
- Frog is an integration of memory-based natural language processing (NLP) modules developed for Dutch. All NLP modules are based on Timbl,…☆75Updated 3 months ago
- A command-line program to download text corpora.☆34Updated 7 years ago