noklesta / The-Oslo-Bergen-Tagger
Morphosyntactic tagger for Norwegian bokmål and nynorsk
☆30Updated last year
Alternatives and similar repositories for The-Oslo-Bergen-Tagger:
Users that are interested in The-Oslo-Bergen-Tagger are comparing it to the libraries listed below
- A lemmatizer for Norwegian that uses lexical and contextual information from the Norwegian Dependency Treebank (NDT) and lexical informat…☆7Updated 8 years ago
- A trend viewer written in Python/JavaScript☆21Updated 3 months ago
- Norwegian Named Entities annotations on top of NDT (Norwegian Dependency Treebank)☆69Updated 5 months ago
- Tools for Norwegian NLP based on the Norwegian Dependency Treebank.☆17Updated 7 years ago
- This repository makes available the Talk of Norway (ToN) dataset, a collection of Norwegian parliament speeches from 1998 to 2016. Every …☆31Updated last year
- Named-Entity Recognition for Norwegian Bokmål and Nynorsk☆12Updated 5 years ago
- A part-of-speech tagger with support for domain adaptation and external resources.☆22Updated 2 years ago
- Supervised learning of morphology☆28Updated 8 years ago
- A tool for automatic spelling normalization☆20Updated 4 years ago
- morphologically informed POS tagging for German☆26Updated 3 years ago
- Search back-end for dependency tree search. See the docs at https://fginter.github.io/dep_search/☆17Updated 6 years ago
- German Morphological Analyzer☆47Updated 3 years ago
- Automatically exported from code.google.com/p/hunpos☆12Updated 6 years ago
- The Zurich Dependency Parser for German☆83Updated 2 years ago
- eXternally configurable REference and Non Named Entity Recognizer☆17Updated 8 months ago
- The Broad Twitter Corpus, an NER dataset in English stratified for time, location, social media genre, socioeconomic factors (COLING 2016…☆66Updated 2 years ago
- 2016 Presidential Campaign Speeches☆15Updated 8 years ago
- An LL parser for extracting information from Wiki text, particularly Wiktionary.☆48Updated last year
- ☆16Updated 9 years ago
- SMOR (Stuttgart Morphology) with alternative lemmatization component☆12Updated last year
- Python bindings to the dutch NLP tool Frog (pos tagger, lemmatiser, NER tagger, morphological analysis, shallow parser, dependency parser…☆47Updated 2 months ago
- Wiktionary parser tool for many language editions.☆54Updated 2 years ago
- finite-state toolkit, EM and Bayesian (Gibbs sampling) training for FST and context-free derivation forests☆41Updated 2 years ago
- Text-Induced Corpus Clean-up☆20Updated last year
- Socially-Equitable Language Identification☆78Updated last year
- Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic pr…☆67Updated last week
- Basic dataset for the linguistic data collection.☆15Updated 8 years ago
- Topic Modeling Workflow in Python☆16Updated 2 years ago
- Named Entity Recognition (LSTM + CRF + FastText) with models for [historic] German☆26Updated 3 years ago
- ☆17Updated 3 months ago