noklesta / The-Oslo-Bergen-Tagger
Morphosyntactic tagger for Norwegian Bokmål and Nynorsk
☆29 · Updated 2 years ago
Alternatives and similar repositories for The-Oslo-Bergen-Tagger
Users who are interested in The-Oslo-Bergen-Tagger are comparing it to the libraries listed below.
- An LL parser for extracting information from Wiki text, particularly Wiktionary. ☆49 · Updated 2 years ago
- A trend viewer written in Python/JavaScript ☆21 · Updated last year
- Simple CORPORA list crawler ☆10 · Updated 8 years ago
- This repository makes available the Talk of Norway (ToN) dataset, a collection of Norwegian parliament speeches from 1998 to 2016. Every … ☆31 · Updated 2 years ago
- The Zurich Dependency Parser for German ☆87 · Updated 3 months ago
- CRF-based Morphological Tagging and Lemmatization ☆37 · Updated 6 years ago
- Use spaCy for NLP and output to the FoLiA XML format. ☆12 · Updated last year
- Automatically exported from code.google.com/p/hunpos ☆12 · Updated 7 years ago
- Natural language processing pipeline for book-length documents (archival Java version; for current Python version, see: https://github.co… ☆315 · Updated 3 years ago
- Norwegian Named Entities annotations on top of NDT (Norwegian Dependency Treebank) ☆70 · Updated last year
- Python bindings to the Dutch NLP tool Frog (POS tagger, lemmatiser, NER tagger, morphological analysis, shallow parser, dependency parser… ☆49 · Updated 8 months ago
- Tools for Norwegian NLP based on the Norwegian Dependency Treebank. ☆17 · Updated 8 years ago
- CoNLL 2018 Shared Task Team UDPipe-Future ☆39 · Updated 5 years ago
- UDPipe: Trainable pipeline for tokenizing, tagging, lemmatizing and parsing Universal Treebanks and other CoNLL-U files ☆389 · Updated this week
- Frog is an integration of memory-based natural language processing (NLP) modules developed for Dutch. All NLP modules are based on Timbl,… ☆79 · Updated last week
- displaCy.js: An open-source NLP visualiser for the modern web ☆345 · Updated 7 years ago
- Model Training tool for MITIE ☆79 · Updated 10 years ago
- A simple interface to the Project Gutenberg corpus. ☆330 · Updated 2 years ago
- Wiktionary parser tool for many language editions. ☆54 · Updated 3 years ago
- displaCy-ent.js: An open-source named entity visualiser for the modern web ☆198 · Updated 7 years ago
- Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic pr… ☆70 · Updated 2 weeks ago
- An intelligent reading agent that understands text and translates it into Wikidata statements. ☆116 · Updated 9 years ago
- Segtok v2 is here: https://github.com/fnl/syntok -- A rule-based sentence segmenter (splitter) and a word tokenizer using orthographic fe… ☆170 · Updated 3 years ago
- Named Entity Recognition data for Europeana Newspapers ☆173 · Updated 2 years ago
- Morfessor is a tool for unsupervised and semi-supervised morphological segmentation ☆197 · Updated 5 years ago
- A command-line program to download text corpora. ☆34 · Updated 8 years ago
- An implementation of latent Dirichlet allocation in JavaScript ☆185 · Updated 3 years ago
- German Morphological Analyzer ☆50 · Updated 4 years ago
- NLTK Contrib ☆168 · Updated last year
- German lemmatization with IWNLP as extension for spaCy ☆26 · Updated 2 years ago