noklesta / The-Oslo-Bergen-TaggerLinks
Morphosyntactic tagger for Norwegian bokmål and nynorsk
☆29Updated 2 years ago
Alternatives and similar repositories for The-Oslo-Bergen-Tagger
Users that are interested in The-Oslo-Bergen-Tagger are comparing it to the libraries listed below
Sorting:
- This repository makes available the Talk of Norway (ToN) dataset, a collection of Norwegian parliament speeches from 1998 to 2016. Every …☆31Updated 2 years ago
- Natural language processing pipeline for book-length documents (archival Java version; for current Python version, see: https://github.co…☆316Updated 3 years ago
- A command-line program to download text corpora.☆34Updated 8 years ago
- An implementation of latent Dirichlet allocation in javascript☆185Updated 3 years ago
- A trend viewer written in Python/JavaScript☆21Updated last year
- CRF-based Morphological Tagging and Lemmatization☆38Updated 6 years ago
- Automatically exported from code.google.com/p/hunpos☆12Updated 7 years ago
- Norwegian Named Entities annotations on top of NDT (Norwegian Dependency Treebank)☆71Updated last year
- An intelligent reading agent that understands text and translates it into Wikidata statements.☆116Updated 9 years ago
- A tool for analyzing the word histories of a text.☆37Updated last month
- Stanford Tregex-inspired language for rule-based dependency tree manipulation.☆21Updated 8 years ago
- Collection of tools for building diachronic/historical word vectors☆444Updated 2 years ago
- NLTK Contrib☆169Updated last year
- An open-source CRF Reference String Parsing Package☆160Updated 5 years ago
- The Zurich Dependency Parser for German☆89Updated 5 months ago
- Python bindings to the dutch NLP tool Frog (pos tagger, lemmatiser, NER tagger, morphological analysis, shallow parser, dependency parser…☆49Updated 10 months ago
- ☆31Updated 8 years ago
- Tools for Norwegian NLP based on the Norwegian Dependency Treebank.☆17Updated 8 years ago
- Sample implementation of a politeness model, trained on the Stanford Politeness Corpus☆148Updated 3 years ago
- command-line tool to extract taxonomies from Wikidata☆129Updated 6 years ago
- Use spaCy for NLP and output to the FoLiA XML format.☆12Updated last year
- Model Training tool for MITIE☆79Updated 10 years ago
- Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic pr…☆70Updated last month
- Named-Entity Recognition for Norwegian Bokmål and Nynorsk☆12Updated 6 years ago
- ☆16Updated 10 years ago
- Project on the history of genre.☆24Updated 5 years ago
- CONLL-U to Pandas DataFrame☆31Updated 8 years ago
- A toolkit for corpus linguistics☆206Updated 6 years ago
- UDPipe: Trainable pipeline for tokenizing, tagging, lemmatizing and parsing Universal Treebanks and other CoNLL-U files☆393Updated this week
- A part-of-speech tagger with support for domain adaptation and external resources.☆24Updated 3 years ago