LanguageMachines / frog
Frog is an integration of memory-based natural language processing (NLP) modules developed for Dutch. All NLP modules are based on Timbl, the Tilburg memory-based learning software package.
☆77 Updated 5 months ago
Alternatives and similar repositories for frog
Users who are interested in frog compare it to the libraries listed below.
- Python bindings to the Dutch NLP tool Frog (POS tagger, lemmatiser, NER tagger, morphological analysis, shallow parser, dependency parser… ☆49 Updated 2 months ago
- Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic pr… ☆68 Updated 3 months ago
- This repo provides a python module to work with Open Dutch WordNet. It was created using python 3.4. ☆67 Updated 4 years ago
- LaMachine - A software distribution of our in-house as well as some 3rd party NLP software - Virtual Machine, Docker, or local compilatio… ☆68 Updated last year
- FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (inclu… ☆63 Updated last year
- FoLiA Linguistic Annotation Tool -- Flat is a web-based linguistic annotation environment based around the FoLiA format (http://proycon.g… ☆112 Updated 4 months ago
- Socially-Equitable Language Identification ☆78 Updated 2 years ago
- Named Entity Recognition data for Europeana Newspapers ☆171 Updated 2 years ago
- A tool for automatic spelling normalization ☆20 Updated 4 years ago
- Collection of software components for natural language processing (NLP) based on the Apache UIMA framework. ☆198 Updated 6 months ago
- UIMA-based text classification framework built on top of DKPro Core and DKPro Lab. ☆34 Updated 2 years ago
- GermaNER: Free Open German Named Entity Recognition Tool ☆36 Updated last year
- Indra is a Web Service which allows easy access to different distributional semantics models in several languages. ☆48 Updated last month
- Language detection extension for spaCy 2.0+ ☆112 Updated 6 years ago
- 110k Dutch Book Reviews Dataset for Sentiment Analysis ☆29 Updated last year
- Multi Tier Annotation Search ☆26 Updated 4 years ago
- Repository for the word embeddings experiments described in "Evaluating Unsupervised Dutch Word Embeddings as a Linguistic Resource", pre… ☆83 Updated 3 years ago
- Alpino in Docker ☆10 Updated 10 months ago
- 🤘Lemmy is a lemmatizer for Danish 🇩🇰 and Swedish 🇸🇪 ☆77 Updated 3 years ago
- An unsupervised compound splitter ☆41 Updated 5 years ago
- Extension of the mate-tools NLP pipeline ☆67 Updated 9 years ago
- This is a Python binding to the tokenizer Ucto. Tokenisation is one of the first steps in almost any Natural Language Processing task, yet… ☆29 Updated 5 months ago
- Pikes is a Knowledge Extraction Suite ☆23 Updated last year
- Named Entities Recognition Annotator Tool for Europeana Newspapers ☆60 Updated 7 years ago
- A part-of-speech tagger with support for domain adaptation and external resources. ☆23 Updated 2 years ago
- A set of workflows for corpus building through OCR, post-correction and normalisation ☆49 Updated 2 years ago
- German Morphological Analyzer ☆47 Updated 3 years ago
- Federated Knowledge Extraction Framework ☆192 Updated last year
- spaCy-to-naf converter ☆21 Updated last year
- Anonymization of legal cases (Fr) based on Flair embeddings ☆88 Updated 4 years ago