LanguageMachines / frog
Frog is an integration of memory-based natural language processing (NLP) modules developed for Dutch. All NLP modules are based on Timbl, the Tilburg memory-based learning software package.
☆73Updated last week
Related projects: ⓘ
- Python bindings to the dutch NLP tool Frog (pos tagger, lemmatiser, NER tagger, morphological analysis, shallow parser, dependency parser…☆47Updated last week
- Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic pr…☆65Updated last week
- This repo provides a python module to work with Open Dutch WordNet. It was created using python 3.4.☆64Updated 3 years ago
- FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (inclu…☆60Updated 4 months ago
- FoLiA Linguistic Annotation Tool -- Flat is a web-based linguistic annotation environment based around the FoLiA format (http://proycon.g…☆110Updated 2 months ago
- LaMachine - A software distribution of our in-house as well as some 3rd party NLP software - Virtual Machine, Docker, or local compilatio…☆68Updated last year
- Extension of the mate-tools NLP pipeline☆66Updated 8 years ago
- Multi Tier Annotation Search☆26Updated 3 years ago
- Collection of software components for natural language processing (NLP) based on the Apache UIMA framework.☆195Updated last month
- Base modules of JCoRe☆22Updated 4 months ago
- Repository for the word embeddings experiments described in "Evaluating Unsupervised Dutch Word Embeddings as a Linguistic Resource", pre…☆82Updated 3 years ago
- Named Entity Recognition data for Europeana Newspapers☆171Updated last year
- ConllEditor is a tool to edit dependency syntax trees in CoNLL-U format.☆53Updated this week
- DKPro C4CorpusTools is a collection of tools for processing CommonCrawl corpus, including Creative Commons license detection, boilerplate…☆49Updated 4 years ago
- Text-Induced Corpus Clean-up☆20Updated last year
- build/run the most current Stanford CoreNLP server in a docker container☆44Updated 5 months ago
- A set of workflows for corpus building through OCR, post-correction and normalisation☆48Updated 2 years ago
- Simple perceptron tagger trained using the NLTK on the NLCOW14 corpus.☆25Updated 6 years ago
- UIMA-based text classification framework built on top of DKPro Core and DKPro Lab.☆34Updated last year
- spaCy-to-naf converter☆21Updated 3 months ago
- T-scan: an analysis tool for dutch texts to assess the complexity of the text, based on original work by Rogier Kraf☆18Updated 2 months ago
- GermaNER: Free Open German Named Entity Recognition Tool☆36Updated 9 months ago
- Federated Knowledge Extraction Framework☆189Updated 10 months ago
- Lexicons for the Multilingual UCREL Semantic Analysis System☆38Updated last year
- Excitement Open Platform for Recognizing Textual Entailments☆86Updated 6 years ago
- A part-of-speech tagger with support for domain adaptation and external resources.☆22Updated last year
- Framework for creating and accessing UBY resources – sense-linked lexical resources in standard UBY-LMF format☆22Updated 6 years ago
- linguistic converter / merging tool for multi-level annotated corpora. graph-based (using Python and NetworkX).☆50Updated last year
- Named Entities Recognition Annotator Tool for Europeana Newspapers☆60Updated 6 years ago
- The Italian NLP Tool☆70Updated last year