LaMachine - A software distribution of our in-house as well as some 3rd party NLP software - Virtual Machine, Docker, or local compilation/installation script
☆69Sep 11, 2023Updated 2 years ago
Alternatives and similar repositories for LaMachine
Users that are interested in LaMachine are comparing it to the libraries listed below
Sorting:
- An extensive Python library for dealing with FoLiA (Format for Linguistic Annotation) documents, a rich XML-based format for linguistic a…☆18Nov 18, 2024Updated last year
- Frog is an integration of memory-based natural language processing (NLP) modules developed for Dutch. All NLP modules are based on Timbl,…☆80Dec 11, 2025Updated 2 months ago
- A Python implementation of word2vec that allows custom sampling strategies☆10Jan 30, 2014Updated 12 years ago
- FoLiA Linguistic Annotation Tool -- Flat is a web-based linguistic annotation environment based around the FoLiA format (http://proycon.g…☆113Jan 24, 2025Updated last year
- Use spaCy for NLP and output to the FoLiA XML format.☆12Feb 27, 2024Updated 2 years ago
- Multi Tier Annotation Search☆26May 12, 2021Updated 4 years ago
- Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic pr…☆70Feb 9, 2026Updated 2 weeks ago
- FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (inclu…☆66Dec 9, 2025Updated 2 months ago
- This is a new backend implementation of the ANNIS linguistic search and visualization system.☆18Jan 13, 2026Updated last month
- A set of workflows for corpus building through OCR, post-correction and normalisation☆49Sep 7, 2022Updated 3 years ago
- Python bindings to the dutch NLP tool Frog (pos tagger, lemmatiser, NER tagger, morphological analysis, shallow parser, dependency parser…☆49Feb 2, 2026Updated 3 weeks ago
- Text conversion tool (from e.g. Word, HTML, txt) to corpus formats TEI or FoLiA)☆23Feb 11, 2022Updated 4 years ago
- Collaborative Synchronized Corpus Annotation Tool☆11Dec 31, 2018Updated 7 years ago
- ☆11Oct 10, 2017Updated 8 years ago
- Automatic text comparison with an extendable variance classifier☆13Sep 11, 2023Updated 2 years ago
- An extension for pymongo that adds json schema validation and index management☆13Oct 19, 2019Updated 6 years ago
- Pure Rust port of CRFsuite: a fast implementation of Conditional Random Fields (CRFs)☆30Feb 1, 2026Updated 3 weeks ago
- ☆27Feb 2, 2021Updated 5 years ago
- PyNLPl, pronounced as 'pineapple', is a Python library for Natural Language Processing. It contains various modules useful for common, an…☆477Sep 14, 2023Updated 2 years ago
- Alpino in Docker☆10Nov 19, 2025Updated 3 months ago
- Repository for creating models, vocabulary and other necessities for Dutch in Spacey☆11Dec 15, 2016Updated 9 years ago
- Entity linker for the newspaper collection of the National Library of the Netherlands. Links named entity mentions to DBpedia description…☆11Dec 8, 2022Updated 3 years ago
- Repository for the word embeddings experiments described in "Evaluating Unsupervised Dutch Word Embeddings as a Linguistic Resource", pre…☆84Jun 11, 2021Updated 4 years ago
- Multi Tier Annotation Search☆12May 13, 2024Updated last year
- ☆16Oct 28, 2024Updated last year
- ☆16Jan 7, 2026Updated last month
- Wasm HTTP middleware, agnostic, efficient and blazing fast.☆18Sep 27, 2025Updated 5 months ago
- Virtual Language Observatory☆18Updated this week
- A simple Java application for managing an OAI-PMH harvesting workflow☆14Jan 18, 2026Updated last month
- T-scan: an analysis tool for dutch texts to assess the complexity of the text, based on original work by Rogier Kraf☆19May 28, 2025Updated 9 months ago
- Performs pairwise preference ranking for a given trainfile and testfile with binary class labels (1 and not 1). The binary classification…☆14Jul 12, 2017Updated 8 years ago
- A collection of ipython/jupyter notebooks☆16Jan 31, 2019Updated 7 years ago
- finite-state toolkit, EM and Bayesian (Gibbs sampling) training for FST and context-free derivation forests☆41Oct 14, 2022Updated 3 years ago
- an approximate string matching or fuzzy-matching system for spelling correction, normalisation or post-OCR correction (mirror of https://…☆37Feb 10, 2026Updated 2 weeks ago
- A wrapper for the Stuttgart Finite State Transducer Tools (SFST).☆15May 27, 2020Updated 5 years ago
- IXA pipes Part of Speech tagger and Lemmatizer (http://ixa2.si.ehu.es/ixa-pipes)☆18Nov 18, 2022Updated 3 years ago
- Guidelines for software quality & sustainability (CLARIAH WP2 task 54.100)☆18May 29, 2022Updated 3 years ago
- JavaScript visualizations of various DELPH-IN structures.☆17Feb 3, 2022Updated 4 years ago
- Semantic Annotation Without the Pointy Brackets☆160Jan 30, 2024Updated 2 years ago