LaMachine - A software distribution of our in-house as well as some 3rd party NLP software - Virtual Machine, Docker, or local compilation/installation script
☆69 · Sep 11, 2023 · Updated 2 years ago
Alternatives and similar repositories for LaMachine
Users who are interested in LaMachine are comparing it to the repositories listed below
- Frog is an integration of memory-based natural language processing (NLP) modules developed for Dutch. All NLP modules are based on Timbl,… ☆80 · Mar 2, 2026 · Updated 2 weeks ago
- An extensive Python library for dealing with FoLiA (Format for Linguistic Annotation) documents, a rich XML-based format for linguistic a… ☆18 · Nov 18, 2024 · Updated last year
- Text-Induced Corpus Clean-up ☆20 · Jun 20, 2023 · Updated 2 years ago
- FoLiA library for C++ ☆17 · Mar 12, 2026 · Updated last week
- A set of workflows for corpus building through OCR, post-correction and normalisation ☆49 · Sep 7, 2022 · Updated 3 years ago
- Collaborative Synchronized Corpus Annotation Tool ☆11 · Dec 31, 2018 · Updated 7 years ago
- FoLiA Linguistic Annotation Tool -- Flat is a web-based linguistic annotation environment based around the FoLiA format (http://proycon.g… ☆113 · Jan 24, 2025 · Updated last year
- Python bindings to the Dutch NLP tool Frog (pos tagger, lemmatiser, NER tagger, morphological analysis, shallow parser, dependency parser… ☆49 · Feb 2, 2026 · Updated last month
- Multi Tier Annotation Search ☆26 · May 12, 2021 · Updated 4 years ago
- Use spaCy for NLP and output to the FoLiA XML format. ☆12 · Feb 27, 2024 · Updated 2 years ago
- Entity linker for the newspaper collection of the National Library of the Netherlands. Links named entity mentions to DBpedia description… ☆11 · Dec 8, 2022 · Updated 3 years ago
- Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic pr… ☆70 · Mar 13, 2026 · Updated last week
- A Python implementation of word2vec that allows custom sampling strategies ☆10 · Jan 30, 2014 · Updated 12 years ago
- FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (inclu… ☆65 · Dec 9, 2025 · Updated 3 months ago
- Tools for TICCL ☆14 · Dec 12, 2025 · Updated 3 months ago
- Experiment, Storage and Visualization Framework for Machine Learning research. ☆31 · May 19, 2021 · Updated 4 years ago
- Quickly turn command-line applications into RESTful webservices with a web-application front-end. You provide a specification of your com… ☆135 · Mar 12, 2026 · Updated last week
- Repository for the word embeddings experiments described in "Evaluating Unsupervised Dutch Word Embeddings as a Linguistic Resource", pre… ☆84 · Jun 11, 2021 · Updated 4 years ago
- IXA pipes Part of Speech tagger and Lemmatizer (http://ixa2.si.ehu.es/ixa-pipes) ☆18 · Nov 18, 2022 · Updated 3 years ago
- Given a text, wrap it into phrases and send them to Yandex's search engine. If it yields a "did you mean:", substitute the original phras… ☆11 · Dec 13, 2018 · Updated 7 years ago
- This is a new backend implementation of the ANNIS linguistic search and visualization system. ☆18 · Jan 13, 2026 · Updated 2 months ago
- PyNLPl, pronounced as 'pineapple', is a Python library for Natural Language Processing. It contains various modules useful for common, an… ☆477 · Sep 14, 2023 · Updated 2 years ago
- Specification of a stand-off element for the TEI guidelines ☆12 · Apr 29, 2021 · Updated 4 years ago
- A tool that helps with analysis of obfuscated JavaScript ☆11 · Dec 15, 2023 · Updated 2 years ago
- An extension for pymongo that adds json schema validation and index management ☆13 · Oct 19, 2019 · Updated 6 years ago
- Colibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipg… ☆129 · Feb 5, 2026 · Updated last month
- pronunciation LEXicons for Any Low-resource Language ☆21 · Jul 14, 2020 · Updated 5 years ago
- Tools for text processing of structured product labels ☆12 · Mar 27, 2020 · Updated 5 years ago
- Semanticizest: dump parser and client ☆20 · May 11, 2016 · Updated 9 years ago
- Context-enhanced Adaptive Entity Linking ☆13 · Mar 21, 2016 · Updated 9 years ago
- Pure Rust port of CRFsuite: a fast implementation of Conditional Random Fields (CRFs) ☆30 · Mar 1, 2026 · Updated 2 weeks ago
- 💜 A modern, community-driven server controller for TrackMania Forever ☆20 · Oct 29, 2024 · Updated last year
- Interactive TOpic Model and MEtadata Visualization. Live at: tome.lmc.gatech.edu ☆13 · May 6, 2019 · Updated 6 years ago
- NLP pipeline software using common workflow language ☆35 · Apr 22, 2019 · Updated 6 years ago
- Fast and robust NLP components implemented in Java. ☆53 · Oct 13, 2020 · Updated 5 years ago
- Sisyphe is a modulable NodeJS BIG-DATA analyser & transformer ☆12 · Oct 16, 2023 · Updated 2 years ago
- Guidelines for software quality & sustainability (CLARIAH WP2 task 54.100) ☆18 · May 29, 2022 · Updated 3 years ago
- Performs pairwise preference ranking for a given trainfile and testfile with binary class labels (1 and not 1). The binary classification… ☆14 · Jul 12, 2017 · Updated 8 years ago