LaMachine - A software distribution of our in-house as well as some 3rd party NLP software - Virtual Machine, Docker, or local compilation/installation script
☆69Sep 11, 2023Updated 2 years ago
Alternatives and similar repositories for LaMachine
Users that are interested in LaMachine are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An extensive Python library for dealing with FoLiA (Format for Linguistic Annotation) documents, a rich XML-based format for linguistic a…☆18Nov 18, 2024Updated last year
- Text-Induced Corpus Clean-up☆20Jun 20, 2023Updated 2 years ago
- FoLiA library for C++☆17Mar 25, 2026Updated 2 months ago
- A set of workflows for corpus building through OCR, post-correction and normalisation☆49Sep 7, 2022Updated 3 years ago
- Collaborative Synchronized Corpus Annotation Tool☆10Dec 31, 2018Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- FoLiA Linguistic Annotation Tool -- Flat is a web-based linguistic annotation environment based around the FoLiA format (http://proycon.g…☆113Jan 24, 2025Updated last year
- Python bindings to the dutch NLP tool Frog (pos tagger, lemmatiser, NER tagger, morphological analysis, shallow parser, dependency parser…☆49Feb 2, 2026Updated 4 months ago
- Use spaCy for NLP and output to the FoLiA XML format.☆12Feb 27, 2024Updated 2 years ago
- Entity linker for the newspaper collection of the National Library of the Netherlands. Links named entity mentions to DBpedia description…☆11Dec 8, 2022Updated 3 years ago
- Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic pr…☆70May 8, 2026Updated last month
- A Python implementation of word2vec that allows custom sampling strategies☆10Jan 30, 2014Updated 12 years ago
- FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (inclu…☆66Dec 9, 2025Updated 6 months ago
- This repo provides a python module to work with Open Dutch WordNet. It was created using python 3.4.☆69Apr 30, 2026Updated last month
- Quickly turn command-line applications into RESTful webservices with a web-application front-end. You provide a specification of your com…☆135May 28, 2026Updated last week
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Repository for the word embeddings experiments described in "Evaluating Unsupervised Dutch Word Embeddings as a Linguistic Resource", pre…☆85Jun 11, 2021Updated 4 years ago
- Python API for KB data-services☆20Jan 30, 2020Updated 6 years ago
- A workflow system for Natural Language Processing.☆21Oct 17, 2019Updated 6 years ago
- IXA pipes Part of Speech tagger and Lemmatizer (http://ixa2.si.ehu.es/ixa-pipes)☆19May 8, 2026Updated last month
- Given a text, wrap it into phrases and send them to Yandex's search engine. If it yields a "did you mean:", substitute the original phras…☆11Dec 13, 2018Updated 7 years ago
- This is a new backend implementation of the ANNIS linguistic search and visualization system.☆19Updated this week
- PyNLPl, pronounced as 'pineapple', is a Python library for Natural Language Processing. It contains various modules useful for common, an…☆476Sep 14, 2023Updated 2 years ago
- Specification of a stand-off element for the TEI guidelines☆12Apr 29, 2021Updated 5 years ago
- Free and open source Tableau alternative that generates Python Pandas code☆12Aug 23, 2018Updated 7 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- ☆11Oct 10, 2017Updated 8 years ago
- Colibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipg…☆130Feb 5, 2026Updated 4 months ago
- pronunciation LEXicons for Any Low-resource Language☆21Jul 14, 2020Updated 5 years ago
- Tools for text processing of structured product labels☆12Mar 27, 2020Updated 6 years ago
- Semanticizest: dump parser and client☆20May 11, 2016Updated 10 years ago
- Context-enhanced Adaptive Entity Linking☆13Mar 21, 2016Updated 10 years ago
- T-scan: an analysis tool for dutch texts to assess the complexity of the text, based on original work by Rogier Kraf☆19May 28, 2025Updated last year
- Materials of FutureTDM project☆11Aug 22, 2017Updated 8 years ago
- Repository for creating models, vocabulary and other necessities for Dutch in Spacey☆11Dec 15, 2016Updated 9 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- NLP pipeline software using common workflow language☆35Apr 22, 2019Updated 7 years ago
- Sisyphe is a modulable NodeJS BIG-DATA analyser & transformer☆12Oct 16, 2023Updated 2 years ago
- Guidelines for software quality & sustainability (CLARIAH WP2 task 54.100)☆18May 29, 2022Updated 4 years ago
- finite-state toolkit, EM and Bayesian (Gibbs sampling) training for FST and context-free derivation forests☆41Oct 14, 2022Updated 3 years ago
- An end-user environment for working with data in the CITE environment—browsing and analyzing texts, viewing objects and images, visualizi…☆15May 5, 2020Updated 6 years ago
- Performs pairwise preference ranking for a given trainfile and testfile with binary class labels (1 and not 1). The binary classification…☆14Jul 12, 2017Updated 8 years ago
- RhetoricalRecursiveNeuralNetwork(R2N2) is recursive neural network using RST for NLP Tasks such as Sentiment Analysis☆12Sep 2, 2015Updated 10 years ago