proycon / LaMachine
LaMachine - A software distribution of our in-house as well as some 3rd party NLP software - Virtual Machine, Docker, or local compilation/installation script
☆69 · Updated 2 years ago
Alternatives and similar repositories for LaMachine
Users who are interested in LaMachine are comparing it to the repositories listed below.
- FoLiA Linguistic Annotation Tool -- Flat is a web-based linguistic annotation environment based around the FoLiA format (http://proycon.g… ☆113 · Updated 11 months ago
- Python bindings to the Dutch NLP tool Frog (pos tagger, lemmatiser, NER tagger, morphological analysis, shallow parser, dependency parser… ☆49 · Updated 9 months ago
- 🚀 GUI for training spaCy models ☆55 · Updated 4 years ago
- FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (inclu… ☆66 · Updated last month
- Named Entities Recognition Annotator Tool for Europeana Newspapers ☆61 · Updated 8 years ago
- Humanities Entity Recognition: robust, practical, efficient Named Entity Recognition for today's digital humanist ☆37 · Updated 6 years ago
- Anonymization of legal cases (Fr) based on Flair embeddings ☆87 · Updated 5 years ago
- Frog is an integration of memory-based natural language processing (NLP) modules developed for Dutch. All NLP modules are based on Timbl,… ☆79 · Updated last month
- Code and models for our CLEF-HIPE (Named Entity Processing on Historical Newspapers) submissions ☆19 · Updated 2 years ago
- An intelligent reading agent that understands text and translates it into Wikidata statements. ☆116 · Updated 9 years ago
- A Named-Entity Recogniser based on Grobid. ☆54 · Updated 8 months ago
- Essential NLP & ML, short & fast pure Python code ☆78 · Updated 3 months ago
- spacy-wordnet creates annotations that easily allow the use of wordnet and wordnet domains by using the nltk wordnet interface ☆261 · Updated 4 months ago
- Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic pr… ☆70 · Updated last month
- A set of workflows for corpus building through OCR, post-correction and normalisation ☆49 · Updated 3 years ago
- Repository for the word embeddings experiments described in "Evaluating Unsupervised Dutch Word Embeddings as a Linguistic Resource", pre… ☆84 · Updated 4 years ago
- Various utilities for processing the data. ☆216 · Updated this week
- Language Tool style grammar handling with spaCy 2.0 ☆42 · Updated 7 years ago
- Command-line tool to extract a ranked list of relevant keywords from a corpus with the option of using either topic modeling or tf-idf sc… ☆41 · Updated 8 years ago
- spaCy pipeline component for adding text readability meta data to Doc objects. ☆56 · Updated 6 years ago
- Linguistic search for large annotated text corpora, based on Apache Lucene ☆118 · Updated last week
- A machine learning tool for fishing entities ☆270 · Updated 7 months ago
- German Morphological Analyzer ☆51 · Updated 4 years ago
- GermaNER: Free Open German Named Entity Recognition Tool ☆36 · Updated 2 years ago
- Named Entity Recognition data for Europeana Newspapers ☆173 · Updated 2 years ago
- spaCy-to-naf converter ☆21 · Updated 7 months ago
- This repository contains the Framester resource, the main outcome of the framester project. ☆33 · Updated 2 months ago
- Python 3 library for processing historical English ☆67 · Updated last year
- UIMA CAS processing library written in Python ☆91 · Updated 2 months ago
- Regex-like pattern tree matching, but on a sentence's tree instead of strings ☆42 · Updated 7 years ago