citiususc / Linguakit
Multilingual toolkit for NLP: dependency parser, PoS tagger, NERC, multiword extractor, sentiment analysis, etc.
☆65 Updated 8 months ago
Related projects
Alternatives and complementary repositories for Linguakit
- A natural language processing tool for automatically detecting quotations in text. ☆15 Updated 2 years ago
- Named entity extraction from Portuguese web text ☆71 Updated 7 years ago
- FoLiA Linguistic Annotation Tool -- Flat is a web-based linguistic annotation environment based around the FoLiA format (http://proycon.g… ☆110 Updated 4 months ago
- spaCy + UDPipe ☆161 Updated 2 years ago
- ConllEditor is a tool to edit dependency syntax trees in CoNLL-U format. ☆54 Updated last week
- Polyglot is a language identifier for detecting text documents containing text written in more than one language, and for identifying the… ☆32 Updated 8 years ago
- Dependency Syntactic Parsing for Portuguese, Spanish, English, and Galician, including MetaRomance parser ☆10 Updated 6 years ago
- spaCy pipeline component for adding text readability metadata to Doc objects. ☆56 Updated 5 years ago
- Resources developed by and for the project REACTION (Retrieval, Extraction and Aggregation Computing Technology for Integrating and Organ… ☆9 Updated 2 years ago
- Various utilities for processing the data. ☆207 Updated this week
- A Python module for interfacing with TreeTagger by Helmut Schmid. ☆77 Updated 3 years ago
- Named Entity Recognition data for Europeana Newspapers ☆173 Updated last year
- A tool for text normalisation via character-level machine translation ☆13 Updated 4 years ago
- Multilingual NLP annotation projection ☆50 Updated 2 years ago
- spaCy-to-naf converter ☆21 Updated 5 months ago
- FreeLing project source code ☆255 Updated last year
- A minimal, pure Python library to interface with CoNLL-U format files. ☆149 Updated last year
- Learning by Reading pipeline of NLP and Entity Linking tools ☆82 Updated last year
- The Broad Twitter Corpus, an NER dataset in English stratified for time, location, social media genre, socioeconomic factors (COLING 2016… ☆65 Updated 2 years ago
- A Python module to process data for Frame Semantic Parsing ☆23 Updated 4 years ago
- This repository contains the Framester resource, the main outcome of the Framester project. ☆34 Updated 4 years ago
- Inter-annotator agreement for Doccano ☆27 Updated 4 years ago
- ☆14 Updated last year
- Python code for reading Brat repositories. Supports saving to and reading from XML files for easy access to annotations. ☆41 Updated 5 years ago
- ADEL is a robust and efficient entity linking framework that is adaptive to text genres and language, entity types for the classification… ☆17 Updated 4 years ago
- Linguistic and stylistic complexity measures for (literary) texts ☆77 Updated 10 months ago
- Humanities Entity Recognition: robust, practical, efficient Named Entity Recognition for today's digital humanist ☆38 Updated 5 years ago
- 💫 Scripts, tools and resources for developing spaCy ☆125 Updated 5 years ago
- Python bindings to the Dutch NLP tool Frog (pos tagger, lemmatiser, NER tagger, morphological analysis, shallow parser, dependency parser… ☆47 Updated 3 weeks ago
- C++ implementation of Generalised Brown clustering and Python scripts for feature generation ☆41 Updated 8 years ago