saffsd / polyglotLinks
Polyglot is a language identifier for detecting text documents containing text written in more than one language, and for identifying the languages therein.
☆33Updated 8 years ago
Alternatives and similar repositories for polyglot
Users that are interested in polyglot are comparing it to the libraries listed below
Sorting:
- framework for doing NER and other types of entity recognition, in Python☆68Updated 2 years ago
- ☆55Updated 9 years ago
- A python wrapper for Semaphore, a Shallow Semantic Parser that identifies roles in a text.☆12Updated 11 years ago
- spaCy-to-naf converter☆21Updated 11 months ago
- Labeled examples from wiki dumps in Python☆67Updated 8 years ago
- Open-source tools for morphological tagging, segmentation and stemming.☆40Updated 5 years ago
- A Dependency Parser for Tweets☆78Updated 5 years ago
- Normalizes lexically ill-formed text to its most likely clean text, e.g. "c u thr 2nite!" -> "see you there tonight!".☆63Updated 9 years ago
- Keras implementation of ontology aware token embeddings☆48Updated 6 years ago
- Python interface for converting Penn Treebank trees to Stanford Dependencies and Universal Depenencies☆70Updated 6 years ago
- ADEL is a robust and efficient entity linking framework that is adaptive to text genres and language, entity types for the classification…☆19Updated 5 years ago
- A python module to process data for Frame Semantic Parsing☆24Updated 4 years ago
- Socially-Equitable Language Identification☆78Updated 2 years ago
- CrowdTruth framework for crowdsourcing ground truth for training & evaluation of AI systems☆61Updated last year
- A way to do annotations for NER. TALEN: Tool for Annotation of Low-resource ENtities☆115Updated 3 years ago
- Clinical spelling correction with word and character n-gram embeddings.☆74Updated 2 years ago
- An example of how to use spaCy for extremely large files without running into memory issues☆36Updated 2 years ago
- A natural language processing tool for automatically detecting quotations in text.☆15Updated 3 years ago
- ☆41Updated 8 years ago
- Python code for reading Brat Repositories. Supports saving and reading from XML files for easy acces to annotations.☆42Updated 5 years ago
- A tool for text normalisation via character-level machine translation☆13Updated 4 years ago
- Fast supervised sentence boundary detection using the averaged perceptron☆90Updated 6 years ago
- Temporal Expression Recognition and Normalisation in Python☆78Updated 9 years ago
- C++ implementation of Generalised Brown clustering and python scripts for feature generation☆41Updated 9 years ago
- Language Tool style grammar handling with spaCy 2.0☆42Updated 6 years ago
- DKPro C4CorpusTools is a collection of tools for processing CommonCrawl corpus, including Creative Commons license detection, boilerplate…☆52Updated 4 years ago
- A temporal ordering system for events and time expressions in written text.☆43Updated 3 years ago
- The Broad Twitter Corpus, an NER dataset in English stratified for time, location, social media genre, socioeconomic factors (COLING 2016…☆68Updated 3 years ago
- Regex like pattern tree matching but on sentence's tree instead of Strings☆42Updated 7 years ago
- ☆64Updated 2 years ago