modelpredict / language-identification-survey
Live survey of off-the-shelf language identification tools for python
β26Updated 2 years ago
Alternatives and similar repositories for language-identification-survey:
Users that are interested in language-identification-survey are comparing it to the libraries listed below
- A spaCy custom component that extracts and normalizes temporal expressionsβ54Updated 2 years ago
- Augmenty is an augmentation library based on spaCy for augmenting texts.β151Updated 8 months ago
- π§ͺ Cutting-edge experimental spaCy components and featuresβ96Updated 9 months ago
- Coreference Resolutionβ74Updated 3 years ago
- A monolingual and cross-lingual meta-embedding generation and evaluation frameworkβ80Updated 2 years ago
- Sentence transformers models for SpaCyβ107Updated last year
- Examples for aligning, padding and batching sequence labeling data (NER) for use with pre-trained transformer modelsβ65Updated 2 years ago
- A High-level Library for Named Entity Recognition in Python.β23Updated last year
- A library to synthesize text datasets using Large Language Models (LLM)β151Updated 2 years ago
- Coreference resolution for English, French, German and Polish, optimised for limited training data and easily extensible for further langβ¦β120Updated 9 months ago
- Google USE (Universal Sentence Encoder) for spaCyβ182Updated last year
- Information extraction from English and German texts based on predicate logicβ135Updated last year
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality β¦β106Updated 11 months ago
- Language Models for Zalando's flair libraryβ61Updated 5 years ago
- A Word Sense Disambiguation system integrating implicit and explicit external knowledge.β68Updated 3 years ago
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Doβ¦β79Updated 7 months ago
- Fuzzy matching and more functionality for spaCy.β254Updated 7 months ago
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidataβ157Updated 2 years ago
- A multilingual version of MS MARCO passage ranking datasetβ143Updated last year
- Text tokenization and sentence segmentation (segtok v2)β201Updated 2 years ago
- A Python library aimed at dissecting and augmenting NER training data.β58Updated last year
- spaCy-wrap is a wrapper library for spaCy for including fine-tuned transformers from Huggingface in your spaCy pipeline allowing you to iβ¦β46Updated 10 months ago
- A spaCy wrapper for DBpedia Spotlightβ108Updated last year
- Automatically detect errors in annotated corpora.β47Updated last year
- β16Updated 2 years ago
- In the wild extraction of entities that are found using Flair and displayed using a very elegant front-end.β71Updated 2 years ago
- The official tool for transforming doccano format into common dataset formats.β106Updated last year
- LASER multilingual sentence embeddings as a pip packageβ224Updated last year
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' puβ¦β40Updated 3 years ago
- β84Updated 5 months ago