argilla-io / biome-text
Custom Natural Language Processing with big and small models π²π±
β68Updated 3 years ago
Alternatives and similar repositories for biome-text:
Users that are interested in biome-text are comparing it to the libraries listed below
- spaCy pipeline component for generating spaCy KnowledgeBase Alias Candidates for Entity Linkingβ85Updated 2 years ago
- Data programming by demonstration for information extraction and span annotationβ35Updated 3 years ago
- A spaCy wrapper for DBpedia Spotlightβ109Updated 2 years ago
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Doβ¦β80Updated 9 months ago
- π« SpaCy wrapper for ConceptNet π«β92Updated last year
- classy is a simple-to-use library for building high-performance Machine Learning models in NLP.β86Updated this week
- spaCy match and replace, maintaining conjugationβ35Updated 2 years ago
- Summary Explorer is a tool to visually explore the state-of-the-art in text summarization.β44Updated 11 months ago
- π§ͺ Cutting-edge experimental spaCy components and featuresβ98Updated 11 months ago
- A lightweight but powerful library to build token indices for NLP tasks, compatible with major Deep Learning frameworks like PyTorch and β¦β51Updated 4 months ago
- Learning BPE embeddings by first learning a segmentation model and then training word2vecβ19Updated 2 years ago
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality β¦β106Updated last year
- β75Updated 3 years ago
- Explainable Zero-Shot Topic Extractionβ62Updated 7 months ago
- Automatically detect errors in annotated corpora.β47Updated last year
- Repository for the paper "Named Entity Recognition for Entity Linking: What Works and What's Next" (EMNLP 2021).β75Updated 3 years ago
- Model for learning document embeddings along with their uncertaintiesβ35Updated last year
- Examples for aligning, padding and batching sequence labeling data (NER) for use with pre-trained transformer modelsβ65Updated 2 years ago
- Augmenty is an augmentation library based on spaCy for augmenting texts.β153Updated 10 months ago
- Converter from UD-trees to BART representationβ36Updated last year
- A spaCy custom component that extracts and normalizes temporal expressionsβ54Updated 2 years ago
- Many Natural Language Processing tasks rely on sentence boundary detection (SBD). Although amazing libraries like spacy provide state of β¦β61Updated 4 years ago
- A multi-lingual approach to AllenNLP CoReference Resolution along with a wrapper for spaCy.β106Updated 11 months ago
- A Python library aimed at dissecting and augmenting NER training data.β58Updated last year
- Implementation, trained models and result data for the paper "Aspect-based Document Similarity for Research Papers" #COLING2020β62Updated 11 months ago
- On Generating Extended Summaries of Long Documentsβ78Updated 4 years ago
- Template Extraction from unstructured Wikipedia text using NLP techniques.β41Updated 4 years ago
- A embed able annotation tool for end to end cross document co-referenceβ42Updated 2 years ago
- A spaCy wrapper of OpenTapioca for named entity linking on Wikidataβ94Updated 2 years ago
- Simple library to work with pre-trained ELMo models in TensorFlowβ52Updated last year