argilla-io / biome-text
Custom Natural Language Processing with big and small models 🌲🌱
★68 · Updated 3 years ago
Alternatives and similar repositories for biome-text:
Users who are interested in biome-text are comparing it to the libraries listed below.
- Many Natural Language Processing tasks rely on sentence boundary detection (SBD). Although amazing libraries like spacy provide state of … ★61 · Updated 4 years ago
- Summary Explorer is a tool to visually explore the state-of-the-art in text summarization. ★44 · Updated 8 months ago
- Augmenty is an augmentation library based on spaCy for augmenting texts. ★151 · Updated 8 months ago
- Data programming by demonstration for information extraction and span annotation ★35 · Updated 3 years ago
- spaCy match and replace, maintaining conjugation ★35 · Updated 2 years ago
- Easy-to-use text representations extraction library based on the Transformers library. ★32 · Updated 2 years ago
- Using questions to summarize large amounts of textual data. ★25 · Updated 4 years ago
- ★42 · Updated last year
- ★74 · Updated 3 years ago
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality … ★106 · Updated 11 months ago
- ★66 · Updated 4 years ago
- A lightweight but powerful library to build token indices for NLP tasks, compatible with major Deep Learning frameworks like PyTorch and … ★51 · Updated last month
- A set of methods for finding an appropriate number of topics in a text collection ★15 · Updated 5 months ago
- Converter from UD-trees to BART representation ★36 · Updated 10 months ago
- classy is a simple-to-use library for building high-performance Machine Learning models in NLP. ★86 · Updated 3 weeks ago
- Implementation, trained models and result data for the paper "Aspect-based Document Similarity for Research Papers" #COLING2020 ★62 · Updated 9 months ago
- An example of how to use spaCy for extremely large files without running into memory issues ★36 · Updated 2 years ago
- A monolingual and cross-lingual meta-embedding generation and evaluation framework ★80 · Updated 2 years ago
- Topic Inference with Zeroshot models ★61 · Updated last year
- numeric fused-head identification and resolution ★33 · Updated 5 years ago
- A spaCy custom component that extracts and normalizes temporal expressions ★52 · Updated last year
- Code and data accompanying the paper "Approaching nested named entity recognition with parallel LSTM-CRFs." ★26 · Updated 2 years ago
- Examples for aligning, padding and batching sequence labeling data (NER) for use with pre-trained transformer models ★65 · Updated 2 years ago
- Data Programming by Demonstration (DPBD) for Document Classification ★35 · Updated 3 years ago
- sequence tagging with spaCy and crfsuite ★19 · Updated last year
- Semantic search using Transformers and others ★110 · Updated 4 years ago
- A simple neural truecaser written in pytorch and allennlp. ★32 · Updated 7 months ago
- An Interactive Tool for Scalable and Reproducible Error Analysis. ★105 · Updated 3 years ago
- Visualise, evaluate, and manage annotated data ★33 · Updated 2 years ago
- Experiments for data quality in Rasa. ★34 · Updated 2 years ago