Ben-Epstein / spacy-to-hfLinks
A simple converter from SpaCy Entities (Spans) to Huggingface BILOU formatted data (tokens and ner_tags)
☆16Updated last year
Alternatives and similar repositories for spacy-to-hf
Users that are interested in spacy-to-hf are comparing it to the libraries listed below
Sorting:
- Super Simple Similarities Service☆155Updated 9 months ago
- ☄️ Parallel and distributed training with spaCy and Ray☆56Updated 2 years ago
- Bag of, not words, but tricks!☆68Updated 2 years ago
- Pipeline components that support partial_fit.☆46Updated last year
- Template-based generation of DAG cards from Metaflow classes, inspired by Google cards for machine learning models.☆30Updated 4 years ago
- Generate reports for spaCy models.☆29Updated 3 years ago
- A Python library aimed at dissecting and augmenting NER training data.☆60Updated 2 years ago
- Confection: the sweetest config system for Python☆193Updated last month
- ☆30Updated 3 years ago
- ☆43Updated 2 years ago
- Python package for deduplication/entity resolution using active learning☆83Updated last year
- Efficient BM25 with DuckDB 🦆☆61Updated last year
- A comprehensive and scalable set of string tokenizers and similarity measures in Python☆142Updated last year
- this repo might get accepted☆28Updated 4 years ago
- Information extraction from English and German texts based on predicate logic☆141Updated 2 years ago
- It's a cooler way to store simple linear models.☆27Updated last year
- 🌸 fastText + Bloom embeddings for compact, full-coverage vectors with spaCy☆332Updated 9 months ago
- Super lightweight function registries for your library☆181Updated last year
- XAI based human-in-the-loop framework for automatic rule-learning.☆49Updated last year
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality …☆105Updated last year
- ☆68Updated 3 years ago
- An End-to-End Evaluation Framework for Entity Resolution Systems☆36Updated 2 years ago
- Machine learning prediction in pure Python☆86Updated 5 years ago
- Weakly Supervised End-to-End Learning (NeurIPS 2021)☆156Updated 2 years ago
- A PyTorch-based open-source framework that provides methods for improving the weakly annotated data and allows researchers to efficiently…☆108Updated last year
- SPEAR: Programmatically label and build training data quickly.☆109Updated last year
- ☆45Updated 2 years ago
- Sentence transformers models for SpaCy☆109Updated 2 years ago
- Playground for using large language models into the Modern Data Stack for entity matching☆108Updated 2 years ago
- ALMa (Active Learning Manager) Keeps track of labeled and unlabeled data for active learning☆43Updated 5 years ago