explosion / wikid
Generate a SQLite database from Wikipedia & Wikidata dumps.
★33, updated last year
Alternatives and similar repositories for wikid:
Users who are interested in wikid are comparing it to the libraries listed below.
- Use Hugging Face text and token classification pipelines directly in spaCy (★63, updated last year)
- SpaCy wrapper for ConceptNet (★90, updated last year)
- (★30, updated 2 years ago)
- spaCy-wrap is a wrapper library for spaCy for including fine-tuned transformers from Huggingface in your spaCy pipeline allowing you to i… (★46, updated 11 months ago)
- spaCy match and replace, maintaining conjugation (★35, updated 2 years ago)
- Cutting-edge experimental spaCy components and features (★98, updated 11 months ago)
- A Prodigy plugin for evaluating spaCy pipelines (★13, updated last year)
- (★70, updated 2 years ago)
- Data and evaluation code for the paper WikiNEuRal: Combined Neural and Knowledge-based Silver Data Creation for Multilingual NER (EMNLP 2… (★68, updated 2 years ago)
- spaCy entry points for Curated Transformers (★27, updated 6 months ago)
- (★46, updated 2 years ago)
- Augmenty is an augmentation library based on spaCy for augmenting texts. (★153, updated 10 months ago)
- Work with static vector models (★23, updated 2 months ago)
- A spaCy wrapper of OpenTapioca for named entity linking on Wikidata (★94, updated 2 years ago)
- Information extraction from English and German texts based on predicate logic (★135, updated last year)
- spaCy pipeline component for generating spaCy KnowledgeBase Alias Candidates for Entity Linking (★85, updated 2 years ago)
- Train floret vectors (★18, updated last year)
- Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP models… (★37, updated 2 years ago)
- Python package for deduplication/entity resolution using active learning (★77, updated 7 months ago)
- SpaCyEx allows the creation of spaCy Matcher patterns with RegEx like syntax. (★59, updated 10 months ago)
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do… (★80, updated 8 months ago)
- A PyTorch-based open-source framework that provides methods for improving the weakly annotated data and allows researchers to efficiently… (★108, updated 6 months ago)
- A spaCy custom component that extracts and normalizes temporal expressions (★54, updated 2 years ago)
- Hashformers is a framework for hashtag segmentation with Transformers and Large Language Models (LLMs). (★70, updated 7 months ago)
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata (★158, updated 2 years ago)
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models. (★107, updated 10 months ago)
- Healthsea is a spaCy pipeline for analyzing user reviews of supplementary products for their effects on health. (★91, updated 3 years ago)
- Language detection using Spacy and Fasttext (★55, updated last year)
- Push your spaCy pipelines to the Hugging Face Hub (★43, updated 9 months ago)
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy. (★78, updated last year)