explosion / wikid
Generate a SQLite database from Wikipedia & Wikidata dumps.
☆35 · Updated last year
Alternatives and similar repositories for wikid:
Users who are interested in wikid are comparing it to the libraries listed below:
- spaCy-wrap is a wrapper library for spaCy for including fine-tuned transformers from Huggingface in your spaCy pipeline allowing you to i… ☆46 · Updated last year
- 🧪 Cutting-edge experimental spaCy components and features ☆98 · Updated last year
- Use Hugging Face text and token classification pipelines directly in spaCy ☆63 · Updated last year
- ☆30 · Updated 2 years ago
- Legal document classification with EuroVoc descriptors on 22 languages. ☆26 · Updated last year
- SpaCy wrapper for ConceptNet ☆92 · Updated last year
- spaCy entry points for Curated Transformers ☆29 · Updated 7 months ago
- ☆70 · Updated 2 years ago
- A Prodigy plugin for evaluating spaCy pipelines ☆13 · Updated last year
- Work with static vector models ☆28 · Updated 2 weeks ago
- Information extraction from English and German texts based on predicate logic ☆135 · Updated last year
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models. ☆108 · Updated 11 months ago
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata ☆161 · Updated 2 years ago
- A spaCy wrapper for DBpedia Spotlight ☆109 · Updated 2 years ago
- Augmenty is an augmentation library based on spaCy for augmenting texts. ☆153 · Updated 11 months ago
- A spaCy custom component that extracts and normalizes temporal expressions ☆54 · Updated 2 years ago
- Healthsea is a spaCy pipeline for analyzing user reviews of supplementary products for their effects on health. ☆91 · Updated 3 years ago
- Source code and data for Like a Good Nearest Neighbor ☆28 · Updated 3 months ago
- SpaCyEx allows the creation of spaCy Matcher patterns with RegEx like syntax. ☆59 · Updated last year
- Plug-and-play document processing pipelines. No training. Batteries included. ☆57 · Updated last week
- spaCy pipeline component for generating spaCy KnowledgeBase Alias Candidates for Entity Linking ☆85 · Updated 2 years ago
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality … ☆106 · Updated last year
- The CleanCoNLL dataset from our EMNLP 2023 paper where we corrected annotation errors and inconsistencies in CoNLL-03. ☆24 · Updated 10 months ago
- Coreference resolution for English, French, German and Polish, optimised for limited training data and easily extensible for further lang… ☆122 · Updated last year
- A spaCy wrapper of OpenTapioca for named entity linking on Wikidata ☆94 · Updated 2 years ago
- Generate reports for spaCy models. ☆29 · Updated 2 years ago
- Document level Attitude and Relation Extraction toolkit (AREkit) for sampling and processing large text collections with ML and for ML ☆63 · Updated 3 months ago
- XAI based human-in-the-loop framework for automatic rule-learning. ☆48 · Updated 10 months ago
- PyTorch implementation of a BiLSTM model for the Wikification project. ☆19 · Updated 5 years ago
- A utility for labeling clusters of text data. ☆28 · Updated 3 years ago