explosion / wikid
Generate a SQLite database from Wikipedia & Wikidata dumps.
β30Updated 9 months ago
Alternatives and similar repositories for wikid:
Users that are interested in wikid are comparing it to the libraries listed below
- π§ͺ Cutting-edge experimental spaCy components and featuresβ96Updated 8 months ago
- spaCy match and replace, maintaining conjugationβ35Updated 2 years ago
- π« SpaCy wrapper for ConceptNet π«β89Updated last year
- spaCy-wrap is a wrapper library for spaCy for including fine-tuned transformers from Huggingface in your spaCy pipeline allowing you to iβ¦β46Updated 9 months ago
- Source code and data for Like a Good Nearest Neighborβ28Updated this week
- Information extraction from English and German texts based on predicate logicβ135Updated last year
- β30Updated 2 years ago
- A spaCy custom component that extracts and normalizes temporal expressionsβ52Updated last year
- Augmenty is an augmentation library based on spaCy for augmenting texts.β151Updated 7 months ago
- classy is a simple-to-use library for building high-performance Machine Learning models in NLP.β86Updated last week
- β70Updated 2 years ago
- Data and evaluation code for the paper WikiNEuRal: Combined Neural and Knowledge-based Silver Data Creation for Multilingual NER (EMNLP 2β¦β66Updated last year
- π€ Disaggregators: Curated data labelers for in-depth analysis.β65Updated last year
- β22Updated 2 years ago
- Hashformers is a framework for hashtag segmentation with Transformers and Large Language Models (LLMs).β70Updated 4 months ago
- SpaCyEx allows the creation of spaCy Matcher patterns with RegEx like syntax.β57Updated 8 months ago
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidataβ156Updated 2 years ago
- π₯ Use Hugging Face text and token classification pipelines directly in spaCyβ63Updated 10 months ago
- Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP modelsβ¦β37Updated 2 years ago
- A Python library aimed at dissecting and augmenting NER training data.β57Updated last year
- Healthsea is a spaCy pipeline for analyzing user reviews of supplementary products for their effects on health.β88Updated 3 years ago
- spaCy pipeline component for generating spaCy KnowledgeBase Alias Candidates for Entity Linkingβ86Updated 2 years ago
- Our open source implementation of MiniLMv2 (https://aclanthology.org/2021.findings-acl.188)β60Updated last year
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality β¦β106Updated 10 months ago
- XAI based human-in-the-loop framework for automatic rule-learning.β47Updated 6 months ago
- Generate reports for spaCy models.β29Updated 2 years ago
- A library to synthesize text datasets using Large Language Models (LLM)β151Updated 2 years ago
- Sentence transformers models for SpaCyβ107Updated last year
- No Teacher BART distillation experiment for NLI tasksβ26Updated 4 years ago
- β42Updated last year