explosion / wikid
Generate a SQLite database from Wikipedia & Wikidata dumps.
☆33 Updated last year
Alternatives and similar repositories for wikid:
Users who are interested in wikid are comparing it to the libraries listed below:
- spaCy-wrap is a wrapper library for spaCy for including fine-tuned transformers from Huggingface in your spaCy pipeline allowing you to i… ☆46 Updated last year
- 🧪 Cutting-edge experimental spaCy components and features ☆98 Updated 11 months ago
- spaCy wrapper for ConceptNet ☆92 Updated last year
- A Prodigy plugin for evaluating spaCy pipelines ☆13 Updated last year
- Augmenty is an augmentation library based on spaCy for augmenting texts. ☆153 Updated 10 months ago
- ☆46 Updated 2 years ago
- A spaCy custom component that extracts and normalizes temporal expressions ☆54 Updated 2 years ago
- Data and evaluation code for the paper WikiNEuRal: Combined Neural and Knowledge-based Silver Data Creation for Multilingual NER (EMNLP 2… ☆67 Updated 2 years ago
- Use Hugging Face text and token classification pipelines directly in spaCy ☆63 Updated last year
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata ☆161 Updated 2 years ago
- Information extraction from English and German texts based on predicate logic ☆135 Updated last year
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models. ☆108 Updated 10 months ago
- A Python library aimed at dissecting and augmenting NER training data. ☆58 Updated last year
- ☆30 Updated 2 years ago
- Train floret vectors ☆18 Updated last year
- Python package for deduplication/entity resolution using active learning ☆78 Updated 7 months ago
- allennlp-light is a port of AllenNLP's core modules and nn portions into a standalone package with minimum dependencies ☆56 Updated 2 years ago
- Lightweight piece tokenization library ☆12 Updated last year
- Semantically Structured Sentence Embeddings ☆65 Updated 5 months ago
- spaCy entry points for Curated Transformers ☆29 Updated 6 months ago
- A multi-lingual approach to AllenNLP CoReference Resolution along with a wrapper for spaCy. ☆106 Updated 11 months ago
- ☆70 Updated 2 years ago
- ☆26 Updated last month
- Hashformers is a framework for hashtag segmentation with Transformers and Large Language Models (LLMs). ☆70 Updated 7 months ago
- XAI based human-in-the-loop framework for automatic rule-learning. ☆48 Updated 9 months ago
- ☆22 Updated 3 years ago
- spaCy pipeline component for generating spaCy KnowledgeBase Alias Candidates for Entity Linking ☆85 Updated 2 years ago
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' pu… ☆40 Updated 3 years ago
- Work with static vector models ☆24 Updated 2 months ago
- Code for Relevance-guided Supervision for OpenQA with ColBERT (TACL'21) ☆41 Updated 3 years ago