A spaCy wrapper of OpenTapioca for named entity linking on Wikidata
☆96 · Feb 5, 2026 · Updated last month
Alternatives and similar repositories for spacyopentapioca
Users that are interested in spacyopentapioca are comparing it to the libraries listed below.
- spaCy pipeline component for generating spaCy KnowledgeBase Alias Candidates for Entity Linking ☆86 · Oct 6, 2022 · Updated 3 years ago
- Entity linking system for Wikidata updated by your edits in real time ☆261 · Dec 24, 2025 · Updated 3 months ago
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata ☆169 · Nov 7, 2022 · Updated 3 years ago
- Process, enhance and evaluate multiple OCR output. ☆24 · Dec 2, 2025 · Updated 3 months ago
- A spaCy wrapper for DBpedia Spotlight ☆112 · Mar 24, 2023 · Updated 3 years ago
- Check your modified Ground Truth files with visual support! ☆10 · Jan 31, 2024 · Updated 2 years ago
- Django SKOS-XL Thesaurus manager ☆13 · Oct 18, 2021 · Updated 4 years ago
- spaCy module for linking text to Wikidata items ☆243 · Mar 9, 2023 · Updated 3 years ago
- NLP-helper for OCR-ed pages in PAGE XML format ☆10 · Dec 6, 2024 · Updated last year
- tesseractXplore a tesseract ease of use gui with full control ☆28 · Nov 10, 2021 · Updated 4 years ago
- Repository for "Towards Robust Named Entity Recognition for Historic German" ☆18 · Dec 11, 2020 · Updated 5 years ago
- Knowledge graph construction: Fast inserts into a Wikibase instance ☆46 · Feb 3, 2022 · Updated 4 years ago
- Entity linking, entity typing and relation extraction: Matching CSV to a Wikibase instance (e.g., Wikidata) via Meta-lookup ☆70 · Jun 9, 2025 · Updated 9 months ago
- Keyword spaCy is a spaCy pipeline component for extracting keywords from text using cosine similarity. ☆13 · Dec 7, 2023 · Updated 2 years ago
- Reichsanzeiger-NLP: NER/NEL corpus for the German historical newspaper "Deutscher Reichsanzeiger und Preußischer Staatsanzeiger" (1819–19… ☆16 · Oct 18, 2024 · Updated last year
- This repository contains the Wikibase configuration of the EU Knowledge Graph ☆15 · Jun 6, 2025 · Updated 9 months ago
- Cloud and Kubernetes configuration for deployment for wbstack.com. You'll want to look at the wikibase.cloud deploy repository soon! ☆12 · Feb 9, 2024 · Updated 2 years ago
- Bagpipes spaCy is a collection of custom spaCy pipeline components designed to enhance text processing capabilities. ☆21 · Aug 15, 2024 · Updated last year
- GC4LM: A Colossal (Biased) language model for German ☆13 · May 2, 2021 · Updated 4 years ago
- SpikeX - SpaCy Pipes for Knowledge Extraction ☆403 · Jul 30, 2021 · Updated 4 years ago
- Named Entity Recognition ☆19 · Feb 13, 2026 · Updated last month
- Wikidata lexemes presentations ☆23 · Jan 30, 2026 · Updated last month
- A Python wrapper for the nascent hypothes.is web API ☆11 · Jan 28, 2026 · Updated last month
- Transcriptions of primers (19th century) ☆11 · Oct 31, 2025 · Updated 4 months ago
- A curated list of awesome RDM resources for researchers and organisations ☆30 · Mar 2, 2026 · Updated 3 weeks ago
- This repository has migrated to: ☆100 · Oct 11, 2025 · Updated 5 months ago
- Crop And Splice Segments (of scanned pages) ☆14 · Mar 11, 2019 · Updated 7 years ago
- A machine learning tool for fishing entities ☆270 · Feb 27, 2026 · Updated 3 weeks ago
- OCRopus model for Gothic print (Fraktur) ☆19 · Feb 16, 2020 · Updated 6 years ago
- NewsEye / READ OCR training dataset from Austrian Newspapers (1864–1911) ☆18 · Oct 31, 2025 · Updated 4 months ago
- Code release for Type-Aware Bi-Encoders for Open-Domain Entity Retrieval ☆19 · Sep 24, 2022 · Updated 3 years ago
- Corpus Annotation Graph builder (CAG) is an architectural framework that employs the build-and-annotate pattern for creating a graph. ☆14 · Dec 7, 2023 · Updated 2 years ago
- Efficient indexing and retrieval of OCR bounding boxes in Solr ☆22 · Mar 13, 2019 · Updated 7 years ago
- This repository contains an easy and intuitive approach to few-shot classification using sentence-transformers or spaCy models, or zero-s… ☆220 · Jan 20, 2025 · Updated last year
- This repository provides German documentation relating to the text recognition and transcription platform eScriptorium. The documentation… ☆14 · Dec 6, 2025 · Updated 3 months ago
- CHOLAN: A Modular Approach for Neural Entity Linking on Wikipedia and Wikidata ☆32 · Jan 21, 2022 · Updated 4 years ago
- Images of example pages from Transkribus model training sets to make it easier to find a match. ☆15 · Jan 25, 2022 · Updated 4 years ago
- An extensible viewer for OCR-D mets.xml files ☆23 · May 30, 2024 · Updated last year
- Specifications of the reconciliation API ☆39 · Nov 10, 2025 · Updated 4 months ago