bbc / citronLinks
Citron is an experimental quote extraction system created by BBC R&D
☆33Updated 3 years ago
Alternatives and similar repositories for citron
Users that are interested in citron are comparing it to the libraries listed below
Sorting:
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata☆164Updated 2 years ago
- Libraries, Archives and Museums (LAM)☆84Updated 2 years ago
- Poetic processing, for Python.☆42Updated last year
- Next-generation Punkt sentence boundary detection with zero dependencies☆17Updated 3 months ago
- Full text geoparsing/toponym resolution with event geolocation☆77Updated last month
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy☆63Updated last year
- Dataset: BuzzFeed News “Trending” Strip, 2018–2023☆19Updated 2 years ago
- ParlaMint: Comparable Parliamentary Corpora☆62Updated 2 weeks ago
- Bagpipes spaCy is a collection of custom spaCy pipeline components designed to enhance text processing capabilities.☆18Updated 11 months ago
- Python based Wikidata framework for easy dataframe extraction☆45Updated last year
- CLI for loading Wikidata subsets (or all of it) into Elasticsearch☆70Updated 3 years ago
- Inspect Element is a practitioner's guide to auditing algorithms and data-driven investigations☆36Updated 3 weeks ago
- Literary Language Toolkit: code, models, corpora, and web tools☆11Updated last year
- A BERT-based application for reusable text classification at scale☆38Updated 2 years ago
- A simple tool for splitting up an ebook into its chapters. Works well with Project Gutenberg texts. May also be used to clean up books fo…☆109Updated 6 years ago
- Browser-based app for segmenting & OCRing PDF pages based on whitespace rules. To assist researchers (especially in the humanities) with …☆12Updated last year
- Named Entity Disambiguation and Linking☆16Updated last year
- Homebase of the IPTC EXTRA project about rule-based text categorization☆13Updated 8 years ago
- Code and models for our CLEF-HIPE (Named Entity Processing on Historical Newspapers) submissions☆19Updated 2 years ago
- Group thousands of similar spreadsheet or database text entries in seconds☆156Updated 2 years ago
- A collection of notebooks for Natural Language Processing☆25Updated 6 months ago
- Command Line Interface for running 🤗 Transformers Image Classification locally☆19Updated 2 months ago
- A Flexible Deep Learning Approach to Fuzzy String Matching☆146Updated 9 months ago
- A Python module for clustering creators of social media content into networks☆73Updated 3 years ago
- Annif is a multi-algorithm automated subject indexing tool for libraries, archives and museums.☆236Updated this week
- Rhythm analysis toolkit in Python☆12Updated last year
- Neural Language Models for Historical Research☆28Updated 9 months ago
- Find legal citations in any block of text☆161Updated last month
- A spaCy wrapper of OpenTapioca for named entity linking on Wikidata☆94Updated 2 years ago
- ☆25Updated 10 months ago