bbc / citron
Citron is an experimental quote extraction system created by BBC R&D
☆35Updated 3 years ago
Alternatives and similar repositories for citron
Users who are interested in citron compare it to the libraries listed below
- Libraries, Archives and Museums (LAM)☆88Updated 3 years ago
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata☆169Updated 3 years ago
- Annif is a multi-algorithm automated subject indexing tool for libraries, archives and museums.☆245Updated this week
- Bagpipes spaCy is a collection of custom spaCy pipeline components designed to enhance text processing capabilities.☆21Updated last year
- Command Line Interface for running 🤗 Transformers Image Classification locally☆19Updated 6 months ago
- Full text geoparsing/toponym resolution with event geolocation☆80Updated 3 weeks ago
- Poetic processing, for Python.☆42Updated last year
- Tools for interactive visual exploration of semantic embeddings.☆39Updated last year
- A spaCy wrapper of OpenTapioca for named entity linking on Wikidata☆94Updated 2 years ago
- A Flexible Deep Learning Approach to Fuzzy String Matching☆147Updated last year
- Collection de romans français du dix-huitième siècle (1751-1800) / Collection of Eighteenth-Century French Novels (1751-1800)☆23Updated last year
- Code and models for our CLEF-HIPE (Named Entity Processing on Historical Newspapers) submissions☆19Updated 2 years ago
- ETL pipeline, graphical explorer and general toolbox for investigations with Follow the Money data☆23Updated 4 months ago
- Group thousands of similar spreadsheet or database text entries in seconds☆157Updated 2 years ago
- Inspect Element is a practitioner's guide to auditing algorithms and data-driven investigations☆37Updated 4 months ago
- A collection of open source tools and resources related to Wikibase knowledge graphs☆73Updated 2 months ago
- ☆26Updated last year
- A BERT-based application for reusable text classification at scale☆38Updated 2 years ago
- CLI for loading Wikidata subsets (or all of it) into Elasticsearch☆70Updated 3 years ago
- A Python library for topic modeling and visualization☆66Updated 5 years ago
- Extract networks of entities from journalistic reporting☆48Updated 2 years ago
- Dataset: BuzzFeed News “Trending” Strip, 2018–2023☆19Updated 2 years ago
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy☆63Updated last year
- Special Topics in AI: Artificial Intelligence as an Archival Science☆18Updated last year
- Browser-based app for segmenting & OCRing PDF pages based on whitespace rules. To assist researchers (especially in the humanities) with …☆12Updated last year
- Citation Classification using hybrid neural network model for Wikipedia References☆31Updated 2 years ago
- A general purpose tool for text-based crosswalking☆108Updated last year
- Named-Entity Recognition extension for OpenRefine☆29Updated 2 years ago
- Python based Wikidata framework for easy dataframe extraction☆45Updated last year
- An experiment replicating part of "Why Literary Time is Measured in Minutes" with GPT-4.☆34Updated 2 years ago