bbc / citron
Citron is an experimental quote extraction system created by BBC R&D
☆35 · Updated 3 years ago
Alternatives and similar repositories for citron
Users who are interested in citron are comparing it to the libraries listed below.
- Libraries, Archives and Museums (LAM) ☆87 · Updated 3 years ago
- Full text geoparsing/toponym resolution with event geolocation ☆78 · Updated 2 weeks ago
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata ☆164 · Updated 2 years ago
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy ☆63 · Updated last year
- Inspect Element is a practitioner's guide to auditing algorithms and data-driven investigations ☆37 · Updated 3 months ago
- A BERT-based application for reusable text classification at scale ☆38 · Updated 2 years ago
- Next-generation Punkt sentence boundary detection with zero dependencies ☆18 · Updated 2 months ago
- A collection of open source tools and resources related to Wikibase knowledge graphs ☆72 · Updated last month
- Poetic processing, for Python. ☆42 · Updated last year
- Extract networks of entities from journalistic reporting ☆48 · Updated 2 years ago
- ETL pipeline, graphical explorer and general toolbox for investigations with follow the money data ☆23 · Updated 3 months ago
- A spaCy wrapper of OpenTapioca for named entity linking on Wikidata ☆94 · Updated 2 years ago
- 🗺️ Data Cleaning and Textual Data Visualization 🗺️ ☆189 · Updated 4 months ago
- Tools for interactive visual exploration of semantic embeddings. ☆38 · Updated last year
- CLI for loading Wikidata subsets (or all of it) into Elasticsearch ☆70 · Updated 3 years ago
- Neural Language Models for Historical Research ☆29 · Updated last year
- SPINACH: SPARQL-Based Information Navigation for Challenging Real-World Questions ☆52 · Updated 6 months ago
- 📜 Dehyphenation of broken text (mainly German), i.e., extracted from a PDF ☆39 · Updated 3 years ago
- Python Multilingual Ucrel Semantic Analysis System ☆31 · Updated last year
- A Flexible Deep Learning Approach to Fuzzy String Matching ☆148 · Updated last year
- A Python module for clustering creators of social media content into networks ☆73 · Updated 3 years ago
- spaCy-wrap is a wrapper library for spaCy for including fine-tuned transformers from Huggingface in your spaCy pipeline allowing you to i… ☆46 · Updated last year
- Crowd-sourced lists of urls to help Common Crawl crawl under-resourced languages. See https://github.com/commoncrawl/web-languages-code/ … ☆58 · Updated last week
- Find legal citations in any block of text ☆176 · Updated 2 weeks ago
- Tools for downloading agendas, minutes and other documents produced by local government ☆56 · Updated last week
- Command Line Interface for running 🤗 Transformers Image Classification locally ☆19 · Updated 5 months ago
- A general purpose tool for text-based crosswalking ☆107 · Updated last year
- Swedish parliamentary proceedings - Riksdagens protokoll 1867-today ☆26 · Updated last year
- A collection of notebooks for Natural Language Processing ☆25 · Updated 9 months ago
- Filter and format a newline-delimited JSON stream of Wikibase entities ☆103 · Updated last month