divkakwani / awesome-newspapers
A Directory of Online Newspaper Sources for 70+ Languages
☆33Updated 3 years ago
Alternatives and similar repositories for awesome-newspapers:
Users that are interested in awesome-newspapers are comparing it to the libraries listed below
- A spaCy wrapper for DBpedia Spotlight☆109Updated 2 years ago
- Filter and format a newline-delimited JSON stream of Wikibase entities☆97Updated 5 months ago
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata☆158Updated 2 years ago
- A spaCy wrapper of OpenTapioca for named entity linking on Wikidata☆94Updated last year
- AfroLID, a powerful neural toolkit for African languages identification which covers 517 African languages.☆31Updated 2 weeks ago
- A Named-Entity Recogniser based on Grobid.☆51Updated 6 months ago
- Extract dates from text☆64Updated 4 years ago
- Index Common Crawl archives in tabular format☆112Updated 2 weeks ago
- Legal document classification with EuroVoc descriptors on 22 languages.☆25Updated last year
- Linguistic and stylistic complexity measures for (literary) texts☆80Updated last year
- Repository for "Towards Robust Named Entity Recognition for Historic German"☆18Updated 4 years ago
- RaKUn 2.0 - A fast keyword detection algorithm☆66Updated last month
- A library to extract a publication date from a web page, along with a measure of the accuracy.☆41Updated 5 years ago
- 💫 SpaCy wrapper for ConceptNet 💫☆90Updated last year
- Anonymization of legal cases (Fr) based on Flair embeddings☆88Updated 4 years ago
- In the wild extraction of entities that are found using Flair and displayed using a very elegant front-end.☆71Updated 2 years ago
- ANETAC: Arabic Named Entity Transliteration and Classification Dataset☆34Updated 5 years ago
- 🧪 Cutting-edge experimental spaCy components and features☆98Updated 11 months ago
- ☆64Updated 2 years ago
- An asynchronous concurrent pipeline for classifying Common Crawl based on fastText's pipeline.☆86Updated 3 years ago
- Stylometry library for Burrows' Delta method☆36Updated 10 months ago
- A machine learning tool for fishing entities☆263Updated this week
- Meta-repository for the open-source version of the SUMMA Platform☆16Updated last year
- Indian Language Tagger and Chunker (Hindi, Telugu, Tamil, Marathi, Punjabi, Kanada, Malayalam, Urdu, Bengali)☆40Updated 2 years ago
- Language Tool style grammar handling with spaCy 2.0☆42Updated 6 years ago
- ☆12Updated 3 years ago
- Identifying Historical People, Places and other Entities: Shared Task on Named Entity Recognition and Linking on Historical Newspapers at…☆22Updated 7 months ago
- Coreference resolution for English, French, German and Polish, optimised for limited training data and easily extensible for further lang…☆121Updated 11 months ago
- BERT and ELECTRA models trained on Europeana Newspapers☆37Updated 3 years ago
- Live survey of off-the-shelf language identification tools for python☆26Updated 2 years ago