divkakwani / awesome-newspapersLinks
A Directory of Online Newspaper Sources for 70+ Languages
☆31Updated 4 years ago
Alternatives and similar repositories for awesome-newspapers
Users that are interested in awesome-newspapers are comparing it to the libraries listed below
Sorting:
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata☆169Updated 3 years ago
- A spaCy wrapper for DBpedia Spotlight☆112Updated 2 years ago
- ☆64Updated 2 years ago
- Extract dates from text☆66Updated 4 years ago
- AfroLID, a powerful neural toolkit for African languages identification which covers 517 African languages.☆34Updated 9 months ago
- A library to extract a publication date from a web page, along with a measure of the accuracy.☆41Updated 6 years ago
- Resources to go with the Indic NLP Library☆77Updated 3 years ago
- A spaCy wrapper of OpenTapioca for named entity linking on Wikidata☆95Updated 2 years ago
- Language independent truecaser in Python.☆159Updated 4 years ago
- Fast and accurate spell correction library☆80Updated 3 years ago
- Anonymization of legal cases (Fr) based on Flair embeddings☆87Updated 5 years ago
- BERT and ELECTRA models trained on Europeana Newspapers☆38Updated 4 years ago
- Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder.☆256Updated 3 years ago
- 🕸 GlotWeb: Web Indexing for Low-Resource Languages -- under construction.☆17Updated 4 months ago
- A Flexible Deep Learning Approach to Fuzzy String Matching☆148Updated last year
- Implementation of the ClausIE information extraction system for python+spacy☆226Updated 3 years ago
- Information extraction from English and German texts based on predicate logic☆139Updated 2 years ago
- Python tools for interacting with Wikidata☆159Updated 2 years ago
- Repository for "Towards Robust Named Entity Recognition for Historic German"☆18Updated 5 years ago
- spacy-wordnet creates annotations that easily allow the use of wordnet and wordnet domains by using the nltk wordnet interface☆261Updated 3 months ago
- Fast and robust date extraction from web pages, with Python or on the command-line☆142Updated last month
- A minimal, pure Python library to interface with CoNLL-U format files.☆153Updated 2 weeks ago
- A neural parsing pipeline for segmentation, morphological tagging, dependency parsing and lemmatization with pre-trained models for more …☆115Updated last year
- Meta-repository for the open-source version of the SUMMA Platform☆16Updated last year
- 🧪 Cutting-edge experimental spaCy components and features☆105Updated last year
- Fuzzy matching and more functionality for spaCy.☆259Updated last year
- In the wild extraction of entities that are found using Flair and displayed using a very elegant front-end.☆71Updated 2 years ago
- DKPro C4CorpusTools is a collection of tools for processing CommonCrawl corpus, including Creative Commons license detection, boilerplate…☆52Updated 5 years ago
- Language Tool style grammar handling with spaCy 2.0☆42Updated 7 years ago
- spaCy pipeline component for generating spaCy KnowledgeBase Alias Candidates for Entity Linking☆87Updated 3 years ago