divkakwani / awesome-newspapers
A Directory of Online Newspaper Sources for 70+ Languages
☆28 Updated 3 years ago
Related projects
Alternatives and complementary repositories for awesome-newspapers
- A spaCy wrapper for DBpedia Spotlight ☆105 Updated last year
- Language Tool style grammar handling with spaCy 2.0 ☆42 Updated 6 years ago
- A library to extract a publication date from a web page, along with a measure of the accuracy. ☆42 Updated 5 years ago
- Identifying Historical People, Places and other Entities: Shared Task on Named Entity Recognition and Linking on Historical Newspapers at… ☆22 Updated 3 months ago
- Anonymization of legal cases (Fr) based on Flair embeddings ☆87 Updated 3 years ago
- In the wild extraction of entities that are found using Flair and displayed using a very elegant front-end. ☆69 Updated last year
- A spaCy wrapper of OpenTapioca for named entity linking on Wikidata ☆91 Updated last year
- ☆64 Updated last year
- 🧪 Cutting-edge experimental spaCy components and features ☆95 Updated 7 months ago
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata ☆153 Updated 2 years ago
- The Wikinflection Corpus, from the paper "Wikinflection Corpus: A (Better) Multilingual, Morpheme-Annotated Inflectional Corpus" (Metheni… ☆11 Updated 11 months ago
- A minimal, pure Python library to interface with CoNLL-U format files. ☆149 Updated last year
- Extract dates from text ☆64 Updated 3 years ago
- A neural network that jointly part-of-speech tags and lemmatizes sentences, boosting accuracy for morphologically-rich languages (Czech, … ☆34 Updated 5 years ago
- Dataset of ML and NLP papers ☆35 Updated 2 years ago
- BERT and ELECTRA models trained on Europeana Newspapers ☆36 Updated 2 years ago
- Trying to generate name synonyms from wikidata ☆33 Updated 4 years ago
- Measure the readability of a given text using surface characteristics ☆72 Updated last year
- spaCy + UDPipe ☆161 Updated 2 years ago
- Language independent truecaser in Python. ☆161 Updated 3 years ago
- Live survey of off-the-shelf language identification tools for python ☆26 Updated 2 years ago
- Repository for "Towards Robust Named Entity Recognition for Historic German" ☆18 Updated 3 years ago
- Code and models for our CLEF-HIPE (Named Entity Processing on Historical Newspapers) submissions ☆19 Updated last year
- A simple web application for searching Word2Vec embeddings derived from approximately 2,000 law reports published by The Incorporated… ☆25 Updated 2 years ago
- Fast and robust date extraction from web pages, with Python or on the command-line ☆122 Updated last week
- Python Multilingual Ucrel Semantic Analysis System ☆30 Updated 3 months ago
- A python module for evaluating NERC and NEL system performances as defined in the HIPE shared tasks (formerly CLEF-HIPE-2020-scorer). ☆13 Updated 5 months ago
- coFR: COreference resolution tool for FRench (and singletons). ☆24 Updated 4 years ago
- FoLiA Linguistic Annotation Tool -- Flat is a web-based linguistic annotation environment based around the FoLiA format (http://proycon.g… ☆110 Updated 4 months ago
- DKPro C4CorpusTools is a collection of tools for processing CommonCrawl corpus, including Creative Commons license detection, boilerplate… ☆50 Updated 4 years ago