CLARIN-PL / PolDeepNer
Tool for named entity recognition for Polish based on deep learning.
☆31Updated 2 years ago
Alternatives and similar repositories for PolDeepNer:
Users that are interested in PolDeepNer are comparing it to the libraries listed below
- Generic framework for information extraction tasks, including recognition of named entities, temporal expressions, spatial expressions an…☆12Updated last year
- RoBERTa models for Polish☆86Updated 3 years ago
- Polish morphological tagger.☆43Updated last year
- Fine-tuning scripts for evaluating transformer-based models on KLEJ benchmark.☆26Updated last year
- Code and data accompanying the paper "Approaching nested named entity recognition with parallel LSTM-CRFs."☆26Updated 2 years ago
- Evaluation of Sentence Representations in Polish☆22Updated 2 years ago
- Polish RoBERTA model trained on Polish literature, Wikipedia, and Oscar. The major assumption is that quality text will give a good mode…☆34Updated 3 years ago
- Embeddings: State-of-the-art Text Representations for Natural Language Processing tasks, an initial version of library focus on the Polis…☆36Updated last year
- Polish data.☆11Updated 4 months ago
- ☆50Updated 2 years ago
- Polish BERT☆70Updated 4 years ago
- Pre-trained models and language resources for Natural Language Processing in Polish☆335Updated 9 months ago
- An asynchronous concurrent pipeline for classifying Common Crawl based on fastText's pipeline.☆86Updated 3 years ago
- COMBO is jointly trained tagger, lemmatizer and dependency parser.☆35Updated 2 years ago
- Shared BERT model for 4 languages of Bulgarian, Czech, Polish and Russian. Slavic NER model.☆73Updated 3 years ago
- xfspell — the Transformer Spell Checker☆189Updated 4 years ago
- spaCy + UDPipe☆161Updated 2 years ago
- Python-based implementation of the Translate-Align-Retrieve method to automatically translate the SQuAD Dataset to Spanish.☆59Updated 2 years ago
- Resources for doing NLP in Polish☆47Updated 5 years ago
- Text span utilities for Rust and Python☆21Updated 2 years ago
- A collection of scripts to preprocess ASR datasets and finetune language-specific Wav2Vec2 XLSR models☆31Updated 3 years ago
- Robust and Fast tokenizations alignment library for Rust and Python https://tamuhey.github.io/tokenizations/☆190Updated last year
- LASER multilingual sentence embeddings as a pip package☆224Updated last year
- A curated list of resources dedicated to Natural Language Processing (NLP) in polish. Models, tools, datasets.☆297Updated 3 years ago
- Semantic search using Transformers and others☆110Updated 4 years ago
- 🌸 fastText + Bloom embeddings for compact, full-coverage vectors with spaCy☆309Updated last year
- This is the way: designing and compiling LEPISZCZE, a comprehensive NLP benchmark for Polish☆13Updated last year
- A single model that parses Universal Dependencies across 75 languages. Given a sentence, jointly predicts part-of-speech tags, morphology…☆222Updated 2 years ago
- HerBERT is a BERT-based Language Model trained on Polish Corpora using only MLM objective with dynamic masking of whole words.☆67Updated 3 years ago
- Coreference resolution for English, French, German and Polish, optimised for limited training data and easily extensible for further lang…☆193Updated 2 years ago