A curated list of resources dedicated to Natural Language Processing (NLP) in Polish: models, tools, and datasets.
☆307Aug 8, 2021Updated 4 years ago
Alternatives and similar repositories for awesome-nlp-polish
Users that are interested in awesome-nlp-polish are comparing it to the libraries listed below.
- Pre-trained models and language resources for Natural Language Processing in Polish☆374Mar 2, 2026Updated 2 months ago
- Resources for doing NLP in Polish☆48Nov 4, 2019Updated 6 years ago
- Evaluation of Sentence Representations in Polish☆23Dec 29, 2022Updated 3 years ago
- RoBERTa models for Polish☆91Mar 8, 2022Updated 4 years ago
- Polish RoBERTa model trained on Polish literature, Wikipedia, and OSCAR. The major assumption is that quality text will give a good mode…☆35May 25, 2021Updated 4 years ago
- HerBERT is a BERT-based language model trained on Polish corpora using only the MLM objective with dynamic masking of whole words.☆75Feb 3, 2022Updated 4 years ago
- ☆30Nov 22, 2022Updated 3 years ago
- Polish morphological tagger.☆45May 22, 2023Updated 2 years ago
- ☆51Aug 22, 2022Updated 3 years ago
- Generic framework for information extraction tasks, including recognition of named entities, temporal expressions, spatial expressions an…☆13Jun 5, 2023Updated 2 years ago
- ☆18Aug 15, 2015Updated 10 years ago
- Embeddings: State-of-the-art Text Representations for Natural Language Processing tasks, an initial version of library focus on the Polis…☆37May 13, 2026Updated last week
- Tool for named entity recognition for Polish based on deep learning.☆31Mar 24, 2023Updated 3 years ago
- Fine-tuning scripts for evaluating transformer-based models on KLEJ benchmark.☆27Jul 6, 2023Updated 2 years ago
- PolEval 2021 Task 1☆15Jun 28, 2022Updated 3 years ago
- Code and data accompanying the paper "Approaching nested named entity recognition with parallel LSTM-CRFs."☆27Dec 8, 2022Updated 3 years ago
- Polish Dataset of Banned Harmful and Offensive Content from Wykop.pl web service☆61Jan 29, 2025Updated last year
- Polish datasets for grammatical error correction☆12Oct 13, 2023Updated 2 years ago
- Monitoring of AI Regulations☆19May 30, 2021Updated 4 years ago
- 💫 Industrial-strength Natural Language Processing (NLP) with Python and Cython☆11Nov 11, 2019Updated 6 years ago
- COMBO is a jointly trained tagger, lemmatizer, and dependency parser.☆36Mar 24, 2023Updated 3 years ago
- Shared BERT model for four languages: Bulgarian, Czech, Polish, and Russian. Slavic NER model.☆79Jan 28, 2022Updated 4 years ago
- ☆83May 31, 2024Updated last year
- Introduction to programming using R☆10Mar 14, 2024Updated 2 years ago
- The 'bdl' package, prepared by Statistics Poland, is an interface to Local Data Bank (Bank Danych Lokalnych - bdl) API with set of useful…☆20May 25, 2023Updated 2 years ago
- A lightweight, hackable, and efficient framework for training and fine-tuning language models☆192Updated this week
- ☆32Apr 20, 2026Updated last month
- A Tree-LSTM-based dependency tree sentiment labeler☆15May 9, 2019Updated 7 years ago
- Materials from seminars held at MI^2 DataLab.☆33May 11, 2026Updated last week
- ☆13May 22, 2020Updated 5 years ago
- 2020 Poland coronavirus data (COVID-19 / 2019-nCoV)☆19Dec 8, 2022Updated 3 years ago
- A visualization of Warsaw public transport☆91Jan 8, 2023Updated 3 years ago
- Automates downloading of meteorological and hydrological datasets from IMGW-PIB☆12Aug 11, 2020Updated 5 years ago
- Scripts for preprocessing Morfologik data.☆40Dec 2, 2017Updated 8 years ago
- Instruct-tune LLaMA on consumer hardware☆21Apr 2, 2023Updated 3 years ago
- GUI tools for WORLD vocoder☆21Dec 19, 2024Updated last year
- A very simple Python stemmer for the Polish language based on Porter's algorithm☆20Dec 4, 2017Updated 8 years ago
- R functions for computing indices of non-ignorable selection bias for non-probability samples.☆14Apr 11, 2025Updated last year
- Morfologik Polish Lemmatizer plugin for Elasticsearch☆99Apr 13, 2026Updated last month