A curated list of resources dedicated to Natural Language Processing (NLP) in Polish. Models, tools, datasets.
☆308Aug 8, 2021Updated 4 years ago
Alternatives and similar repositories for awesome-nlp-polish
Users that are interested in awesome-nlp-polish are comparing it to the libraries listed below.
- Pre-trained models and language resources for Natural Language Processing in Polish☆376Mar 2, 2026Updated 3 months ago
- Resources for doing NLP in Polish☆48Nov 4, 2019Updated 6 years ago
- Evaluation of Sentence Representations in Polish☆23Dec 29, 2022Updated 3 years ago
- RoBERTa models for Polish☆91Mar 8, 2022Updated 4 years ago
- HerBERT is a BERT-based Language Model trained on Polish Corpora using only MLM objective with dynamic masking of whole words.☆76Feb 3, 2022Updated 4 years ago
- ☆30Nov 22, 2022Updated 3 years ago
- Polish morphological tagger.☆45May 22, 2023Updated 3 years ago
- Lecture notes for 'Interpretable Machine Learning' at WUW and UW. Summer semester 2018/2019☆16Jun 18, 2019Updated 6 years ago
- Generic framework for information extraction tasks, including recognition of named entities, temporal expressions, spatial expressions an…☆13Jun 5, 2023Updated 3 years ago
- ☆18Aug 15, 2015Updated 10 years ago
- Python port of Stempel, an algorithmic stemmer for Polish language.☆39Aug 29, 2024Updated last year
- Embeddings: State-of-the-art Text Representations for Natural Language Processing tasks, an initial version of library focus on the Polis…☆37May 20, 2026Updated 2 weeks ago
- Tool for named entity recognition for Polish based on deep learning.☆31Mar 24, 2023Updated 3 years ago
- PolEval 2021 Task 1☆15Jun 28, 2022Updated 3 years ago
- Polish Dataset of Banned Harmful and Offensive Content from Wykop.pl web service☆61Jan 29, 2025Updated last year
- Polish datasets for grammatical error correction☆12Oct 13, 2023Updated 2 years ago
- Monitoring of AI Regulations☆19May 30, 2021Updated 5 years ago
- 💫 Industrial-strength Natural Language Processing (NLP) with Python and Cython☆11Nov 11, 2019Updated 6 years ago
- Shared BERT model for 4 languages of Bulgarian, Czech, Polish and Russian. Slavic NER model.☆79Jan 28, 2022Updated 4 years ago
- The 'bdl' package, prepared by Statistics Poland, is an interface to Local Data Bank (Bank Danych Lokalnych - bdl) API with set of useful…☆20May 25, 2023Updated 3 years ago
- A lightweight, hackable, and efficient framework for training and fine-tuning language models☆193Jun 1, 2026Updated last week
- ☆32Apr 20, 2026Updated last month
- Materials from seminars held at MI^2 DataLab.☆33May 27, 2026Updated last week
- Polish data.☆13May 6, 2026Updated last month
- A visualization of Warsaw public transport☆91Jan 8, 2023Updated 3 years ago
- Instruct-tune LLaMA on consumer hardware☆21Apr 2, 2023Updated 3 years ago
- Scripts for preprocessing morfologik data.☆41Dec 2, 2017Updated 8 years ago
- [ARCHIVED] Easily integrate rich ENS user journeys into your wallet, app, or game.☆20Jun 2, 2026Updated last week
- Inforex is a web system for text corpora construction.☆12Apr 29, 2026Updated last month
- Pipelines for Keras Deep Learning Library☆17Oct 24, 2017Updated 8 years ago
- Download datasets from Polish Head Office of Geodesy and Cartography☆35Apr 26, 2025Updated last year
- A very simple python stemmer for Polish language based on Porter's Algorithm☆20Dec 4, 2017Updated 8 years ago
- eXplainable Machine Learning 2022 at MIM UW☆20Jul 1, 2023Updated 2 years ago
- A baseline Automatic Speech Recognition system for Polish based on Kaldi.☆18Dec 21, 2021Updated 4 years ago
- A fast and highly accurate differentiable Top-k operator from the "Successive Halving Top-k Operator" AAAI'21 paper.☆16Jun 1, 2021Updated 5 years ago
- Utilities for manipulating finite state transducers with the OpenFst library.☆32Sep 22, 2017Updated 8 years ago