ZILiAT-NASK / BAN-PL
Polish Dataset of Banned Harmful and Offensive Content from Wykop.pl web service
☆50Updated last week
Alternatives and similar repositories for BAN-PL:
Users that are interested in BAN-PL are comparing it to the libraries listed below
- Pre-trained models and language resources for Natural Language Processing in Polish☆331Updated 7 months ago
- A curated list of resources dedicated to Natural Language Processing (NLP) in polish. Models, tools, datasets.☆294Updated 3 years ago
- HerBERT is a BERT-based Language Model trained on Polish Corpora using only MLM objective with dynamic masking of whole words.☆66Updated 2 years ago
- ☆64Updated 7 months ago
- RoBERTa models for Polish☆86Updated 2 years ago
- Polish RoBERTA model trained on Polish literature, Wikipedia, and Oscar. The major assumption is that quality text will give a good mode…☆34Updated 3 years ago
- A versatile and powerful library designed to streamline the process of querying LLMs☆76Updated 3 weeks ago
- Archiwum wszystkich wydań newslettera unknowNews☆181Updated last week
- Program służący do masowego pobierania ksiąg wieczystych z serwisu ekw.ms.gov.pl☆166Updated 2 months ago
- Lista darmowego, wolnego oraz otwartego oprogramowania na Windowsa, Linuxa oraz Androida☆184Updated 3 years ago
- Skrypty, tutoriale oraz programistyczna baza wiedzy dotycząca pracy z modelem Bielik.☆62Updated this week
- Evaluation of Sentence Representations in Polish☆22Updated 2 years ago
- ☆50Updated 2 years ago
- Polish datsets for grammatical error correction☆12Updated last year
- Embeddings: State-of-the-art Text Representations for Natural Language Processing tasks, an initial version of library focus on the Polis…☆36Updated last year
- Dodej trocha wůnglu do swojygo gita!☆88Updated 3 years ago
- Fine-tuning scripts for evaluating transformer-based models on KLEJ benchmark.☆25Updated last year
- Instruct-tune LLaMA on consumer hardware☆21Updated last year
- Zbiór wszystkich prezentacji, które zostały przedstawione na spotkaniach grupy CORE☆8Updated 2 years ago
- StyloMetrix☆36Updated 5 months ago
- BTSearch v2 website source code☆63Updated 6 months ago
- Podlaskie aliasy dla gitowych komend☆684Updated 2 years ago
- ☆76Updated last year
- ☆35Updated this week
- Kolekcja skryptów do szybkiego stawiania usług na serwerach Mikrusa☆267Updated 2 months ago
- LaTeX template for engineer and master thesis for Warsaw University of Technology.☆216Updated last year
- Popular stopwords for general languages - very usefull for building dictionaries, searchers or text indexes☆45Updated 11 years ago
- Seminarium Magisterskie Machine Learning☆24Updated last month
- List of disposable email domains. You can use it to block fake users on your newsletter.☆145Updated last week
- Evaluation of language models on mono- or multilingual tasks.☆76Updated this week