ZILiAT-NASK / BAN-PLLinks
Polish Dataset of Banned Harmful and Offensive Content from Wykop.pl web service
☆55Updated 5 months ago
Alternatives and similar repositories for BAN-PL
Users that are interested in BAN-PL are comparing it to the libraries listed below
Sorting:
- Pre-trained models and language resources for Natural Language Processing in Polish☆345Updated last year
- A curated list of resources dedicated to Natural Language Processing (NLP) in polish. Models, tools, datasets.☆301Updated 3 years ago
- HerBERT is a BERT-based Language Model trained on Polish Corpora using only MLM objective with dynamic masking of whole words.☆67Updated 3 years ago
- ☆50Updated 2 years ago
- Resources for doing NLP in Polish☆47Updated 5 years ago
- The robust European language model benchmark.☆110Updated last week
- RoBERTa models for Polish☆87Updated 3 years ago
- HuSpaCy: industrial-strength Hungarian natural language processing☆169Updated 8 months ago
- Popular stopwords for general languages - very usefull for building dictionaries, searchers or text indexes☆45Updated 11 years ago
- A Scandinavian Benchmark for sentence embeddings☆39Updated last month
- A Simple Bulk Labelling Tool☆589Updated 6 months ago
- Instruct-tune LLaMA on consumer hardware☆21Updated 2 years ago
- A curated list of NLP resources for Hungarian☆250Updated 3 months ago
- The most extensive open massively multilingual corpus of datasets for training sentiment models. The corpus consists of 79 manually selec…☆16Updated last year
- The website for Danish Foundation Models, a project for training foundational Danish language model.☆74Updated last month
- A Python library for calculating a large variety of metrics from text☆341Updated 7 months ago
- Polish RoBERTA model trained on Polish literature, Wikipedia, and Oscar. The major assumption is that quality text will give a good mode…☆35Updated 4 years ago
- just a bunch of useful embeddings for scikit-learn pipelines☆502Updated 3 months ago
- An easy to use python package for deep learning-based german sentiment classification.☆59Updated 2 years ago
- Official source for spanish Language Models and resources made @ BSC-TEMU within the "Plan de las Tecnologías del Lenguaje" (Plan-TL).☆259Updated last year
- ☆79Updated last year
- StyloMetrix☆43Updated 11 months ago
- Evaluation of Sentence Representations in Polish☆22Updated 2 years ago
- A data set and model for german sentiment classification.☆67Updated last month
- A list of awesome open source projects in the machine learning field, who's developers are mainly based in Germany☆43Updated 10 months ago
- Gain clues from clustering!☆316Updated last year
- Fine-tuning scripts for evaluating transformer-based models on KLEJ benchmark.☆26Updated 2 years ago
- Norwegian Transformer Model☆116Updated 7 months ago
- Active Learning for Text Classification in Python☆618Updated 3 weeks ago
- skweak: A software toolkit for weak supervision applied to NLP tasks☆926Updated 10 months ago