CLARIN-PL / LEPISZCZELinks
This is the way: designing and compiling LEPISZCZE, a comprehensive NLP benchmark for Polish
☆13Updated last year
Alternatives and similar repositories for LEPISZCZE
Users that are interested in LEPISZCZE are comparing it to the libraries listed below
Sorting:
- Embeddings: State-of-the-art Text Representations for Natural Language Processing tasks, an initial version of library focus on the Polis…☆36Updated last year
- A python package for benchmarking interpretability techniques on Transformers.☆212Updated 8 months ago
- A Python library aimed at dissecting and augmenting NER training data.☆58Updated 2 years ago
- Pre-train Static Word Embeddings☆76Updated this week
- Interpretability for sequence generation models 🐛 🔍☆422Updated last month
- ☆91Updated last year
- zero shot NER fine tuning☆13Updated 2 months ago
- Evaluation of Sentence Representations in Polish☆22Updated 2 years ago
- LTG-Bert☆33Updated last year
- Datasets collection and preprocessings framework for NLP extreme multitask learning☆183Updated 5 months ago
- Minimum Bayes Risk Decoding for Hugging Face Transformers☆58Updated last year
- A multi-lingual approach to AllenNLP CoReference Resolution along with a wrapper for spaCy.☆106Updated last year
- Transformer Grammars: Augmenting Transformer Language Models with Syntactic Inductive Biases at Scale, TACL (2022)☆126Updated 7 months ago
- TimeLMs: Diachronic Language Models from Twitter☆107Updated last year
- The robust European language model benchmark.☆104Updated this week
- A spaCy custom component that extracts and normalizes temporal expressions☆54Updated 2 years ago
- RoBERTa models for Polish☆87Updated 3 years ago
- Coreference resolution for English, French, German and Polish, optimised for limited training data and easily extensible for further lang…☆122Updated last year
- ☆65Updated last year
- Simple-to-use scoring function for arbitrarily tokenized texts.☆40Updated 3 months ago
- A Scandinavian Benchmark for sentence embeddings☆38Updated 2 weeks ago
- Polish RoBERTA model trained on Polish literature, Wikipedia, and Oscar. The major assumption is that quality text will give a good mode…☆35Updated 4 years ago
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o…☆136Updated 2 weeks ago
- FastFit ⚡ When LLMs are Unfit Use FastFit ⚡ Fast and Effective Text Classification with Many Classes☆206Updated last month
- Trully flash implementation of DeBERTa disentangled attention mechanism.☆56Updated 3 weeks ago
- Tool for named entity recognition for Polish based on deep learning.☆31Updated 2 years ago
- CLIR version of ColBERT☆67Updated last month
- ☆161Updated 11 months ago
- A BERT-based application for reusable text classification at scale☆38Updated last year
- Optimus is a flexible and scalable framework built to train language models efficiently across diverse hardware configurations, including…☆56Updated last month