allegro / klejbenchmark-baselinesLinks
Fine-tuning scripts for evaluating transformer-based models on KLEJ benchmark.
☆26Updated 2 years ago
Alternatives and similar repositories for klejbenchmark-baselines
Users that are interested in klejbenchmark-baselines are comparing it to the libraries listed below
Sorting:
- RoBERTa models for Polish☆89Updated 3 years ago
- We introduce MKQA, an open-domain question answering evaluation set comprising 10k question-answer pairs aligned across 26 typologically …☆189Updated 3 years ago
- ☆50Updated 3 years ago
- Stanford's Alexa Prize socialbot☆133Updated 2 years ago
- ☆11Updated 5 years ago
- Embeddings: State-of-the-art Text Representations for Natural Language Processing tasks, an initial version of library focus on the Polis…☆36Updated 2 years ago
- Load What You Need: Smaller Multilingual Transformers for Pytorch and TensorFlow 2.0.☆105Updated 3 years ago
- Annotated corpus + evaluation metrics for text anonymisation☆70Updated 5 months ago
- XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scale☆157Updated 2 years ago
- Official implementation of the paper "CoEdIT: Text Editing by Task-Specific Instruction Tuning" (EMNLP 2023)☆135Updated last year
- An open-source text summarization toolkit for non-experts. EMNLP'2021 Demo☆280Updated 2 years ago
- Open source library for few shot NLP☆78Updated 2 years ago
- A library to synthesize text datasets using Large Language Models (LLM)☆152Updated 2 years ago
- The pipeline for the OSCAR corpus☆175Updated last month
- This repository contains the code for "Generating Datasets with Pretrained Language Models".☆189Updated 4 years ago
- A Python library aimed at dissecting and augmenting NER training data.☆59Updated 2 years ago
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' pu…☆41Updated 4 years ago
- Augmenty is an augmentation library based on spaCy for augmenting texts.☆156Updated last year
- Seahorse is a dataset for multilingual, multi-faceted summarization evaluation. It consists of 96K summaries with human ratings along 6 q…☆89Updated last year
- RaKUn 2.0 - A fast keyword detection algorithm☆69Updated 5 months ago
- Simply, faster, sentence-transformers☆143Updated last year
- Full named-entity (i.e., not tag/token) evaluation metrics based on SemEval’13☆199Updated 4 months ago
- [EMNLP-Findings 2020] Adapting BERT for Word Sense Disambiguation with Gloss Selection Objective and Example Sentences☆63Updated last year
- SQuARE: Software for question answering research.☆75Updated last year
- code and supplementary materials for a series of Medium articles about the BERT model☆77Updated 2 years ago
- DBMDZ BERT, DistilBERT, ELECTRA, GPT-2 and ConvBERT models☆156Updated 3 years ago
- SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset.☆47Updated 2 years ago
- MFAQ: a Multilingual FAQ Dataset☆18Updated 2 years ago
- Polish BERT☆72Updated 5 years ago
- Text tokenization and sentence segmentation (segtok v2)☆208Updated 3 years ago