allegro / klejbenchmark-baselines
Fine-tuning scripts for evaluating transformer-based models on the KLEJ benchmark.
☆26 · Updated 2 years ago
Alternatives and similar repositories for klejbenchmark-baselines
Users who are interested in klejbenchmark-baselines are comparing it to the libraries listed below.
- We introduce MKQA, an open-domain question answering evaluation set comprising 10k question-answer pairs aligned across 26 typologically … ☆186 · Updated 3 years ago
- Open source library for few shot NLP ☆79 · Updated 2 years ago
- RoBERTa models for Polish ☆87 · Updated 3 years ago
- SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset. ☆47 · Updated 2 years ago
- Load What You Need: Smaller Multilingual Transformers for Pytorch and TensorFlow 2.0. ☆105 · Updated 3 years ago
- Stanford's Alexa Prize socialbot ☆133 · Updated last year
- Evaluation of Sentence Representations in Polish ☆22 · Updated 2 years ago
- LASER multilingual sentence embeddings as a pip package ☆224 · Updated 2 years ago
- XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scale ☆155 · Updated last year
- SQuARE: Software for question answering research. ☆75 · Updated last year
- Polish BERT ☆72 · Updated 4 years ago
- This repository contains datasets and code for the paper "HINT3: Raising the bar for Intent Detection in the Wild" accepted at EMNLP-2020… ☆33 · Updated 4 years ago
- Simply, faster, sentence-transformers ☆143 · Updated last year
- Accelerated NLP pipelines for fast inference on CPU. Built with Transformers and ONNX runtime. ☆127 · Updated 4 years ago
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' pu… ☆41 · Updated 3 years ago
- Seahorse is a dataset for multilingual, multi-faceted summarization evaluation. It consists of 96K summaries with human ratings along 6 q… ☆89 · Updated last year
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality … ☆106 · Updated last year
- An open-source text summarization toolkit for non-experts. EMNLP'2021 Demo ☆279 · Updated last year
- A multilingual version of MS MARCO passage ranking dataset ☆144 · Updated last year
- Question-answers, collected from Google ☆129 · Updated 4 years ago
- The pipeline for the OSCAR corpus ☆171 · Updated last year
- Wikipedia text corpus for self-supervised NLP model training ☆44 · Updated 3 years ago
- This repository contains the code for "Generating Datasets with Pretrained Language Models". ☆188 · Updated 4 years ago
- Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP models… ☆36 · Updated 3 years ago
- A Python library aimed at dissecting and augmenting NER training data. ☆58 · Updated 2 years ago
- Dual Encoders for State-of-the-art Natural Language Processing. ☆61 · Updated 3 years ago