allegro / klejbenchmark-baselinesLinks
Fine-tuning scripts for evaluating transformer-based models on KLEJ benchmark.
☆26Updated 2 years ago
Alternatives and similar repositories for klejbenchmark-baselines
Users that are interested in klejbenchmark-baselines are comparing it to the libraries listed below
Sorting:
- RoBERTa models for Polish☆88Updated 3 years ago
 - Load What You Need: Smaller Multilingual Transformers for Pytorch and TensorFlow 2.0.☆104Updated 3 years ago
 - Simply, faster, sentence-transformers☆143Updated last year
 - XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scale☆156Updated last year
 - This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' pu…☆41Updated 3 years ago
 - Forte is a flexible and powerful ML workflow builder. This is part of the CASL project: http://casl-project.ai/☆250Updated last year
 - [EMNLP-Findings 2020] Adapting BERT for Word Sense Disambiguation with Gloss Selection Objective and Example Sentences☆63Updated last year
 - ☆11Updated 4 years ago
 - Augmenty is an augmentation library based on spaCy for augmenting texts.☆156Updated last year
 - Seahorse is a dataset for multilingual, multi-faceted summarization evaluation. It consists of 96K summaries with human ratings along 6 q…☆89Updated last year
 - Label data using HuggingFace's transformers and automatically get a prediction service☆193Updated 2 years ago
 - Wikipedia text corpus for self-supervised NLP model training☆46Updated 3 years ago
 - Accelerated NLP pipelines for fast inference on CPU. Built with Transformers and ONNX runtime.☆127Updated 4 years ago
 - SQuARE: Software for question answering research.☆75Updated last year
 - The pipeline for the OSCAR corpus☆173Updated last year
 - A collection of task-specific NLU datasets☆159Updated 3 years ago
 - We introduce MKQA, an open-domain question answering evaluation set comprising 10k question-answer pairs aligned across 26 typologically …☆187Updated 3 years ago
 - Ten Thousand German News Articles Dataset for Topic Classification☆86Updated 2 years ago
 - RaKUn 2.0 - A fast keyword detection algorithm☆68Updated 2 months ago
 - An open-source text summarization toolkit for non-experts. EMNLP'2021 Demo☆280Updated 2 years ago
 - This repository contains the code for "Generating Datasets with Pretrained Language Models".☆189Updated 4 years ago
 - Annotated corpus + evaluation metrics for text anonymisation☆70Updated 3 months ago
 - A monolingual and cross-lingual meta-embedding generation and evaluation framework☆79Updated 3 years ago
 - Passive/Active sentence Transformer☆28Updated 7 years ago
 - Code accompanying the submission "Structural Text Segmentation of Legal Documents" by Aumiller et al.☆98Updated 2 years ago
 - Data and evaluation code for the paper WikiNEuRal: Combined Neural and Knowledge-based Silver Data Creation for Multilingual NER (EMNLP 2…☆69Updated 2 years ago
 - Main repository for "CharacterBERT: Reconciling ELMo and BERT for Word-Level Open-Vocabulary Representations From Characters"☆201Updated 2 years ago
 - A spaCy custom component that extracts and normalizes temporal expressions☆55Updated 2 years ago
 - Stanford's Alexa Prize socialbot☆133Updated 2 years ago
 - SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset.☆47Updated 2 years ago