allegro / klejbenchmark-baselinesLinks
Fine-tuning scripts for evaluating transformer-based models on KLEJ benchmark.
☆26Updated last year
Alternatives and similar repositories for klejbenchmark-baselines
Users that are interested in klejbenchmark-baselines are comparing it to the libraries listed below
Sorting:
- ☆50Updated 2 years ago
- RoBERTa models for Polish☆87Updated 3 years ago
- Evaluation of Sentence Representations in Polish☆22Updated 2 years ago
- Tool for named entity recognition for Polish based on deep learning.☆31Updated 2 years ago
- Polish RoBERTA model trained on Polish literature, Wikipedia, and Oscar. The major assumption is that quality text will give a good mode…☆35Updated 4 years ago
- Polish BERT☆70Updated 4 years ago
- Embeddings: State-of-the-art Text Representations for Natural Language Processing tasks, an initial version of library focus on the Polis…☆36Updated last year
- Code and data accompanying the paper "Approaching nested named entity recognition with parallel LSTM-CRFs."☆26Updated 2 years ago
- Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP models…☆36Updated 3 years ago
- A Python library aimed at dissecting and augmenting NER training data.☆58Updated 2 years ago
- Shared code for training sentence embeddings with Flax / JAX☆27Updated 3 years ago
- SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset.☆45Updated last year
- MFAQ: a Multilingual FAQ Dataset☆17Updated last year
- We introduce MKQA, an open-domain question answering evaluation set comprising 10k question-answer pairs aligned across 26 typologically …☆181Updated 2 years ago
- 🤗 Disaggregators: Curated data labelers for in-depth analysis.☆66Updated 2 years ago
- COMBO is jointly trained tagger, lemmatizer and dependency parser.☆35Updated 2 years ago
- ☆43Updated 2 years ago
- As good as new. How to successfully recycle English GPT-2 to make models for other languages (ACL Findings 2021)☆48Updated 3 years ago
- ☆87Updated 3 years ago
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality …☆106Updated last year
- A monolingual and cross-lingual meta-embedding generation and evaluation framework☆80Updated 3 years ago
- Library and command line utility to do approximate string matching of a source against a bitext index and get matched source and target.☆50Updated last month
- OpusFilter - Parallel corpus processing toolkit☆104Updated 2 months ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.☆93Updated 2 years ago
- A High-level Library for Named Entity Recognition in Python.☆23Updated last year
- SQuARE: Software for question answering research.☆75Updated 11 months ago
- Tutorial on how to convert machine learned models into ONNX☆16Updated 2 years ago
- Helper scripts and notes that were used while porting various nlp models☆46Updated 3 years ago
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.☆157Updated 11 months ago
- INCOME: An Easy Repository for Training and Evaluation of Index Compression Methods in Dense Retrieval. Includes BPR and JPQ.☆24Updated last year