CLARIN-PL / LEPISZCZE
This is the way: designing and compiling LEPISZCZE, a comprehensive NLP benchmark for Polish
☆13 Updated last year
Alternatives and similar repositories for LEPISZCZE:
Users who are interested in LEPISZCZE are comparing it to the libraries listed below.
- Embeddings: State-of-the-art Text Representations for Natural Language Processing tasks, an initial version of library focus on the Polis… ☆36 Updated last year
- Bi-encoder entity linking architecture ☆44 Updated 4 months ago
- Pre-train Static Word Embeddings ☆34 Updated this week
- 🤗 Disaggregators: Curated data labelers for in-depth analysis. ☆65 Updated last year
- LTG-Bert ☆29 Updated last year
- Official implementation of "GPT or BERT: why not both?" ☆45 Updated 2 months ago
- A spaCy custom component that extracts and normalizes temporal expressions ☆52 Updated last year
- Source code and data for Like a Good Nearest Neighbor ☆28 Updated last week
- ☆51 Updated last year
- Ranking of fine-tuned HF models as base models. ☆35 Updated last year
- A weak supervision framework for (partial) labeling functions ☆15 Updated 6 months ago
- Efficient few-shot learning with cross-encoders. ☆42 Updated 11 months ago
- A Python library aimed at dissecting and augmenting NER training data. ☆57 Updated last year
- Code associated with the paper "Entropy-based Attention Regularization Frees Unintended Bias Mitigation from Lists" ☆47 Updated 2 years ago
- SeqScore: Scoring for named entity recognition and other sequence labeling tasks ☆22 Updated 2 weeks ago
- Embedding Recycling for Language models ☆38 Updated last year
- Experiments for XLM-V Transformers Integration ☆13 Updated last year
- This repository provides scripts for evaluating NLP models on the LEXTREME benchmark, a set of diverse multilingual tasks in legal NLP ☆21 Updated last year
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la… ☆45 Updated last year
- Truly flash T5 realization! ☆60 Updated 7 months ago
- Generalist and Lightweight Model for Text Classification ☆58 Updated 2 weeks ago
- Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP ☆58 Updated 2 years ago
- Simple-to-use scoring function for arbitrarily tokenized texts. ☆35 Updated this week
- Repository for "Attribute First, then Generate: Locally-attributable Grounded Text Generation", ACL 2024 ☆27 Updated last month
- An unofficial implementation of the Infini-gram model proposed by Liu et al. (2024) ☆29 Updated 7 months ago
- A tiny BERT for low-resource monolingual models ☆31 Updated 3 months ago
- ☆84 Updated 8 months ago
- RaKUn 2.0 - A fast keyword detection algorithm ☆64 Updated last month
- Minimal implementation of multiple PEFT methods for LLaMA fine-tuning ☆13 Updated last year
- Versatile framework designed to streamline the integration of your models, as well as those sourced from Hugging Face, into complex progr… ☆26 Updated 3 weeks ago