CLARIN-PL / LEPISZCZELinks
This is the way: designing and compiling LEPISZCZE, a comprehensive NLP benchmark for Polish
☆13Updated 2 years ago
Alternatives and similar repositories for LEPISZCZE
Users that are interested in LEPISZCZE are comparing it to the libraries listed below
Sorting:
- Embeddings: State-of-the-art Text Representations for Natural Language Processing tasks, an initial version of library focus on the Polis…☆36Updated 2 years ago
- This repository provides scripts for evaluating NLP models on the LEXTREME benchmark, a set of diverse multilingual tasks in legal NLP☆23Updated last year
- A Python library aimed at dissecting and augmenting NER training data.☆59Updated 2 years ago
- ☆113Updated last month
- A python package for benchmarking interpretability techniques on Transformers.☆214Updated last year
- ☆176Updated last year
- Datasets collection and preprocessings framework for NLP extreme multitask learning☆189Updated 5 months ago
- Bi-encoder entity linking architecture☆52Updated last year
- Pre-train Static Word Embeddings☆93Updated 3 months ago
- 🤗 Disaggregators: Curated data labelers for in-depth analysis.☆67Updated 2 years ago
- Generalist and Lightweight Model for Text Classification☆166Updated last week
- A library to synthesize text datasets using Large Language Models (LLM)☆152Updated 2 years ago
- A multi-lingual approach to AllenNLP CoReference Resolution along with a wrapper for spaCy.☆108Updated last year
- ☆56Updated 10 months ago
- Minimum Bayes Risk Decoding for Hugging Face Transformers☆60Updated last year
- The robust European language model benchmark.☆142Updated this week
- zero shot NER fine tuning☆13Updated 8 months ago
- Efficient few-shot learning with cross-encoders.☆60Updated last year
- Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: …☆338Updated 2 years ago
- Generalist and Lightweight Model for Relation Extraction (Extract any relationship types from text)☆251Updated 6 months ago
- RaKUn 2.0 - A fast keyword detection algorithm☆69Updated 4 months ago
- Coreference resolution for English, French, German and Polish, optimised for limited training data and easily extensible for further lang…☆130Updated last year
- Completion After Prompt Probability. Make your LLM make a choice☆82Updated last year
- A spaCy custom component that extracts and normalizes temporal expressions☆56Updated 2 years ago
- A PyTorch-based open-source framework that provides methods for improving the weakly annotated data and allows researchers to efficiently…☆108Updated last year
- LTG-Bert☆34Updated last year
- Fine-tuning scripts for evaluating transformer-based models on KLEJ benchmark.☆26Updated 2 years ago
- Source code and data for Like a Good Nearest Neighbor☆30Updated 11 months ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.☆96Updated 2 years ago
- GraphER: A Structure-aware Text-to-Graph Model for Entity and Relation Extraction