CLARIN-PL / LEPISZCZE
This is the way: designing and compiling LEPISZCZE, a comprehensive NLP benchmark for Polish
☆13 · Updated last year
Alternatives and similar repositories for LEPISZCZE
Users who are interested in LEPISZCZE are comparing it to the libraries listed below.
- Embeddings: State-of-the-art Text Representations for Natural Language Processing tasks, an initial version of library focus on the Polis… ☆36 · Updated last year
- This repository provides scripts for evaluating NLP models on the LEXTREME benchmark, a set of diverse multilingual tasks in legal NLP ☆23 · Updated last year
- Datasets collection and preprocessings framework for NLP extreme multitask learning ☆188 · Updated 3 months ago
- A python package for benchmarking interpretability techniques on Transformers. ☆212 · Updated last year
- ☆105 · Updated last week
- ☆169 · Updated last year
- Generalist and Lightweight Model for Text Classification ☆163 · Updated 4 months ago
- 💫 SpaCy wrapper for ConceptNet 💫 ☆95 · Updated 2 years ago
- SpanMarker for Named Entity Recognition ☆460 · Updated 9 months ago
- Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: … ☆338 · Updated 2 years ago
- A Python library aimed at dissecting and augmenting NER training data. ☆59 · Updated 2 years ago
- A library to synthesize text datasets using Large Language Models (LLM) ☆151 · Updated 2 years ago
- FastFit ⚡ When LLMs are Unfit Use FastFit ⚡ Fast and Effective Text Classification with Many Classes ☆212 · Updated last month
- 💬 Language Identification with Support for More Than 2000 Labels -- EMNLP 2023 ☆163 · Updated 4 months ago
- A multi-lingual approach to AllenNLP CoReference Resolution along with a wrapper for spaCy. ☆107 · Updated last year
- Completion After Prompt Probability. Make your LLM make a choice ☆80 · Updated last year
- Pre-train Static Word Embeddings ☆89 · Updated last month
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models. ☆110 · Updated last year
- git extension for {collaborative, communal, continual} model development ☆215 · Updated 11 months ago
- Augmenty is an augmentation library based on spaCy for augmenting texts. ☆156 · Updated last year
- Repo for the paper "Detecting Logical Fallacies: From Quiz to Climate Change News" (2021) ☆81 · Updated last year
- Annotated corpus + evaluation metrics for text anonymisation ☆70 · Updated 3 months ago
- RaKUn 2.0 - A fast keyword detection algorithm ☆68 · Updated 2 months ago
- This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with enti… ☆244 · Updated 2 years ago
- Topic modeling helpers using managed language models from Cohere. Name text clusters using large GPT models. ☆223 · Updated 2 years ago
- A spaCy custom component that extracts and normalizes temporal expressions ☆55 · Updated 2 years ago
- Full named-entity (i.e., not tag/token) evaluation metrics based on SemEval’13 ☆197 · Updated last month
- Coreference resolution for English, French, German and Polish, optimised for limited training data and easily extensible for further lang… ☆127 · Updated last year
- Zero and Few shot named entity & relationships recognition ☆391 · Updated last month
- A Word Level Transformer layer based on PyTorch and 🤗 Transformers. ☆34 · Updated last year