CLARIN-PL / LEPISZCZELinks
This is the way: designing and compiling LEPISZCZE, a comprehensive NLP benchmark for Polish
☆13Updated 2 years ago
Alternatives and similar repositories for LEPISZCZE
Users that are interested in LEPISZCZE are comparing it to the libraries listed below
Sorting:
- Embeddings: State-of-the-art Text Representations for Natural Language Processing tasks, an initial version of library focus on the Polis…☆36Updated 2 years ago
- This repository provides scripts for evaluating NLP models on the LEXTREME benchmark, a set of diverse multilingual tasks in legal NLP☆23Updated 2 years ago
- A python package for benchmarking interpretability techniques on Transformers.☆214Updated last year
- Datasets collection and preprocessings framework for NLP extreme multitask learning☆189Updated 6 months ago
- ☆116Updated 2 months ago
- ☆179Updated last year
- 💫 SpaCy wrapper for ConceptNet 💫☆95Updated last week
- Coreference resolution for English, French, German and Polish, optimised for limited training data and easily extensible for further lang…☆132Updated last year
- Bi-encoder entity linking architecture☆51Updated last year
- Pre-train Static Word Embeddings☆94Updated 4 months ago
- Generalist and Lightweight Model for Text Classification☆167Updated last month
- 💬 Language Identification with Support for More Than 2000 Labels -- EMNLP 2023☆182Updated last month
- Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: …☆338Updated 2 years ago
- The robust European language model benchmark.☆150Updated this week
- A Python library aimed at dissecting and augmenting NER training data.☆59Updated 2 years ago
- 🤗 Disaggregators: Curated data labelers for in-depth analysis.☆67Updated 2 years ago
- RaKUn 2.0 - A fast keyword detection algorithm☆69Updated 5 months ago
- A library to synthesize text datasets using Large Language Models (LLM)☆152Updated 2 years ago
- A multi-lingual approach to AllenNLP CoReference Resolution along with a wrapper for spaCy.☆109Updated last year
- RoBERTa models for Polish☆89Updated 3 years ago
- ☆65Updated 2 years ago
- SpanMarker for Named Entity Recognition☆462Updated last year
- Completion After Prompt Probability. Make your LLM make a choice☆82Updated last year
- Transformer Grammars: Augmenting Transformer Language Models with Syntactic Inductive Biases at Scale, TACL (2022)☆133Updated last month
- Camoscio: An Italian instruction-tuned language model based on LLaMA☆127Updated 2 years ago
- Neighborhood Contrastive Learning for Scientific Document Representations with Citation Embeddings (EMNLP 2022 paper)☆75Updated last week
- Search through Facebook Research's PyTorch BigGraph Wikidata-dataset with the Weaviate vector search engine☆31Updated 4 years ago
- Robust and fast topic models with sentence-transformers.☆88Updated 3 weeks ago
- Full named-entity (i.e., not tag/token) evaluation metrics based on SemEval’13☆199Updated 4 months ago
- Efficient few-shot learning with cross-encoders.☆60Updated last year