CLARIN-PL / embeddings
Embeddings: State-of-the-art Text Representations for Natural Language Processing tasks, an initial version of library focus on the Polish Language
☆36Updated 9 months ago
Related projects: ⓘ
- This is the way: designing and compiling LEPISZCZE, a comprehensive NLP benchmark for Polish☆13Updated 9 months ago
- A python package for benchmarking interpretability techniques on Transformers.☆207Updated 2 months ago
- RoBERTa models for Polish☆88Updated 2 years ago
- Polish RoBERTA model trained on Polish literature, Wikipedia, and Oscar. The major assumption is that quality text will give a good mode…☆33Updated 3 years ago
- Polish datsets for grammatical error correction☆12Updated 11 months ago
- Fine-tuning scripts for evaluating transformer-based models on KLEJ benchmark.☆23Updated last year
- FastFit ⚡ When LLMs are Unfit Use FastFit ⚡ Fast and Effective Text Classification with Many Classes☆180Updated last month
- A Python library aimed at dissecting and augmenting NER training data.☆56Updated last year
- Generalist and Lightweight Model for Relation Extraction (Extract any relationship types from text)☆77Updated this week
- Evaluation of Sentence Representations in Polish☆21Updated last year
- Annotated corpus + evaluation metrics for text anonymisation☆48Updated 7 months ago
- RaKUn 2.0 - A fast keyword detection algorithm☆61Updated last month
- Tool for named entity recognition for Polish based on deep learning.☆29Updated last year
- A PyTorch-based open-source framework that provides methods for improving the weakly annotated data and allows researchers to efficiently…☆105Updated last week
- Weakly Supervised End-to-End Learning (NeurIPS 2021)☆153Updated last year
- A monolingual and cross-lingual meta-embedding generation and evaluation framework☆79Updated 2 years ago
- Generalist and Lightweight Model for Text Classification☆29Updated 2 weeks ago
- 💫 SpaCy wrapper for ConceptNet 💫☆88Updated last year
- Multi-task model for named-entity recognition, relation extraction, entity mention detection and coreference resolution.☆42Updated 2 months ago
- 🤗 Disaggregators: Curated data labelers for in-depth analysis.☆66Updated last year
- Code and data form the paper BERT Got a Date: Introducing Transformers to Temporal Tagging☆65Updated 2 years ago
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.☆99Updated 4 months ago
- Ranking of fine-tuned HF models as base models.☆35Updated last year
- A spaCy custom component that extracts and normalizes temporal expressions☆53Updated last year
- Truly flash T5 realization!☆48Updated 4 months ago
- Creating time-indexed datasets with clusters of texts as inputs and timeseries as targets.☆14Updated 2 months ago
- Polish BERT☆70Updated 3 years ago
- A library to synthesize text datasets using Large Language Models (LLM)☆151Updated last year
- 🔗 A graph-augmented dense statute retriever. (EACL 2023)☆17Updated 11 months ago
- A diff tool for language models☆42Updated 8 months ago