CLARIN-PL / embeddings
Embeddings: State-of-the-art Text Representations for Natural Language Processing tasks. The initial version of the library focuses on the Polish language.
☆36 Updated 2 years ago
Alternatives and similar repositories for embeddings
Users who are interested in embeddings are comparing it to the libraries listed below.
- This is the way: designing and compiling LEPISZCZE, a comprehensive NLP benchmark for Polish ☆13 Updated 2 years ago
- RoBERTa models for Polish ☆89 Updated 3 years ago
- A python package for benchmarking interpretability techniques on Transformers. ☆214 Updated last year
- Weakly Supervised End-to-End Learning (NeurIPS 2021) ☆156 Updated 2 years ago
- A Python library aimed at dissecting and augmenting NER training data. ☆59 Updated 2 years ago
- Inquisitive Parrots for Search ☆199 Updated 6 months ago
- State-of-the-art paired encoder and decoder models (17M-1B params) ☆53 Updated 4 months ago
- Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: … ☆338 Updated 2 years ago
- A PyTorch-based open-source framework that provides methods for improving the weakly annotated data and allows researchers to efficiently… ☆108 Updated last year
- FastFit ⚡ When LLMs are Unfit Use FastFit ⚡ Fast and Effective Text Classification with Many Classes ☆214 Updated 2 months ago
- Pre-train Static Word Embeddings ☆93 Updated 3 months ago
- Neural Search ☆334 Updated last year
- Annotated corpus + evaluation metrics for text anonymisation ☆70 Updated 4 months ago
- Fine-tuning scripts for evaluating transformer-based models on KLEJ benchmark. ☆26 Updated 2 years ago
- Retrieval-Augmented Generation battle! ☆61 Updated 4 months ago
- Datasets collection and preprocessings framework for NLP extreme multitask learning ☆189 Updated 5 months ago
- ☆87 Updated 8 months ago
- A library to synthesize text datasets using Large Language Models (LLM) ☆152 Updated 2 years ago
- Vespa application making an index of the CORD-19 dataset. ☆39 Updated 5 months ago
- Late Interaction Models Training & Retrieval ☆666 Updated this week
- Efficiently find the best-suited language model (LM) for your NLP task ☆132 Updated 4 months ago
- Optimus is a flexible and scalable framework built to train language models efficiently across diverse hardware configurations, including… ☆67 Updated last week
- A Python framework for performing information retrieval experiments, building on http://terrier.org/ ☆490 Updated this week
- SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset. ☆47 Updated 2 years ago
- A monolingual and cross-lingual meta-embedding generation and evaluation framework ☆79 Updated 3 years ago
- ☆48 Updated last year
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models. ☆111 Updated last year
- ☆84 Updated 2 years ago
- Bi-encoder entity linking architecture ☆52 Updated last year
- Generalist and Lightweight Model for Text Classification ☆166 Updated last week