CLARIN-PL / embeddingsLinks
Embeddings: State-of-the-art Text Representations for Natural Language Processing tasks, an initial version of library focus on the Polish Language
☆37Updated 2 years ago
Alternatives and similar repositories for embeddings
Users that are interested in embeddings are comparing it to the libraries listed below
Sorting:
- This is the way: designing and compiling LEPISZCZE, a comprehensive NLP benchmark for Polish☆14Updated 2 years ago
- A python package for benchmarking interpretability techniques on Transformers.☆215Updated last year
- RoBERTa models for Polish☆90Updated 3 years ago
- Inquisitive Parrots for Search☆199Updated 8 months ago
- Weakly Supervised End-to-End Learning (NeurIPS 2021)☆156Updated 2 years ago
- A Python library aimed at dissecting and augmenting NER training data.☆60Updated 2 years ago
- git extension for {collaborative, communal, continual} model development☆217Updated last year
- Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: …☆340Updated 2 years ago
- ☆141Updated 2 years ago
- ☆11Updated 5 years ago
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.☆111Updated last year
- ☆89Updated 10 months ago
- A monolingual and cross-lingual meta-embedding generation and evaluation framework☆79Updated 3 years ago
- A library to synthesize text datasets using Large Language Models (LLM)☆152Updated 3 years ago
- Annotated corpus + evaluation metrics for text anonymisation☆71Updated 3 weeks ago
- Late Interaction Models Training & Retrieval☆701Updated this week
- Efficiently find the best-suited language model (LM) for your NLP task☆134Updated 6 months ago
- Datasets collection and preprocessings framework for NLP extreme multitask learning☆192Updated 7 months ago
- provides a common interface to many IR measure tools☆95Updated last week
- Retrieval-Augmented Generation battle!☆62Updated 6 months ago
- State-of-the-art paired encoder and decoder models (17M-1B params)☆58Updated 6 months ago
- FastFit ⚡ When LLMs are Unfit Use FastFit ⚡ Fast and Effective Text Classification with Many Classes☆213Updated 4 months ago
- Transformer Grammars: Augmenting Transformer Language Models with Syntactic Inductive Biases at Scale, TACL (2022)☆134Updated 2 weeks ago
- A PyTorch-based open-source framework that provides methods for improving the weakly annotated data and allows researchers to efficiently…☆108Updated last year
- Shared code for training sentence embeddings with Flax / JAX☆28Updated 4 years ago
- [NeurIPS 2021] WRENCH: Weak supeRvision bENCHmark☆226Updated last year
- ☆183Updated last year
- Notebooks for training universal 0-shot classifiers on many different tasks☆140Updated last year
- A Python framework for performing information retrieval experiments, building on http://terrier.org/☆493Updated 2 weeks ago
- Bi-encoder entity linking architecture☆52Updated last year