CLARIN-PL / embeddingsLinks
Embeddings: State-of-the-art Text Representations for Natural Language Processing tasks, an initial version of library focus on the Polish Language 
☆36Updated last year
Alternatives and similar repositories for embeddings
Users that are interested in embeddings are comparing it to the libraries listed below
Sorting:
- This is the way: designing and compiling LEPISZCZE, a comprehensive NLP benchmark for Polish☆13Updated last year
 - A python package for benchmarking interpretability techniques on Transformers.☆212Updated last year
 - RoBERTa models for Polish☆88Updated 3 years ago
 - Inquisitive Parrots for Search☆198Updated 4 months ago
 - Late Interaction Models Training & Retrieval☆632Updated this week
 - Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: …☆338Updated 2 years ago
 - FastFit ⚡ When LLMs are Unfit Use FastFit ⚡ Fast and Effective Text Classification with Many Classes☆212Updated last month
 - A Python library aimed at dissecting and augmenting NER training data.☆59Updated 2 years ago
 - Web UI & Backend for Data Annotations in Aya☆28Updated last year
 - A monolingual and cross-lingual meta-embedding generation and evaluation framework☆79Updated 3 years ago
 - git extension for {collaborative, communal, continual} model development☆215Updated 11 months ago
 - ☆84Updated 2 years ago
 - Weakly Supervised End-to-End Learning (NeurIPS 2021)☆157Updated 2 years ago
 - State-of-the-art paired encoder and decoder models (17M-1B params)☆53Updated 2 months ago
 - Datasets collection and preprocessings framework for NLP extreme multitask learning☆188Updated 3 months ago
 - Efficiently find the best-suited language model (LM) for your NLP task☆127Updated 3 months ago
 - ☆86Updated 7 months ago
 - ☆139Updated 2 years ago
 - A PyTorch-based open-source framework that provides methods for improving the weakly annotated data and allows researchers to efficiently…☆108Updated last year
 - just a bunch of useful embeddings for scikit-learn pipelines☆518Updated last month
 - ☆42Updated last year
 - 🌸 fastText + Bloom embeddings for compact, full-coverage vectors with spaCy☆325Updated 6 months ago
 - provides a common interface to many IR measure tools☆91Updated 2 months ago
 - Generalist and Lightweight Model for Text Classification☆163Updated 4 months ago
 - Tool for named entity recognition for Polish based on deep learning.☆31Updated 2 years ago
 - Annotated corpus + evaluation metrics for text anonymisation☆70Updated 3 months ago
 - [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.☆110Updated last year
 - Generalised Contrastive Learning. This is a Repository for Google Shopping Dataset and Benchmarks followed by our novel fine-grained cont…☆66Updated last month
 - Library that contains implementations of machine learning components in the hyperbolic space☆142Updated last year
 - Retrieval-Augmented Generation battle!☆59Updated 3 months ago