CLARIN-PL / embeddingsLinks
Embeddings: State-of-the-art Text Representations for Natural Language Processing tasks, an initial version of library focus on the Polish Language
☆36Updated 2 years ago
Alternatives and similar repositories for embeddings
Users that are interested in embeddings are comparing it to the libraries listed below
Sorting:
- This is the way: designing and compiling LEPISZCZE, a comprehensive NLP benchmark for Polish☆13Updated 2 years ago
- A python package for benchmarking interpretability techniques on Transformers.☆214Updated last year
- RoBERTa models for Polish☆89Updated 3 years ago
- Inquisitive Parrots for Search☆199Updated 7 months ago
- Weakly Supervised End-to-End Learning (NeurIPS 2021)☆156Updated 2 years ago
- FastFit ⚡ When LLMs are Unfit Use FastFit ⚡ Fast and Effective Text Classification with Many Classes☆212Updated 3 months ago
- Notebooks for training universal 0-shot classifiers on many different tasks☆137Updated last year
- Rax is a Learning-to-Rank library written in JAX.☆335Updated 4 months ago
- Late Interaction Models Training & Retrieval☆679Updated this week
- State-of-the-art paired encoder and decoder models (17M-1B params)☆54Updated 5 months ago
- Truly flash T5 realization!☆71Updated last year
- Transformer Grammars: Augmenting Transformer Language Models with Syntactic Inductive Biases at Scale, TACL (2022)☆133Updated last month
- A Python library aimed at dissecting and augmenting NER training data.☆59Updated 2 years ago
- Fine-tuning scripts for evaluating transformer-based models on KLEJ benchmark.☆26Updated 2 years ago
- Annotated corpus + evaluation metrics for text anonymisation☆70Updated 5 months ago
- Datasets collection and preprocessings framework for NLP extreme multitask learning☆189Updated 6 months ago
- Camoscio: An Italian instruction-tuned language model based on LLaMA☆127Updated 2 years ago
- A monolingual and cross-lingual meta-embedding generation and evaluation framework☆79Updated 3 years ago
- Generalist and Lightweight Model for Text Classification☆167Updated last month
- A fast implementation of T5/UL2 in PyTorch using Flash Attention☆112Updated 2 months ago
- Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: …☆338Updated 2 years ago
- Shared code for training sentence embeddings with Flax / JAX☆28Updated 4 years ago
- Efficiently find the best-suited language model (LM) for your NLP task☆132Updated 5 months ago
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.☆111Updated last year
- Pre-train Static Word Embeddings☆94Updated 4 months ago
- Generalised Contrastive Learning. This is a Repository for Google Shopping Dataset and Benchmarks followed by our novel fine-grained cont…☆69Updated last week
- Neighborhood Contrastive Learning for Scientific Document Representations with Citation Embeddings (EMNLP 2022 paper)☆75Updated last week
- 💫 SpaCy wrapper for ConceptNet 💫☆95Updated last week
- A PyTorch-based open-source framework that provides methods for improving the weakly annotated data and allows researchers to efficiently…☆108Updated last year
- Bi-encoder entity linking architecture☆51Updated last year