CLARIN-PL / embeddingsLinks
Embeddings: State-of-the-art Text Representations for Natural Language Processing tasks, an initial version of library focus on the Polish Language
☆36Updated last year
Alternatives and similar repositories for embeddings
Users that are interested in embeddings are comparing it to the libraries listed below
Sorting:
- This is the way: designing and compiling LEPISZCZE, a comprehensive NLP benchmark for Polish☆13Updated last year
- A python package for benchmarking interpretability techniques on Transformers.☆214Updated 11 months ago
- RoBERTa models for Polish☆87Updated 3 years ago
- State-of-the-art paired encoder and decoder models (17M-1B params)☆44Updated 3 weeks ago
- FastFit ⚡ When LLMs are Unfit Use FastFit ⚡ Fast and Effective Text Classification with Many Classes☆211Updated 3 months ago
- Weakly Supervised End-to-End Learning (NeurIPS 2021)☆156Updated 2 years ago
- Datasets collection and preprocessings framework for NLP extreme multitask learning☆186Updated last month
- Camoscio: An Italian instruction-tuned language model based on LLaMA☆127Updated last year
- ☆11Updated 4 years ago
- Efficiently find the best-suited language model (LM) for your NLP task☆127Updated last month
- A Python library aimed at dissecting and augmenting NER training data.☆58Updated 2 years ago
- A monolingual and cross-lingual meta-embedding generation and evaluation framework☆80Updated 3 years ago
- git extension for {collaborative, communal, continual} model development☆216Updated 9 months ago
- ☆86Updated 5 months ago
- Late Interaction Models Training & Retrieval☆532Updated this week
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.☆110Updated last year
- just a bunch of useful embeddings for scikit-learn pipelines☆516Updated 2 weeks ago
- 🌸 fastText + Bloom embeddings for compact, full-coverage vectors with spaCy☆319Updated 4 months ago
- SpanMarker for Named Entity Recognition☆451Updated 7 months ago
- Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: …☆337Updated 2 years ago
- Notebooks for training universal 0-shot classifiers on many different tasks☆136Updated 8 months ago
- Bi-encoder entity linking architecture☆49Updated 11 months ago
- ☆139Updated last year
- Transformer Grammars: Augmenting Transformer Language Models with Syntactic Inductive Biases at Scale, TACL (2022)☆128Updated 2 months ago
- Inquisitive Parrots for Search☆196Updated 2 months ago
- ☆82Updated 2 years ago
- 💫 SpaCy wrapper for ConceptNet 💫☆93Updated 2 years ago
- A library to synthesize text datasets using Large Language Models (LLM)☆152Updated 2 years ago
- Optimus is a flexible and scalable framework built to train language models efficiently across diverse hardware configurations, including…☆66Updated 2 months ago
- Generalist and Lightweight Model for Text Classification☆156Updated 2 months ago