Efficiently find the best-suited language model (LM) for your NLP task
☆135Jul 26, 2025Updated 7 months ago
Alternatives and similar repositories for transformer-ranker
Users that are interested in transformer-ranker are comparing it to the libraries listed below
Sorting:
- Label shift estimation for transfer difficulty with Familiarity.☆10Feb 4, 2025Updated last year
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.☆110May 16, 2024Updated last year
- ☆10Oct 2, 2024Updated last year
- A very simple news crawler with a funny name☆438Updated this week
- [Konvens21] This repository contains the DFKI MobIE Corpus, a dataset of 3,232 German-language documents that have been annotated with fi…☆12Sep 17, 2024Updated last year
- The CleanCoNLL dataset from our EMNLP 2023 paper where we corrected annotation errors and inconsistencies in CoNLL-03.☆25Jul 2, 2024Updated last year
- A comprehensive benchmark for entity disambiguation☆28Jun 29, 2023Updated 2 years ago
- Code for SaGe subword tokenizer (EACL 2023)☆27Nov 30, 2024Updated last year
- Library for evaluating RAG using Nuclia's models☆18Jul 31, 2024Updated last year
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆32Sep 19, 2025Updated 5 months ago
- Python library to use Pleias-RAG models☆68May 1, 2025Updated 10 months ago
- Toolkit for domain-specific information retrieval experimentation☆19Feb 24, 2026Updated 2 weeks ago
- ☆57Dec 27, 2025Updated 2 months ago
- State-of-the-art paired encoder and decoder models (17M-1B params)☆63Aug 6, 2025Updated 7 months ago
- Model implementation for the contextual embeddings project☆41Jun 2, 2025Updated 9 months ago
- A blueprint for AI development, focusing on applied examples of RAG, information extraction, analysis and fine-tuning in the age of LLMs …☆64Feb 6, 2025Updated last year
- My NER Experiments with ModernBERT and Ettin☆26Jul 17, 2025Updated 7 months ago
- French Jurisprudences at your fingertips @ every 72h☆15Nov 18, 2025Updated 3 months ago
- ☆15Oct 24, 2023Updated 2 years ago
- Trully flash implementation of DeBERTa disentangled attention mechanism.☆81Feb 10, 2026Updated 3 weeks ago
- A extension of Transformers library to include T5ForSequenceClassification class.☆40Apr 17, 2023Updated 2 years ago
- A Multi-domain Benchmark for Personalized Search Evaluation☆12Sep 7, 2023Updated 2 years ago
- KnowMAN: Weakly Supervised Multinomial Adversarial Networks☆12Nov 9, 2021Updated 4 years ago
- decontamination☆26Updated this week
- Datamodels for hugging face tokenizers☆99Mar 2, 2026Updated last week
- Named Entity (NER) annotations of the Hebrew Treebank (Haaretz newspaper) corpus, including: morpheme and token level NER labels, nested …☆10Dec 27, 2021Updated 4 years ago
- Pytorch Datasets for Easy-To-Hard☆29Jan 9, 2025Updated last year
- ☆107Jun 2, 2025Updated 9 months ago
- Repository for opt-out requests.☆10Mar 25, 2024Updated last year
- Research into identifying and correcting incorrect labels in the CoNLL-2003 corpus.☆12May 11, 2021Updated 4 years ago
- YASEM - Yet Another Splade|Sparse Embedder - A simple and efficient library for SPLADE embeddings☆13May 22, 2025Updated 9 months ago
- ☆12Apr 29, 2022Updated 3 years ago
- ☆10Oct 15, 2019Updated 6 years ago
- Faster Learned Sparse Retrieval with Block-Max Pruning. ACM SIGIR 2024.☆35Jan 14, 2026Updated last month
- Vocabulary Trimming (VT) is a model compression technique, which reduces a multilingual LM vocabulary to a target language by deleting ir…☆63Oct 25, 2024Updated last year
- Chatbot using NLTK & Keras Deep Learning☆12May 19, 2020Updated 5 years ago
- Code for our TSD paper "TOKEN is a MASK: Few-shot Named Entity Recognition with Pre-trained Language Models"☆14Aug 19, 2022Updated 3 years ago
- T-Projection is a method to perform high-quality Annotation Projection of Sequence Labeling datasets.☆13Nov 21, 2023Updated 2 years ago
- ☆12Dec 6, 2024Updated last year