hotchpotch / yasemLinks
YASEM - Yet Another Splade|Sparse Embedder - A simple and efficient library for SPLADE embeddings
☆12Updated 5 months ago
Alternatives and similar repositories for yasem
Users that are interested in yasem are comparing it to the libraries listed below
Sorting:
- My NER Experiments with ModernBERT and Ettin☆22Updated 3 months ago
- Pre-train Static Word Embeddings☆87Updated last month
- minimal pytorch implementation of bm25 (with sparse tensors)☆104Updated this week
- Model implementation for the contextual embeddings project☆36Updated 4 months ago
- Applying "Load What You Need: Smaller Versions of Multilingual BERT" to LaBSE☆19Updated 4 years ago
- Plug-and-play Search Interfaces with Pyserini and Hugging Face☆32Updated 2 years ago
- Trully flash implementation of DeBERTa disentangled attention mechanism.☆66Updated last month
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.☆96Updated 2 years ago
- Simply, faster, sentence-transformers☆143Updated last year
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P…☆35Updated 2 years ago
- Crispy reranking models by Mixedbread☆38Updated last month
- Efficient few-shot learning with cross-encoders.☆59Updated last year
- This repository contains code for cleaning your training data of benchmark data to help combat data snooping.☆27Updated 2 years ago
- ☆83Updated 3 months ago
- Code for "Word Tour: One-dimensional Word Embeddings via the Traveling Salesman Problem" (NAACL 2022)☆108Updated 5 months ago
- SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset.☆47Updated 2 years ago
- NLP with Rust for Python 🦀🐍☆65Updated 5 months ago
- Lite weight wrapper for the independent implementation of SPLADE++ models for search & retrieval pipelines. Models and Library created by…☆33Updated last year
- **ARCHIVED** Filesystem interface to 🤗 Hub☆58Updated 2 years ago
- A massively multilingual modern encoder language model☆104Updated 2 weeks ago
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆24Updated last year
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.☆88Updated last month
- Experiments with generating opensource language model assistants☆97Updated 2 years ago
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).☆79Updated last year
- Datamodels for hugging face tokenizers☆86Updated this week
- Source code and data for Like a Good Nearest Neighbor☆30Updated 9 months ago
- Trainable embedding transformation for confidence estimation, feature extraction, explainability and conversion from dense to sparse.☆26Updated 4 months ago
- YAST - Yet Another SPLADE or Sparse Trainer☆20Updated 4 months ago
- Starbucks: Improved Training for 2D Matryoshka Embeddings☆22Updated 4 months ago
- ☆33Updated 2 years ago