MeetElise / surprise-similarityLinks
A context-aware embedding similarity score
☆11Updated 2 years ago
Alternatives and similar repositories for surprise-similarity
Users that are interested in surprise-similarity are comparing it to the libraries listed below
Sorting:
- Efficient few-shot learning with cross-encoders.☆59Updated last year
- Pre-train Static Word Embeddings☆87Updated last month
- Trully flash implementation of DeBERTa disentangled attention mechanism.☆66Updated 2 weeks ago
- Code for SaGe subword tokenizer (EACL 2023)☆26Updated 10 months ago
- ☆83Updated 4 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimization☆66Updated last year
- Plug-and-play Search Interfaces with Pyserini and Hugging Face☆32Updated 2 years ago
- ☆49Updated 8 months ago
- State-of-the-art paired encoder and decoder models (17M-1B params)☆50Updated 2 months ago
- Universal text classifier for generative models☆25Updated last year
- Datasets collection and preprocessings framework for NLP extreme multitask learning☆188Updated 3 months ago
- minimal pytorch implementation of bm25 (with sparse tensors)☆104Updated last year
- RATransformers 🐭- Make your transformer (like BERT, RoBERTa, GPT-2 and T5) Relation Aware!☆41Updated 2 years ago
- Starbucks: Improved Training for 2D Matryoshka Embeddings☆22Updated 3 months ago
- RaKUn 2.0 - A fast keyword detection algorithm☆68Updated 2 months ago
- Python library to use Pleias-RAG models☆63Updated 5 months ago
- Using short models to classify long texts☆21Updated 2 years ago
- ☆77Updated 3 months ago
- EnriCo: Enriched Representation and Globally Constrained Inference for Entity and Relation Extraction☆26Updated last year
- ☆57Updated 2 weeks ago
- 🤗 Disaggregators: Curated data labelers for in-depth analysis.☆67Updated 2 years ago
- Source code and data for Like a Good Nearest Neighbor☆30Updated 9 months ago
- ☆57Updated last year
- Code and Dataset for Learning to Solve Complex Tasks by Talking to Agents☆24Updated 3 years ago
- Versatile framework designed to streamline the integration of your models, as well as those sourced from Hugging Face, into complex progr…☆34Updated last month
- PyTorch implementation for MRL☆19Updated last year
- Multilingual Entity Linking model by BELA model☆12Updated 2 years ago
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…☆49Updated last year
- A collection of datasets for language model pretraining including scripts for downloading, preprocesssing, and sampling.☆61Updated last year
- GraphER: A Structure-aware Text-to-Graph Model for Entity and Relation Extraction☆80Updated last year