qdrant / quaterion-models
The collection of bulding blocks building fine-tunable metric learning models
☆32Updated 2 months ago
Alternatives and similar repositories for quaterion-models:
Users that are interested in quaterion-models are comparing it to the libraries listed below
- Tools for merging pretrained large language models.☆19Updated 9 months ago
- ☆28Updated last year
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P…☆34Updated last year
- Companion Repo for the Vision Language Modelling YouTube series - https://bit.ly/3PsbsC2 - by Prithivi Da. Open to PRs and collaborations☆14Updated 2 years ago
- Using short models to classify long texts☆21Updated 2 years ago
- Tokenization across languages. Useful as preprocessing for subword tokenization.☆22Updated 2 years ago
- 🤗 Disaggregators: Curated data labelers for in-depth analysis.☆65Updated 2 years ago
- [WIP] Behold, semantic-search, built over sentence-transformers to make it easy for search engineers to evaluate, optimise and deploy mod…☆15Updated last year
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆34Updated 3 months ago
- A library for squeakily cleaning and filtering language datasets.☆46Updated last year
- My explorations into editing the knowledge and memories of an attention network☆34Updated 2 years ago
- Create a source of truth for ML model results and browse it on Papers with Code☆26Updated 3 years ago
- ☆15Updated last year
- Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP models…☆37Updated 2 years ago
- Embedding Recycling for Language models☆38Updated last year
- TorchServe+Streamlit for easily serving your HuggingFace NER models☆32Updated 2 years ago
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he…☆31Updated last year
- Consists of the largest (10K) human annotated code-switched semantic parsing dataset & 170K generated utterance using the CST5 augmentati…☆37Updated 2 years ago
- 🤝 Trade any tensors over the network☆30Updated last year
- ☆14Updated 5 months ago
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆23Updated this week
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…☆47Updated last year
- Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and te…☆42Updated last year
- PyTorch implementation for MRL☆18Updated last year
- Index of URLs to pdf files all over the internet and scripts☆22Updated last year
- 🚀🤗 A collection of templates for Hugging Face Spaces☆35Updated last year
- Pre-train Static Word Embeddings☆49Updated 2 weeks ago
- Training and Inference Notebooks for the RedPajama (OpenLlama) models☆18Updated last year
- All my experiments with the various transformers and various transformer frameworks available☆14Updated 3 years ago