rom1504 / awesome-semantic-searchLinks
Semantic search with embeddings: index anything
☆139Updated 3 years ago
Alternatives and similar repositories for awesome-semantic-search
Users that are interested in awesome-semantic-search are comparing it to the libraries listed below
Sorting:
- Seahorse is a dataset for multilingual, multi-faceted summarization evaluation. It consists of 96K summaries with human ratings along 6 q…☆88Updated last year
- ☆43Updated 2 years ago
- Neural information retrieval / Semantic search / Bi-encoders☆172Updated last year
- Build Semantic Search with S-BERT and Fine-tune your model in unsupervised way☆58Updated 3 years ago
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…☆48Updated last year
- No Parameter Left Behind: How Distillation and Model Size Affect Zero-Shot Retrieval☆29Updated 2 years ago
- Simply, faster, sentence-transformers☆143Updated 10 months ago
- 🤗 Disaggregators: Curated data labelers for in-depth analysis.☆66Updated 2 years ago
- 🚀🤗 A collection of templates for Hugging Face Spaces☆35Updated last year
- Using short models to classify long texts☆21Updated 2 years ago
- H&M Fashion Image similarity search with Weaviate and DocArray☆43Updated last year
- Developing tools to automatically analyze datasets☆74Updated 8 months ago
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.☆108Updated last year
- Efficient few-shot learning with cross-encoders.☆54Updated last year
- Plug-and-play Search Interfaces with Pyserini and Hugging Face☆32Updated last year
- Reimplementation of the task generation part from the Alpaca paper☆119Updated 2 years ago
- Open source library for few shot NLP☆78Updated 2 years ago
- Library for creating causal chains using language models.☆78Updated 2 years ago
- RaKUn 2.0 - A fast keyword detection algorithm☆67Updated 3 months ago
- The largest multilingual image-text classification dataset. It contains fashion products.☆72Updated 2 years ago
- ☆86Updated 3 months ago
- Scripts to convert datasets from various sources to Hugging Face Datasets.☆57Updated 2 years ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.☆93Updated 2 years ago
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy.☆79Updated last year
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P…☆34Updated last year
- AuditNLG: Auditing Generative AI Language Modeling for Trustworthiness☆101Updated 5 months ago
- Completion After Prompt Probability. Make your LLM make a choice☆79Updated 8 months ago
- Code for Relevance-guided Supervision for OpenQA with ColBERT (TACL'21)☆41Updated 3 years ago
- What can I do with a LLM model?☆157Updated 3 months ago
- KeyPhraseTransformer lets you quickly extract key phrases, topics, themes from your text data with T5 transformer | Keyphrase extraction…☆104Updated last year