Snowflake-Labs / arctic-embedLinks
☆77Updated 9 months ago
Alternatives and similar repositories for arctic-embed
Users that are interested in arctic-embed are comparing it to the libraries listed below
Sorting:
- Pre-train Static Word Embeddings☆85Updated 3 weeks ago
- Model implementation for the contextual embeddings project☆36Updated 4 months ago
- Efficient few-shot learning with cross-encoders.☆60Updated last year
- This is the repo for the container that holds the models for the text2vec-transformers module☆55Updated 3 weeks ago
- CLIR version of ColBERT☆73Updated 3 months ago
- Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, imp…☆189Updated last year
- minimal pytorch implementation of bm25 (with sparse tensors)☆104Updated last year
- PyLate efficient inference engine☆64Updated 3 weeks ago
- Python API for https://vespa.ai, the open big data serving engine☆143Updated this week
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.☆109Updated last year
- ☆86Updated 6 months ago
- A large-scale multilingual dataset for Information Retrieval. Thorough human-annotations across 18 diverse languages.☆196Updated last year
- ☆62Updated last year
- ☆53Updated 2 months ago
- ☆46Updated 3 years ago
- Simply, faster, sentence-transformers☆143Updated last year
- Efficient BM25 with DuckDB 🦆☆55Updated 9 months ago
- SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset.☆47Updated 2 years ago
- XTR: Rethinking the Role of Token Retrieval in Multi-Vector Retrieval☆58Updated last year
- ☆57Updated last year
- Plug-and-play Search Interfaces with Pyserini and Hugging Face☆32Updated 2 years ago
- Enhaced version of Wikiextrator: A wikipedia dumps extractor☆21Updated 2 weeks ago
- Finetune mistral-7b-instruct for sentence embeddings☆87Updated last year
- Crispy reranking models by Mixedbread☆36Updated 2 weeks ago
- The pipeline for the OSCAR corpus☆172Updated last year
- Source code and data for Like a Good Nearest Neighbor☆30Updated 8 months ago
- Vespa application making an index of the CORD-19 dataset.☆39Updated 2 months ago
- provides a common interface to many IR measure tools☆91Updated last month
- ☆13Updated 3 years ago
- Benchmark study on LanceDB, an embedded vector DB, for full-text search and vector search☆27Updated last year