Snowflake-Labs/arctic-embed

☆89

Alternatives and similar repositories for arctic-embed

Users who are interested in arctic-embed are comparing it to the libraries listed below.

isekulic / longformer-marco
Longformer for MS MARCO document re-ranking task.
☆20 · Jan 11, 2021 · Updated 5 years ago
LeeSureman / E5-Retrieval-Reproduction
Use contrastive learning to train a large language model (LLM) as a retriever
☆12 · Jul 19, 2024 · Updated 2 years ago
DSBA-Lab / Contrastive-Accumulation
☆14 · Jul 7, 2024 · Updated 2 years ago
PrithivirajDamodaran / SPLADERunner
Lightweight wrapper for the independent implementation of SPLADE++ models for search & retrieval pipelines. Models and Library created by…
☆35 · Aug 24, 2024 · Updated last year
algoprog / Faspect
A library for open domain query facet extraction and generation
☆16 · Apr 24, 2024 · Updated 2 years ago
yuh-zha / Align
Align, a general text alignment function
☆15 · Dec 7, 2023 · Updated 2 years ago
microsoft / MS-MARCO-Web-Search
A large-scale information-rich web dataset, featuring millions of real clicked query-document labels
☆351 · Dec 16, 2024 · Updated last year
datasette / datasette-litestream
Datasette plugin for streaming SQLite database backups to S3, using Litestream!
☆20 · Jan 20, 2026 · Updated 6 months ago
Lurunchik / NF-CATS
☆17 · Jul 18, 2022 · Updated 4 years ago
Snowflake-Labs / Arctic_Agentic_RAG
☆20 · Jun 3, 2025 · Updated last year
DunZhang / Jasper-Token-Compression-Training
The training code for Jasper-Token-Compression-600M
☆20 · Nov 19, 2025 · Updated 8 months ago
JHU-CLSP / mmBERT
A massively multilingual modern encoder language model
☆145 · Jan 20, 2026 · Updated 6 months ago
nlp-uoregon / ullme
☆20 · Apr 8, 2025 · Updated last year
microsoft / multifield-adaptive-retrieval
Code for the paper "Multi-Field Adaptive Retrieval," a research project on semi-structured document retrieval
☆18 · Feb 13, 2026 · Updated 5 months ago
roipony / flash-maxsim
☆27 · Jun 11, 2026 · Updated last month
Ankush7890 / ssfinetuning
A package for fine-tuning pretrained NLP transformers using semi-supervised learning
☆14 · Oct 27, 2021 · Updated 4 years ago
Ren-Research / Making-AI-Less-Thirsty
[Preprint] Making AI Less "Thirsty": Uncovering and Addressing the Secret Water Footprint of AI
☆32 · Apr 7, 2023 · Updated 3 years ago
hltcoe / rank-k
Repository for the listwise reranker Rank-K
☆16 · May 23, 2025 · Updated last year
sparkle-reasoning / sparkle
[NeurIPS'25] Beyond Accuracy: Dissecting Mathematical Reasoning for LLMs Under Reinforcement Learning
☆16 · Dec 12, 2025 · Updated 7 months ago
justin-yan / pyjust
☆14 · Jan 7, 2024 · Updated 2 years ago
koaning / scikit-churn
Exploring some issues related to churn
☆17 · Mar 19, 2024 · Updated 2 years ago
thunlp / ReInfoSelect
☆36 · Jun 12, 2023 · Updated 3 years ago
facebookresearch / ReasonIR
Official repository for the paper "ReasonIR: Training Retrievers for Reasoning Tasks".
☆230 · Jul 2, 2026 · Updated 3 weeks ago
J-Seo / KoCommonGEN-V2
KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models
☆25 · Aug 24, 2024 · Updated last year
hfthair / emerald_crawler
☆11 · Oct 12, 2023 · Updated 2 years ago
getlago / lago-python-client
Python wrapper for the Lago REST API
☆28 · Updated this week
naver / splade
SPLADE: sparse neural search (SIGIR21, SIGIR22)
☆999 · May 3, 2024 · Updated 2 years ago
microtica / templates
Production-ready infrastructure and application templates to build solutions on AWS without ever opening the console.
☆19 · Dec 9, 2025 · Updated 7 months ago
weaviate-tutorials / Hurricane
Writing Blog Posts with Generative Feedback Loops!
☆52 · Mar 19, 2024 · Updated 2 years ago
zetaalphavector / InPars
Inquisitive Parrots for Search
☆200 · Jun 5, 2025 · Updated last year
instructkr / reranker-simple-benchmark
Make running benchmarks simple yet maintainable, again. Currently supports only Korean-based cross-encoders.
☆35 · Dec 2, 2025 · Updated 7 months ago
enjalot / latent-sae
Training code for Sparse Autoencoders on Embedding models
☆39 · Jul 11, 2026 · Updated 2 weeks ago
PxYu / Pretraining-CLIR
"Cross-lingual Language Model Pretraining for Retrieval" (WWW 2021)
☆10 · Jun 17, 2022 · Updated 4 years ago
castorini / rank_llm
RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking.
☆610 · Updated this week
SVAIGBA / CDKGen
☆23 · May 4, 2020 · Updated 6 years ago
associatedpress / national-caseload-data-ingest
Scripts to download the U.S. Department of Justice's National Caseload Data and load it into Amazon Athena for querying
☆15 · May 22, 2023 · Updated 3 years ago
strangeloopcanon / ParaLLM
CLI that queries multiple language models in parallel using prompts from a CSV file
☆28 · Sep 24, 2025 · Updated 10 months ago
lintool / robust04-analysis
Meta-Analysis of Robust04 Papers (Yang et al., SIGIR 2019)
☆12 · May 25, 2019 · Updated 7 years ago
peijunallin / alphalora
☆19 · Nov 10, 2024 · Updated last year