☆86Nov 3, 2025Updated 6 months ago
Alternatives and similar repositories for arctic-embed
Users that are interested in arctic-embed are comparing it to the libraries listed below.
- Align, a general text alignment function☆15Dec 7, 2023Updated 2 years ago
- Use contrastive learning to train a large language model (LLM) as a retriever☆12Jul 19, 2024Updated last year
- GISTEmbed: Guided In-sample Selection of Training Negatives for Text Embeddings☆45Mar 6, 2024Updated 2 years ago
- ☆14Jul 7, 2024Updated last year
- Longformer for MS MARCO document re-ranking task.☆20Jan 11, 2021Updated 5 years ago
- Lightweight wrapper for the independent implementation of SPLADE++ models for search & retrieval pipelines. Models and Library created by…☆34Aug 24, 2024Updated last year
- Welcome to the LLM Tutorials and RAG Implementations repository! This repository provides tutorials, guides, and implementations for work…☆13Jul 1, 2025Updated 10 months ago
- A large-scale information-rich web dataset, featuring millions of real clicked query-document labels☆349Dec 16, 2024Updated last year
- Make tool-calling schemas for existing tools☆14Mar 8, 2025Updated last year
- Datasette plugin for streaming SQLite database backups to S3, using Litestream!☆19Jan 20, 2026Updated 3 months ago
- Repository for paper CELLS: A Parallel Corpus for Biomedical Lay Language Generation☆19Apr 2, 2024Updated 2 years ago
- code for piccolo embedding model from SenseTime☆144May 21, 2024Updated last year
- ☆20Apr 8, 2025Updated last year
- Topological Data Analysis (TDA) for Natural Language Processing (NLP) Applications☆25Apr 27, 2026Updated last week
- Experiments with reasoning models, training techniques, papers☆29Updated this week
- A package for fine tuning of pretrained NLP transformers using Semi Supervised Learning☆14Oct 27, 2021Updated 4 years ago
- SPLADE: sparse neural search (SIGIR21, SIGIR22)☆992May 3, 2024Updated 2 years ago
- [Preprint] Making AI Less "Thirsty": Uncovering and Addressing the Secret Water Footprint of AI☆31Apr 7, 2023Updated 3 years ago
- Official repository for paper "ReasonIR Training Retrievers for Reasoning Tasks".☆227Apr 8, 2026Updated 3 weeks ago
- Exploring some issues related to churn☆17Mar 19, 2024Updated 2 years ago
- ☆63Jan 26, 2025Updated last year
- A massively multilingual modern encoder language model☆140Jan 20, 2026Updated 3 months ago
- ☆36Jun 12, 2023Updated 2 years ago
- Terminal Image Viewer for iTerm2☆12Jul 6, 2019Updated 6 years ago
- Controllable Sentence Simplification with T5☆18May 24, 2023Updated 2 years ago
- Inquisitive Parrots for Search☆199Jun 5, 2025Updated 11 months ago
- Writing Blog Posts with Generative Feedback Loops!☆50Mar 19, 2024Updated 2 years ago
- The training codes of Jasper-Token-Compression-600M☆19Nov 19, 2025Updated 5 months ago
- Searchable archive of Tracking Jupyter newsletters☆15Jun 26, 2020Updated 5 years ago
- ☆19Nov 10, 2024Updated last year
- ☆10Dec 10, 2023Updated 2 years ago
- Training code for Sparse Autoencoders on Embedding models☆39Apr 25, 2026Updated last week
- "Cross-lingual Language Model Pretraining for Retrieval". (WWW 2021)☆10Jun 17, 2022Updated 3 years ago
- [ICLR 2025] BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval☆198Sep 13, 2025Updated 7 months ago
- ☆23May 4, 2020Updated 6 years ago
- Smart reproducible analytical pipeline inspection☆21Feb 13, 2026Updated 2 months ago
- Scripts to download the U.S. Department of Justice's National Caseload Data and load it into Amazon Athena for querying☆15May 22, 2023Updated 2 years ago
- ☆62Jul 21, 2024Updated last year
- Model implementation for the contextual embeddings project☆47Jun 2, 2025Updated 11 months ago