☆85 · Nov 3, 2025 · Updated 4 months ago
Alternatives and similar repositories for arctic-embed
Users who are interested in arctic-embed are comparing it to the libraries listed below.
- Make tool-calling schemas for existing tools · ☆14 · Mar 8, 2025 · Updated 11 months ago
- Terminal Image Viewer for iTerm2 · ☆12 · Jul 6, 2019 · Updated 6 years ago
- Scripts to download the U.S. Department of Justice's National Caseload Data and load it into Amazon Athena for querying · ☆14 · May 22, 2023 · Updated 2 years ago
- Smart reproducible analytical pipeline inspection · ☆21 · Feb 13, 2026 · Updated 2 weeks ago
- ☆20 · Apr 24, 2025 · Updated 10 months ago
- A package for fine tuning of pretrained NLP transformers using Semi Supervised Learning · ☆14 · Oct 27, 2021 · Updated 4 years ago
- Use contrastive learning to train a large language model (LLM) as a retriever · ☆12 · Jul 19, 2024 · Updated last year
- Pull out versions of specific files from a gitscraping repo into individual files. · ☆15 · Jul 14, 2021 · Updated 4 years ago
- hnswlib.rb provides Ruby bindings for Hnswlib · ☆15 · Feb 17, 2026 · Updated 2 weeks ago
- pip-installable SQLite extensions · ☆15 · Feb 23, 2023 · Updated 3 years ago
- ☆13 · Nov 28, 2020 · Updated 5 years ago
- Searchable archive of Tracking Jupyter newsletters · ☆15 · Jun 26, 2020 · Updated 5 years ago
- COGS Operates Google Sheets · ☆16 · Nov 20, 2025 · Updated 3 months ago
- A large-scale information-rich web dataset, featuring millions of real clicked query-document labels · ☆346 · Dec 16, 2024 · Updated last year
- Exploring some issues related to churn · ☆17 · Mar 19, 2024 · Updated last year
- CLI that queries multiple language models in parallel using prompts from a CSV file · ☆28 · Sep 24, 2025 · Updated 5 months ago
- GISTEmbed: Guided In-sample Selection of Training Negatives for Text Embeddings · ☆44 · Mar 6, 2024 · Updated last year
- Datasette enrichment for analyzing row data using OpenAI's GPT models · ☆22 · May 15, 2024 · Updated last year
- Proxy server for triton gRPC server that inferences embedding model in Rust · ☆21 · Aug 10, 2024 · Updated last year
- The corporate repository where we discuss our serious business · ☆22 · Mar 9, 2025 · Updated 11 months ago
- Keeps tabs on the ticking donation amount found on ActBlue's home page. · ☆24 · Updated this week
- [ICLR 2025 Oral] "Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free" · ☆90 · Oct 15, 2024 · Updated last year
- SQLite3 extension for read-only HTTP(S) database access · ☆57 · Nov 19, 2023 · Updated 2 years ago
- ☆57 · Jan 26, 2025 · Updated last year
- Inquisitive Parrots for Search · ☆199 · Jun 5, 2025 · Updated 9 months ago
- Efficient vector database for hundred millions of embeddings. · ☆212 · May 17, 2024 · Updated last year
- code for piccolo embedding model from SenseTime · ☆145 · May 21, 2024 · Updated last year
- Writing Blog Posts with Generative Feedback Loops! · ☆50 · Mar 19, 2024 · Updated last year
- NextPlaid, ColGREP: Multi-vector search, from database to coding agents. · ☆165 · Feb 26, 2026 · Updated last week
- 🔒🐧 Run command in a secure OS sandbox · ☆73 · Feb 3, 2026 · Updated last month
- sponge your gmail with artificial intelligence · ☆22 · Jan 22, 2025 · Updated last year
- Build requirements files from setup.py. · ☆27 · Sep 4, 2025 · Updated 6 months ago
- Provide fine-grained push access to GitHub from a JupyterHub · ☆29 · Updated this week
- Late Interaction Models Training & Retrieval · ☆732 · Updated this week
- Chat Markup Language conversation library · ☆55 · Jan 3, 2024 · Updated 2 years ago
- RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking. · ☆579 · Feb 24, 2026 · Updated last week
- [ICLR 2025] BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval · ☆190 · Sep 13, 2025 · Updated 5 months ago
- xargs for semgrep · ☆29 · Mar 27, 2024 · Updated last year
- Make running benchmark simple yet maintainable, again. Now only supports Korean-based cross-encoder. · ☆29 · Dec 2, 2025 · Updated 3 months ago