utilities for loading and running text embeddings with onnx
☆45Aug 16, 2025Updated 8 months ago
Alternatives and similar repositories for onnx_embedding_models
Users that are interested in onnx_embedding_models are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Using modal.com to process FineWeb-edu data☆20Apr 11, 2026Updated 3 weeks ago
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.☆20Jun 3, 2024Updated last year
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o…☆160Jul 14, 2025Updated 9 months ago
- Run embedding models using ONNX☆36Jan 29, 2024Updated 2 years ago
- Official implementation of "BERTs are Generative In-Context Learners"☆32Mar 14, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Fine-tune ModernBERT with custom tokenizers, curriculum learning, and next-gen optimizers.☆74Jan 16, 2026Updated 3 months ago
- run embeddings in MLX☆98Sep 27, 2024Updated last year
- ☆61Updated this week
- ☆34Jun 17, 2024Updated last year
- Neural Search☆371Mar 11, 2025Updated last year
- Mistral-7B finetuned for function calling☆16Jan 28, 2024Updated 2 years ago
- sponge your gmail with artificial intelligence☆21Jan 22, 2025Updated last year
- The official repository of the OpenToM dataset☆32Feb 2, 2025Updated last year
- Training code for Sparse Autoencoders on Embedding models☆39Apr 25, 2026Updated last week
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Sparse autoencoders for Contra text embedding models☆25Apr 24, 2024Updated 2 years ago
- A stable, fast and easy-to-use inference library with a focus on a sync-to-async API☆48Sep 26, 2024Updated last year
- Vector Embedding Server in under 100 lines of code☆22Mar 1, 2024Updated 2 years ago
- ☆28Aug 1, 2024Updated last year
- Efficient vector database for hundred millions of embeddings.☆215May 17, 2024Updated last year
- Model implementation for the contextual embeddings project☆47Jun 2, 2025Updated 11 months ago
- ☆67Mar 4, 2024Updated 2 years ago
- minimal pytorch implementation of bm25 (with sparse tensors)☆104Oct 28, 2025Updated 6 months ago
- ☆22Jun 26, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A single static file as vector database, using the cloud-native flatgeobuf file format and http range requests☆17Oct 28, 2025Updated 6 months ago
- ☆26Dec 13, 2024Updated last year
- data cleaning and curation for unstructured text☆329Aug 6, 2024Updated last year
- Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search.☆34Oct 8, 2025Updated 6 months ago
- XTR: Rethinking the Role of Token Retrieval in Multi-Vector Retrieval☆63Jun 20, 2024Updated last year
- smolLM with Entropix sampler on pytorch☆149Oct 31, 2024Updated last year
- Build Agentic workflows with function calling using open LLMs☆28Apr 6, 2026Updated 3 weeks ago
- TinyMCE Component for SolidJS☆13May 29, 2025Updated 11 months ago
- ☆10Jun 29, 2021Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- 5X faster 60% less memory QLoRA finetuning☆21May 28, 2024Updated last year
- entropix style sampling + GUI☆27Oct 30, 2024Updated last year
- A high performance batching router optimises max throughput for text inference workload☆16Sep 6, 2023Updated 2 years ago
- Efficient BM25 with DuckDB 🦆☆67Dec 20, 2024Updated last year
- 🍒 Dynamically inline assets into the DOM using Fetch Injection. Mirror of Fetch Inject on Codeberg.☆13May 26, 2024Updated last year
- ☆21Apr 24, 2026Updated last week
- Finetune your embeddings in-browser☆34Apr 14, 2024Updated 2 years ago