utilities for loading and running text embeddings with onnx
☆46 · Aug 16, 2025 · Updated 10 months ago
Alternatives and similar repositories for onnx_embedding_models
Users that are interested in onnx_embedding_models are comparing it to the libraries listed below.
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions. ☆20 · Jun 3, 2024 · Updated 2 years ago
- Universal text classifier for generative models ☆24 · Jul 25, 2024 · Updated last year
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o… ☆161 · Jul 14, 2025 · Updated 11 months ago
- Run embedding models using ONNX ☆36 · Jan 29, 2024 · Updated 2 years ago
- Official implementation of "BERTs are Generative In-Context Learners" ☆32 · Mar 14, 2025 · Updated last year
- Fine-tune ModernBERT with custom tokenizers, curriculum learning, and next-gen optimizers. ☆74 · Jan 16, 2026 · Updated 5 months ago
- run embeddings in MLX ☆97 · Sep 27, 2024 · Updated last year
- ☆34 · Jun 17, 2024 · Updated 2 years ago
- Neural Search ☆371 · Mar 11, 2025 · Updated last year
- Mistral-7B finetuned for function calling ☆17 · Jan 28, 2024 · Updated 2 years ago
- utilities for batched llm calls with retries ☆51 · Updated this week
- The official repository of the OpenToM dataset ☆33 · Feb 2, 2025 · Updated last year
- Training code for Sparse Autoencoders on Embedding models ☆39 · Jun 16, 2026 · Updated 2 weeks ago
- Sparse autoencoders for Contra text embedding models ☆25 · Apr 24, 2024 · Updated 2 years ago
- Vector Embedding Server in under 100 lines of code ☆22 · Mar 1, 2024 · Updated 2 years ago
- ☆28 · Aug 1, 2024 · Updated last year
- Efficient vector database for hundreds of millions of embeddings. ☆216 · May 17, 2024 · Updated 2 years ago
- Model implementation for the contextual embeddings project ☆47 · Jun 2, 2025 · Updated last year
- ☆30 · Apr 26, 2024 · Updated 2 years ago
- ☆67 · Mar 4, 2024 · Updated 2 years ago
- ☆22 · Jun 26, 2024 · Updated 2 years ago
- A single static file as vector database, using the cloud-native flatgeobuf file format and http range requests ☆17 · Oct 28, 2025 · Updated 8 months ago
- Index of URLs to pdf files all over the internet and scripts ☆25 · May 2, 2023 · Updated 3 years ago
- minimal pytorch implementation of bm25 (with sparse tensors) ☆106 · Oct 28, 2025 · Updated 8 months ago
- data cleaning and curation for unstructured text ☆330 · Aug 6, 2024 · Updated last year
- Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search. ☆34 · Oct 8, 2025 · Updated 8 months ago
- smolLM with Entropix sampler on pytorch ☆149 · Oct 31, 2024 · Updated last year
- Build Agentic workflows with function calling using open LLMs ☆27 · Jun 1, 2026 · Updated last month
- TinyMCE Component for SolidJS ☆13 · May 29, 2025 · Updated last year
- ☆10 · Jun 29, 2021 · Updated 5 years ago
- entropix style sampling + GUI ☆27 · Oct 30, 2024 · Updated last year
- A high-performance batching router that optimises max throughput for text inference workloads ☆16 · Sep 6, 2023 · Updated 2 years ago
- Efficient BM25 with DuckDB 🦆 ☆68 · Dec 20, 2024 · Updated last year
- ☆23 · Jun 20, 2026 · Updated 2 weeks ago
- Set up an MCP server in 60 seconds. ☆13 · Dec 12, 2024 · Updated last year
- Get a markdown version of any webpage with a keyboard shortcut. ☆69 · Feb 17, 2025 · Updated last year
- FormFill is a CLI tool that uses LLMs to automatically fill out PDF forms. ☆33 · Nov 22, 2024 · Updated last year
- Demo of ConversationEntityMemory in Streamlit. ☆52 · Jan 23, 2023 · Updated 3 years ago
- ☆15 · Apr 26, 2025 · Updated last year