utilities for loading and running text embeddings with onnx
☆45Aug 16, 2025Updated 9 months ago
Alternatives and similar repositories for onnx_embedding_models
Users that are interested in onnx_embedding_models are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Using modal.com to process FineWeb-edu data☆20Apr 11, 2026Updated last month
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.☆20Jun 3, 2024Updated last year
- Universal text classifier for generative models☆24Jul 25, 2024Updated last year
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o…☆160Jul 14, 2025Updated 10 months ago
- Run embedding models using ONNX☆36Jan 29, 2024Updated 2 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Official implementation of "BERTs are Generative In-Context Learners"☆32Mar 14, 2025Updated last year
- Fine-tune ModernBERT with custom tokenizers, curriculum learning, and next-gen optimizers.☆74Jan 16, 2026Updated 4 months ago
- run embeddings in MLX☆98Sep 27, 2024Updated last year
- ☆65Updated this week
- ☆34Jun 17, 2024Updated last year
- Mistral-7B finetuned for function calling☆16Jan 28, 2024Updated 2 years ago
- utilities for batched llm calls with retries☆50Apr 23, 2026Updated last month
- ☆12Apr 21, 2025Updated last year
- sponge your gmail with artificial intelligence☆21Jan 22, 2025Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- The official repository of the OpenToM dataset☆32Feb 2, 2025Updated last year
- Training code for Sparse Autoencoders on Embedding models☆39May 9, 2026Updated 2 weeks ago
- Sparse autoencoders for Contra text embedding models☆25Apr 24, 2024Updated 2 years ago
- assign color hues to a collection of text fragments based on embeddings☆20Jun 15, 2024Updated last year
- Vector Embedding Server in under 100 lines of code☆22Mar 1, 2024Updated 2 years ago
- ☆28Aug 1, 2024Updated last year
- ☆12Jun 2, 2023Updated 2 years ago
- Efficient vector database for hundred millions of embeddings.☆215May 17, 2024Updated 2 years ago
- Model implementation for the contextual embeddings project☆47Jun 2, 2025Updated 11 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆30Apr 26, 2024Updated 2 years ago
- ☆67Mar 4, 2024Updated 2 years ago
- ☆22Jun 26, 2024Updated last year
- Index of URLs to pdf files all over the internet and scripts☆25May 2, 2023Updated 3 years ago
- minimal pytorch implementation of bm25 (with sparse tensors)☆105Oct 28, 2025Updated 6 months ago
- data cleaning and curation for unstructured text☆329Aug 6, 2024Updated last year
- smolLM with Entropix sampler on pytorch☆149Oct 31, 2024Updated last year
- Build Agentic workflows with function calling using open LLMs☆28May 4, 2026Updated 3 weeks ago
- ☆10Jun 29, 2021Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- 5X faster 60% less memory QLoRA finetuning☆21May 28, 2024Updated last year
- entropix style sampling + GUI☆27Oct 30, 2024Updated last year
- A high performance batching router optimises max throughput for text inference workload☆16Sep 6, 2023Updated 2 years ago
- 🍒 Dynamically inline assets into the DOM using Fetch Injection. Mirror of Fetch Inject on Codeberg.☆13May 26, 2024Updated last year
- Finetune your embeddings in-browser☆35Apr 14, 2024Updated 2 years ago
- Setup an MCP server in 60 seconds.☆13Dec 12, 2024Updated last year
- Get a markdown version of any webpage with a keyboard shortcut.☆67Feb 17, 2025Updated last year