utilities for loading and running text embeddings with onnx
☆45Aug 16, 2025Updated 7 months ago
Alternatives and similar repositories for onnx_embedding_models
Users that are interested in onnx_embedding_models are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Using modal.com to process FineWeb-edu data☆20Apr 6, 2026Updated last week
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.☆20Jun 3, 2024Updated last year
- Official implementation of "BERTs are Generative In-Context Learners"☆32Mar 14, 2025Updated last year
- Fine-tune ModernBERT with custom tokenizers, curriculum learning, and next-gen optimizers.☆74Jan 16, 2026Updated 2 months ago
- ☆57Updated this week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- run embeddings in MLX☆98Sep 27, 2024Updated last year
- ☆33Jun 17, 2024Updated last year
- Neural Search☆369Mar 11, 2025Updated last year
- Mistral-7B finetuned for function calling☆16Jan 28, 2024Updated 2 years ago
- ☆12Jul 21, 2021Updated 4 years ago
- ☆13Aug 4, 2021Updated 4 years ago
- Resources backing the Feast fraud tutorial on GCP☆14May 31, 2022Updated 3 years ago
- A toolkit for managing data access policies as code☆12Apr 18, 2024Updated last year
- Training code for Sparse Autoencoders on Embedding models☆39Apr 5, 2026Updated last week
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- A stable, fast and easy-to-use inference library with a focus on a sync-to-async API☆48Sep 26, 2024Updated last year
- Sparse autoencoders for Contra text embedding models☆25Apr 24, 2024Updated last year
- assign color hues to a collection of text fragments based on embeddings☆20Jun 15, 2024Updated last year
- Vector Embedding Server in under 100 lines of code☆22Mar 1, 2024Updated 2 years ago
- ☆28Aug 1, 2024Updated last year
- ☆12Jun 2, 2023Updated 2 years ago
- Efficient vector database for hundred millions of embeddings.☆215May 17, 2024Updated last year
- Model implementation for the contextual embeddings project☆47Jun 2, 2025Updated 10 months ago
- ☆30Apr 26, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆67Mar 4, 2024Updated 2 years ago
- minimal pytorch implementation of bm25 (with sparse tensors)☆104Oct 28, 2025Updated 5 months ago
- ☆22Jun 26, 2024Updated last year
- A single static file as vector database, using the cloud-native flatgeobuf file format and http range requests☆17Oct 28, 2025Updated 5 months ago
- Index of URLs to pdf files all over the internet and scripts☆25May 2, 2023Updated 2 years ago
- data cleaning and curation for unstructured text☆329Aug 6, 2024Updated last year
- Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search.☆33Oct 8, 2025Updated 6 months ago
- XTR: Rethinking the Role of Token Retrieval in Multi-Vector Retrieval☆61Jun 20, 2024Updated last year
- Build Agentic workflows with function calling using open LLMs☆28Apr 6, 2026Updated last week
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- smolLM with Entropix sampler on pytorch☆149Oct 31, 2024Updated last year
- 5X faster 60% less memory QLoRA finetuning☆21May 28, 2024Updated last year
- entropix style sampling + GUI☆27Oct 30, 2024Updated last year
- A high performance batching router optimises max throughput for text inference workload☆16Sep 6, 2023Updated 2 years ago
- Get a markdown version of any webpage with a keyboard shortcut.☆66Feb 17, 2025Updated last year
- YC companies example built on Trieve☆14Aug 1, 2024Updated last year
- 🍒 Dynamically inline assets into the DOM using Fetch Injection. Mirror of Fetch Inject on Codeberg.☆13May 26, 2024Updated last year