utilities for loading and running text embeddings with onnx
☆45 · Aug 16, 2025 · Updated 7 months ago
Alternatives and similar repositories for onnx_embedding_models
Users that are interested in onnx_embedding_models are comparing it to the libraries listed below.
- Using modal.com to process FineWeb-edu data ☆20 · Apr 5, 2025 · Updated 11 months ago
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions. ☆20 · Jun 3, 2024 · Updated last year
- Universal text classifier for generative models ☆24 · Jul 25, 2024 · Updated last year
- ☆42 · Updated this week
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o… ☆160 · Jul 14, 2025 · Updated 8 months ago
- Official implementation of "BERTs are Generative In-Context Learners" ☆32 · Mar 14, 2025 · Updated last year
- Fine-tune ModernBERT with custom tokenizers, curriculum learning, and next-gen optimizers. ☆74 · Jan 16, 2026 · Updated 2 months ago
- run embeddings in MLX ☆97 · Sep 27, 2024 · Updated last year
- ☆33 · Jun 17, 2024 · Updated last year
- Wikidata's QRank as a SQLite DB. ☆28 · Jan 16, 2024 · Updated 2 years ago
- Neural Search ☆367 · Mar 11, 2025 · Updated last year
- Mistral-7B finetuned for function calling ☆16 · Jan 28, 2024 · Updated 2 years ago
- ☆12 · Apr 21, 2025 · Updated 11 months ago
- sponge your gmail with artificial intelligence ☆22 · Jan 22, 2025 · Updated last year
- The official repository of the OpenToM dataset ☆29 · Feb 2, 2025 · Updated last year
- A toolkit for managing data access policies as code ☆13 · Apr 18, 2024 · Updated last year
- A stable, fast and easy-to-use inference library with a focus on a sync-to-async API ☆48 · Sep 26, 2024 · Updated last year
- Sparse autoencoders for Contra text embedding models ☆25 · Apr 24, 2024 · Updated last year
- assign color hues to a collection of text fragments based on embeddings ☆20 · Jun 15, 2024 · Updated last year
- Efficient vector database for hundred millions of embeddings. ☆213 · May 17, 2024 · Updated last year
- ☆30 · Apr 26, 2024 · Updated last year
- ☆67 · Mar 4, 2024 · Updated 2 years ago
- minimal pytorch implementation of bm25 (with sparse tensors) ☆104 · Oct 28, 2025 · Updated 4 months ago
- ☆22 · Jun 26, 2024 · Updated last year
- ☆26 · Dec 13, 2024 · Updated last year
- data cleaning and curation for unstructured text ☆329 · Aug 6, 2024 · Updated last year
- Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search. ☆33 · Oct 8, 2025 · Updated 5 months ago
- Build Agentic workflows with function calling using open LLMs ☆28 · Mar 2, 2026 · Updated 3 weeks ago
- 5X faster 60% less memory QLoRA finetuning ☆21 · May 28, 2024 · Updated last year
- entropix style sampling + GUI ☆27 · Oct 30, 2024 · Updated last year
- A high performance batching router optimises max throughput for text inference workload ☆16 · Sep 6, 2023 · Updated 2 years ago
- YC companies example built on Trieve ☆14 · Aug 1, 2024 · Updated last year
- 🍒 Dynamically inline assets into the DOM using Fetch Injection. Mirror of Fetch Inject on Codeberg. ☆13 · May 26, 2024 · Updated last year
- A Claude Code plugin + Agent Skill + MCP Server for analyzing Federal Election Commission (FEC) campaign finance filings. ☆31 · Feb 4, 2026 · Updated last month
- ☆21 · Mar 18, 2026 · Updated last week
- Finetune your embeddings in-browser ☆34 · Apr 14, 2024 · Updated last year
- Setup an MCP server in 60 seconds. ☆13 · Dec 12, 2024 · Updated last year
- An interface for llama.cpp, ChatGPT, Gemini, and Claude ☆27 · Mar 9, 2026 · Updated 2 weeks ago
- A simple app for downloading YouTube Shorts transcripts. Built to self-host with Python and Streamlit. Free and open source. ☆32 · Dec 4, 2024 · Updated last year