utilities for loading and running text embeddings with onnx
☆45Aug 16, 2025Updated 6 months ago
Alternatives and similar repositories for onnx_embedding_models
Users that are interested in onnx_embedding_models are comparing it to the libraries listed below
Sorting:
- Using modal.com to process FineWeb-edu data☆20Apr 5, 2025Updated 11 months ago
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.☆20Jun 3, 2024Updated last year
- ☆12Apr 21, 2025Updated 10 months ago
- Fine-tune ModernBERT with custom tokenizers, curriculum learning, and next-gen optimizers.☆74Jan 16, 2026Updated last month
- ☆40Updated this week
- ☆13Dec 12, 2023Updated 2 years ago
- Official implementation of "BERTs are Generative In-Context Learners"☆32Mar 14, 2025Updated 11 months ago
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o…☆159Jul 14, 2025Updated 7 months ago
- Training code for Sparse Autoencoders on Embedding models☆39Feb 27, 2025Updated last year
- Mistral-7B finetuned for function calling☆16Jan 28, 2024Updated 2 years ago
- Neural Search☆367Mar 11, 2025Updated 11 months ago
- Vector Embedding Server in under 100 lines of code☆22Mar 1, 2024Updated 2 years ago
- ☆24Apr 5, 2023Updated 2 years ago
- A high performance batching router optimises max throughput for text inference workload☆16Sep 6, 2023Updated 2 years ago
- FormFill is a CLI tool that uses LLMs to automatically fill out PDF forms.☆29Nov 22, 2024Updated last year
- ☆22Jun 26, 2024Updated last year
- Build Agentic workflows with function calling using open LLMs☆28Feb 2, 2026Updated last month
- 5X faster 60% less memory QLoRA finetuning☆21May 28, 2024Updated last year
- ☆27Aug 1, 2024Updated last year
- Index of URLs to pdf files all over the internet and scripts☆25May 2, 2023Updated 2 years ago
- ⚡️Lightning fast in-memory VectorDB written in rust🦀☆30Mar 10, 2025Updated 11 months ago
- Demo of ConversationEntityMemory in Streamlit.☆52Jan 23, 2023Updated 3 years ago
- smolLM with Entropix sampler on pytorch☆149Oct 31, 2024Updated last year
- utilities for batched llm calls with retries☆46Feb 26, 2026Updated last week
- minimal pytorch implementation of bm25 (with sparse tensors)☆104Oct 28, 2025Updated 4 months ago
- codellama on CPU without Docker☆25Feb 8, 2024Updated 2 years ago
- Sparse autoencoders for Contra text embedding models☆25Apr 24, 2024Updated last year
- Collection of resources for RL and Reasoning☆27Feb 3, 2025Updated last year
- Tools for training schema-aware Web table embedding for unsupervised and supervised machine learning on tabular data☆21Apr 14, 2024Updated last year
- Generate, explain and execute commands in the CLI☆27Feb 21, 2025Updated last year
- ☆67Mar 4, 2024Updated 2 years ago
- data cleaning and curation for unstructured text☆329Aug 6, 2024Updated last year
- look how they massacred my boy☆63Oct 16, 2024Updated last year
- LLM plugin for models hosted by Anyscale Endpoints☆35Apr 22, 2024Updated last year
- Get a markdown version of any webpage with a keyboard shortcut.☆67Feb 17, 2025Updated last year
- ☆27Oct 22, 2024Updated last year
- ☆24Jan 30, 2025Updated last year
- Fluid Database☆113Sep 20, 2024Updated last year
- Not financial advice.☆28Mar 18, 2023Updated 2 years ago