taylorai / onnx_embedding_modelsLinks

utilities for loading and running text embeddings with onnx

☆44

Alternatives and similar repositories for onnx_embedding_models

Users that are interested in onnx_embedding_models are comparing it to the libraries listed below

Sorting:

enjalot / latent-data-modal
Using modal.com to process FineWeb-edu data
☆20Updated 3 months ago
JD-P / RetroInstruct
Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search.
☆32Updated 5 months ago
deployradiant / pychatml
Chat Markup Language conversation library
☆55Updated last year
FL33TW00D / embd
GPU accelerated client-side embeddings for vector search, RAG etc.
☆66Updated last year
s-smits / grpo-optuna
Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna
☆55Updated 5 months ago
teknium1 / transformers-gptq-quant
☆47Updated last year
BBischof / yapping
Verbosity control for AI agents
☆64Updated last year
lightonai / pylate-rs
PyLate efficient inference engine
☆61Updated 2 weeks ago
AnswerDotAI / ModernBERT-Instruct-mini-cookbook
☆49Updated 5 months ago
1rgs / tokenwiz
A clone of OpenAI's Tokenizer page for HuggingFace Models
☆45Updated last year
cfahlgren1 / hf-data-explorer
Chrome Extension for exploring Hugging Face datasets 🔎
☆50Updated 10 months ago
MinishLab / tokenlearn
Pre-train Static Word Embeddings
☆84Updated 2 months ago
raphaelsty / LeNLP
NLP with Rust for Python 🦀🐍
☆64Updated 2 months ago
Narsil / hf-chat
☆26Updated 7 months ago
sdan / selfextend
an implementation of Self-Extend, to expand the context window via grouped attention
☆119Updated last year
Muhtasham / summarization-eval
📝 Reference-Free automatic summarization evaluation with potential hallucination detection
☆101Updated last year
Hannibal046 / nanoColBERT
Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).
☆80Updated last year
abetlen / program-constrained-language-model-sampling
☆35Updated 2 years ago
knowrohit / know_medical_dialogues
KMD is a collection of conversational exchanges between patients and doctors on various medical topics. It aims to capture the intricaci…
☆24Updated last year
enjalot / latent-sae
Training code for Sparse Autoencoders on Embedding models
☆38Updated 5 months ago
thesephist / spectre
Sparse autoencoders for Contra text embedding models
☆25Updated last year
xjdr-alt / muzero_sketch
☆38Updated last year
rosmineb / unit_test_rl
Project code for training LLMs to write better unit tests + code
☆21Updated 2 months ago
doomslide / autoloom
Approximating the joint distribution of language models via MCTS
☆21Updated 8 months ago
waefrebeorn / KAN-WuBu-Memory
An AI character interaction system with emotional modeling and advanced memory management
☆16Updated 9 months ago
ashvardanian / jaccard-index
Optimizing bit-level Jaccard Index and Population Counts for large-scale quantized Vector Search via Harley-Seal CSA and Lookup Tables
☆20Updated 2 months ago
simonw / llm-embed-jina
Embedding models from Jina AI
☆61Updated last year
simonw / llm-cluster
LLM plugin for clustering embeddings
☆77Updated last year
Pleias / Quest-Best-Tokens
An introduction to LLM Sampling
☆79Updated 7 months ago
simonw / llm-anyscale-endpoints
LLM plugin for models hosted by Anyscale Endpoints
☆33Updated last year