FL33TW00D / embdLinks

GPU accelerated client-side embeddings for vector search, RAG etc.

☆66

Alternatives and similar repositories for embd

Users that are interested in embd are comparing it to the libraries listed below

Sorting:

taylorai / onnx_embedding_models
utilities for loading and running text embeddings with onnx
☆44Updated 11 months ago
Narsil / hf-chat
☆26Updated 7 months ago
enjalot / latent-data-modal
Using modal.com to process FineWeb-edu data
☆20Updated 3 months ago
1rgs / tokenwiz
A clone of OpenAI's Tokenizer page for HuggingFace Models
☆45Updated last year
deployradiant / pychatml
Chat Markup Language conversation library
☆55Updated last year
mrcolo / longboii
☆19Updated 2 years ago
sdan / selfextend
an implementation of Self-Extend, to expand the context window via grouped attention
☆119Updated last year
abetlen / program-constrained-language-model-sampling
☆35Updated 2 years ago
euclaise / SlimTrainer
Full finetuning of large language models without large memory requirements
☆94Updated last year
mattneary / colorspace
assign color hues to a collection of text fragments based on embeddings
☆20Updated last year
catid / lllm
Latent Large Language Models
☆18Updated 11 months ago
FL33TW00D / laserbeak
Add local LLMs to your Web or Electron apps! Powered by Rust + WebGPU
☆102Updated 2 years ago
Vaibhavs10 / fast-llm.rs
☆138Updated last year
bananaml / potassium
An HTTP serving framework by Banana
☆102Updated last year
foobarbaz-inc / conversation-memory-streamlit
Demo of ConversationEntityMemory in Streamlit.
☆52Updated 2 years ago
SpellcraftAI / turing
Turing machines, Rule 110, and A::B reversal using Claude 3 Opus.
☆58Updated last year
s-smits / grpo-optuna
Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna
☆55Updated 5 months ago
N8python / n8loom
A tree-based prefix cache library that allows rapid creation of looms: hierarchal branching pathways of LLM generations.
☆71Updated 5 months ago
ashvardanian / jaccard-index
Optimizing bit-level Jaccard Index and Population Counts for large-scale quantized Vector Search via Harley-Seal CSA and Lookup Tables
☆20Updated 2 months ago
teknium1 / transformers-gptq-quant
☆47Updated last year
yoheinakajima / autofinetune
auto fine tune of models with synthetic data
☆76Updated last year
knowrohit / know_medical_dialogues
KMD is a collection of conversational exchanges between patients and doctors on various medical topics. It aims to capture the intricaci…
☆24Updated last year
simonw / llm-embed-jina
Embedding models from Jina AI
☆61Updated last year
CarperAI / treasure_trove
☆22Updated last year
whitphx / transformers.js.py
☆86Updated last week
vikhyat / mixtral-inference
inference code for mixtral-8x7b-32kseqlen
☆101Updated last year
granawkins / latent-dictionary
A dictionary, but it shows you position in embedding space relative to some synonyms/antonyms instead of a definition.
☆74Updated 6 months ago
Birch-san / mpt-play
Command-line script for inferencing from models such as MPT-7B-Chat
☆100Updated 2 years ago
tensoic / Cerule
Cerule - A Tiny Mighty Vision Model
☆66Updated 10 months ago
cfahlgren1 / hf-data-explorer
Chrome Extension for exploring Hugging Face datasets 🔎
☆50Updated 10 months ago