FL33TW00D / embd
GPU accelerated client-side embeddings for vector search, RAG etc.
☆65 Updated last year
Alternatives and similar repositories for embd
Users that are interested in embd are comparing it to the libraries listed below
- utilities for loading and running text embeddings with onnx ☆44 Updated 3 weeks ago
- Using modal.com to process FineWeb-edu data ☆20 Updated 5 months ago
- A clone of OpenAI's Tokenizer page for HuggingFace Models ☆45 Updated last year
- ☆19 Updated 2 years ago
- ☆26 Updated 9 months ago
- Chat Markup Language conversation library ☆55 Updated last year
- Turing machines, Rule 110, and A::B reversal using Claude 3 Opus. ☆58 Updated last year
- assign color hues to a collection of text fragments based on embeddings ☆20 Updated last year
- Demo of ConversationEntityMemory in Streamlit. ☆52 Updated 2 years ago
- Command-line script for inferencing from models such as falcon-7b-instruct ☆75 Updated 2 years ago
- an implementation of Self-Extend, to expand the context window via grouped attention ☆118 Updated last year
- Latent Large Language Models ☆18 Updated last year
- Optimizing bit-level Jaccard Index and Population Counts for large-scale quantized Vector Search via Harley-Seal CSA and Lookup Tables ☆21 Updated 3 months ago
- ☆46 Updated last year
- Embedding models from Jina AI ☆64 Updated last year
- A tree-based prefix cache library that allows rapid creation of looms: hierarchal branching pathways of LLM generations. ☆73 Updated 7 months ago
- ☆22 Updated 2 years ago
- Verbosity control for AI agents ☆65 Updated last year
- auto fine tune of models with synthetic data ☆76 Updated last year
- Chrome Extension for exploring Hugging Face datasets 🔎 ☆49 Updated 11 months ago
- inference code for mixtral-8x7b-32kseqlen ☆101 Updated last year
- Vanilla-Python ergonomics on top of DSPy ☆33 Updated 3 months ago
- KMD is a collection of conversational exchanges between patients and doctors on various medical topics. It aims to capture the intricaci… ☆24 Updated last year
- A dictionary, but it shows you position in embedding space relative to some synonyms/antonyms instead of a definition. ☆74 Updated 7 months ago
- Add local LLMs to your Web or Electron apps! Powered by Rust + WebGPU ☆103 Updated 2 years ago
- Command-line script for inferencing from models such as MPT-7B-Chat ☆100 Updated 2 years ago
- Simple Graph Memory for AI applications ☆90 Updated 3 months ago
- Full finetuning of large language models without large memory requirements ☆94 Updated last year
- Run GGML models with Kubernetes. ☆174 Updated last year
- An HTTP serving framework by Banana ☆101 Updated last year