FL33TW00D / embd
GPU accelerated client-side embeddings for vector search, RAG etc.
☆65 Updated last year
Alternatives and similar repositories for embd:
Users who are interested in embd are comparing it to the libraries listed below.
- A clone of OpenAI's Tokenizer page for HuggingFace Models ☆44 Updated last year
- Using modal.com to process FineWeb-edu data ☆20 Updated 2 months ago
- Modified Stanford-Alpaca Trainer for Training Replit's Code Model ☆40 Updated last year
- Chat Markup Language conversation library ☆55 Updated last year
- Latent Large Language Models ☆17 Updated 5 months ago
- utilities for loading and running text embeddings with onnx ☆44 Updated 6 months ago
- Command-line script for inferencing from models such as MPT-7B-Chat ☆101 Updated last year
- an implementation of Self-Extend, to expand the context window via grouped attention ☆118 Updated last year
- ☆4 Updated 5 months ago
- Routing on Random Forest (RoRF) ☆112 Updated 4 months ago
- Add local LLMs to your Web or Electron apps! Powered by Rust + WebGPU ☆102 Updated last year
- ☆20 Updated 3 months ago
- a version of baby agi using dspy and typed predictors ☆17 Updated 11 months ago
- ☆22 Updated last year
- assign color hues to a collection of text fragments based on embeddings ☆20 Updated 8 months ago
- Very minimal (and stateless) agent framework ☆41 Updated last month
- Embedding models from Jina AI ☆58 Updated last year
- ☆77 Updated this week
- ☆38 Updated 11 months ago
- ☆19 Updated last year
- ☆48 Updated last year
- Turing machines, Rule 110, and A::B reversal using Claude 3 Opus. ☆59 Updated 9 months ago
- One Line To Build Zero-Data Classifiers in Minutes ☆36 Updated 4 months ago
- GRDN.AI app for garden optimization ☆70 Updated last year
- ☆34 Updated last year
- Replace expensive LLM calls with finetunes automatically ☆62 Updated last year
- Sparse autoencoders for Contra text embedding models ☆25 Updated 9 months ago
- An example implementation of RLHF (or, more accurately, RLAIF) built on MLX and HuggingFace. ☆23 Updated 7 months ago
- Fast approximate inference on a single GPU with sparsity aware offloading ☆38 Updated last year