GPU accelerated client-side embeddings for vector search, RAG etc.
☆65Dec 4, 2023Updated 2 years ago
Alternatives and similar repositories for embd
Users that are interested in embd are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Add local LLMs to your Web or Electron apps! Powered by Rust + WebGPU☆104Jun 10, 2023Updated 3 years ago
- Supporting code for "LLMs for your iPhone: Whole-Tensor 4 Bit Quantization"☆11Mar 31, 2024Updated 2 years ago
- Implementation of ModernBERT in MLX☆21Jan 7, 2026Updated 5 months ago
- trying to make WebGPU a bit easier to use☆19Jan 9, 2024Updated 2 years ago
- An extensible CLI for integrating LLM models with a flexible scripting system☆22Jul 1, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Your one stop CLI for ONNX model analysis.☆47Nov 13, 2022Updated 3 years ago
- A cross-platform browser ML framework.☆763May 26, 2026Updated 2 weeks ago
- Latent Large Language Models☆19Aug 24, 2024Updated last year
- Observability World for WASI☆27Mar 5, 2025Updated last year
- Vector functions and indexing for SQLite☆10Mar 26, 2023Updated 3 years ago
- Semantic search webassembly module☆18Oct 13, 2024Updated last year
- Just another sentiment wrapper.☆18Dec 11, 2021Updated 4 years ago
- Audio transcription using mlx whisper and vad silence processing☆17Oct 14, 2024Updated last year
- ☆22Jun 27, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- run embeddings in MLX☆97Sep 27, 2024Updated last year
- Profile your CoreML models directly from Python 🐍☆30Sep 8, 2025Updated 9 months ago
- Zig compiler compiled to WASM☆19Apr 17, 2024Updated 2 years ago
- ☆13Nov 27, 2025Updated 6 months ago
- ☆18Apr 30, 2025Updated last year
- LiT (Zero-Shot Transfer with Locked-image text Tuning) image and text encoder models, working in the browser☆11May 16, 2022Updated 4 years ago
- Experimental wasm32-unknown-wasi runtime for Python code execution☆40Nov 28, 2024Updated last year
- Tool for visual profiling Core ML models, compatible with both package and compiled versions, including reasons for unsupported operation…☆39Jun 18, 2024Updated last year
- CosmWasm + zkVM RISC-V EFI template☆23Oct 20, 2022Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- automatic sentence highlights based on their significance to the document☆196Nov 22, 2023Updated 2 years ago
- Advanced Ultra-Low Bitrate Compression Techniques for the LLaMA Family of LLMs☆110Jan 11, 2024Updated 2 years ago
- Have UV deal with all your Jupyter deps.☆28Sep 7, 2024Updated last year
- Rust bindings for CTranslate2☆14Jun 21, 2023Updated 2 years ago
- Mine conversations from novels in Project Gutenberg, to generate data for data-driven dialogue systems.☆15May 7, 2019Updated 7 years ago
- Implementation of go-diff's diffmatchpatch in Zig☆32May 31, 2026Updated 2 weeks ago
- A fast, simple, multi-threaded string interning library.☆18Jul 11, 2025Updated 11 months ago
- utilities for loading and running text embeddings with onnx☆45Aug 16, 2025Updated 9 months ago
- Library for fast text representation and classification.☆31Jan 9, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Training code for Sparse Autoencoders on Embedding models☆39May 9, 2026Updated last month
- Example using hyperdb with webrtc swarm☆10Jul 17, 2018Updated 7 years ago
- Genie in the Box: Distill Whisper STT => Mistral-7B => Phind/Phind-CodeLlama-34B-v2 => GPT 3.5 => Coqui's TTS/OpenAI TTS☆15Apr 24, 2024Updated 2 years ago
- Try to export the ONNX QDQ model that conforms to the AXERA NPU quantization specification. Currently, only w8a8 is supported.☆11Sep 10, 2024Updated last year
- ☆72Mar 15, 2024Updated 2 years ago
- Barebones Rust EVM Implementation☆12Feb 9, 2022Updated 4 years ago
- ☆11Dec 23, 2023Updated 2 years ago