yaman / fashion-clip-rsLinks
A complete(grpc service and lib) Rust inference with multilingual embedding support. This version leverages the power of Rust for both GRPC services and as a standalone library, providing highly efficient text and image embeddings.
☆39Updated 11 months ago
Alternatives and similar repositories for fashion-clip-rs
Users that are interested in fashion-clip-rs are comparing it to the libraries listed below
Sorting:
- High-performance MinHash implementation in Rust with Python bindings for efficient similarity estimation and deduplication of large datas…☆193Updated 3 weeks ago
- ☆137Updated last year
- Inference engine for GLiNER models, in Rust☆64Updated last month
- A single-binary, GPU-accelerated LLM server (HTTP and WebSocket API) written in Rust☆80Updated last year
- Unofficial python bindings for the rust llm library. 🐍❤️🦀☆75Updated last year
- Fast and versatile tokenizer for language models, compatible with SentencePiece, Tokenizers, Tiktoken and more. Supports BPE, Unigram and…☆28Updated 4 months ago
- ☆156Updated 2 years ago
- Using modal.com to process FineWeb-edu data☆20Updated 4 months ago
- implement llava using candle☆15Updated last year
- Modular Rust transformer/LLM library using Candle☆36Updated last year
- ☆26Updated 7 months ago
- ☆13Updated last year
- A client library in Rust for Nvidia Triton.☆30Updated 2 years ago
- utilities for loading and running text embeddings with onnx☆44Updated last year
- Contextualized per-token embeddings☆27Updated 2 months ago
- ⚡️Lightning fast in-memory VectorDB written in rust🦀☆22Updated 5 months ago
- Notebooks using the Neural Magic libraries 📓☆40Updated last year
- 🚀 End-to-end examples and analysis of deploying LLMs serverless using Modal, Runpod, and Beam☆28Updated last year
- Fast serverless LLM inference, in Rust.☆88Updated 5 months ago
- TensorRT-LLM server with Structured Outputs (JSON) built with Rust☆58Updated 3 months ago
- GPU accelerated client-side embeddings for vector search, RAG etc.☆66Updated last year
- Tantivy directory implementation backed by object_store☆36Updated last year
- Neural search for web-sites, docs, articles - online!☆136Updated last week
- 🕹️ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.☆137Updated last year
- ☆130Updated last year
- HSNW module for Redis☆57Updated 4 years ago
- GGML implementation of BERT model with Python bindings and quantization.☆57Updated last year
- Tree-based indexes for neural-search☆32Updated last year
- This repository has code for fine-tuning LLMs with GRPO specifically for Rust Programming using cargo as feedback☆100Updated 5 months ago
- Optimizing bit-level Jaccard Index and Population Counts for large-scale quantized Vector Search via Harley-Seal CSA and Lookup Tables☆20Updated 2 months ago