unum-cloud / uformLinks
Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and π video, up to 5x faster than OpenAI CLIP and LLaVA πΌοΈ & ποΈ
β1,217Updated 3 months ago
Alternatives and similar repositories for uform
Users that are interested in uform are comparing it to the libraries listed below
Sorting:
- β716Updated last year
- CLIP inference in plain C/C++ with no extra dependenciesβ552Updated 7 months ago
- A FastAPI service for semantic text search using precomputed embeddings and advanced similarity measures, with built-in support for varioβ¦β1,043Updated 11 months ago
- Fast Open-Source Search & Clustering engine Γ for Vectors & Arbitrary Objects Γ in C++, C, Python, JavaScript, Rust, Java, Objective-C, Sβ¦β3,763Updated 2 weeks ago
- β1,282Updated 2 years ago
- Automatically create Faiss knn indices with the most optimal similarity search parameters.β894Updated 3 months ago
- Accelerate your Hugging Face Transformers 7.6-9x. Native to Hugging Face and PyTorch.β685Updated last year
- NeurIPS 2025 Spotlight; ICLR2024 Spotlight; CVPR 2024; EMNLP 2024β1,810Updated 2 months ago
- Run inference on MPT-30B using CPUβ576Updated 2 years ago
- TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbonesβ1,307Updated this week
- fastest vector database made in numpyβ767Updated 4 months ago
- Explore and interpret large embeddings in your browser with interactive visualization! πβ514Updated 2 weeks ago
- Blazing fast framework for fine-tuning similarity learning modelsβ662Updated last month
- This repository contains the official implementation of the research papers, "MobileCLIP" CVPR 2024 and "MobileCLIP2" TMLR August 2025β1,416Updated 4 months ago
- C++ implementation for BLOOMβ809Updated 2 years ago
- This repository contains the official implementation of the research paper, "FastViT: A Fast Hybrid Vision Transformer using Structural Rβ¦β1,987Updated 2 years ago
- Inference code for Persimmon-8Bβ412Updated 2 years ago
- Things you can do with the token embeddings of an LLMβ1,452Updated 2 months ago
- π€ A PyTorch library of curated Transformer models and their composable componentsβ894Updated last year
- Train Models Contrastively in Pytorchβ774Updated 10 months ago
- Exact structure out of any language model completion.β514Updated 2 years ago
- β401Updated last year
- Training LLMs with QLoRA + FSDPβ1,539Updated last year
- Fine-tune mistral-7B on 3090s, a100s, h100sβ725Updated 2 years ago
- Tune any FALCON in 4-bitβ463Updated 2 years ago
- Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.β867Updated 2 years ago
- β748Updated last year
- A SQLite extension for efficient vector search, based on Faiss!β1,965Updated last year
- Public repo for the NeurIPS 2023 paper "Unlimiformer: Long-Range Transformers with Unlimited Length Input"β1,066Updated last year
- Ungreedy subword tokenizer and vocabulary trainer for Python, Go & Javascriptβ616Updated last year