unum-cloud / uformLinks
Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and π video, up to 5x faster than OpenAI CLIP and LLaVA πΌοΈ & ποΈ
β1,148Updated 5 months ago
Alternatives and similar repositories for uform
Users that are interested in uform are comparing it to the libraries listed below
Sorting:
- Fast Open-Source Search & Clustering engine Γ for Vectors & Arbitrary Objects Γ in C++, C, Python, JavaScript, Rust, Java, Objective-C, Sβ¦β2,795Updated last week
- β710Updated last year
- Fine-tune mistral-7B on 3090s, a100s, h100sβ714Updated last year
- CLIP inference in plain C/C++ with no extra dependenciesβ502Updated last week
- Build robust LLM applications with true composability πβ419Updated last year
- ICLR2024 Spotlight: curation/training code, metadata, distribution and pre-trained models for MetaCLIP; CVPR 2024: MoDE: CLIP Data Expertβ¦β1,460Updated 3 months ago
- Automatically create Faiss knn indices with the most optimal similarity search parameters.β857Updated last year
- LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructionsβ819Updated 2 years ago
- Collections of vector search related libraries, service and research papersβ1,504Updated 10 months ago
- Blazing fast framework for fine-tuning similarity learning modelsβ656Updated 2 months ago
- Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.β3,887Updated 5 months ago
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-β¦β3,511Updated last month
- Accelerate your Hugging Face Transformers 7.6-9x. Native to Hugging Face and PyTorch.β683Updated 10 months ago
- C++ implementation for BLOOMβ810Updated 2 years ago
- π°οΈ An approximate nearest-neighbor search library for Python and Java with a focus on ease of use, simplicity, and deployability.β1,461Updated 2 months ago
- A hyper-fast local vector database for use with LLM Agents. Now accepting SAFEs at $135M cap.β1,399Updated 4 months ago
- Lite & Super-fast re-ranking for your search & retrieval pipelines. Supports SoTA Listwise and Pairwise reranking based on LLMs and croβ¦β815Updated 6 months ago
- A FastAPI service for semantic text search using precomputed embeddings and advanced similarity measures, with built-in support for varioβ¦β1,016Updated 3 months ago
- WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide a seamless conversations with an AI.β1,613Updated 10 months ago
- π€ A PyTorch library of curated Transformer models and their composable componentsβ891Updated last year
- The implementation of "Prismer: A Vision-Language Model with Multi-Task Experts".β1,307Updated last year
- Tune any FALCON in 4-bitβ467Updated last year
- Fast, Accurate, Lightweight Python library to make State of the Art Embeddingβ2,148Updated last week
- Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpaliβ2,252Updated 2 weeks ago
- Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.β857Updated last year
- Training LLMs with QLoRA + FSDPβ1,485Updated 7 months ago
- Things you can do with the token embeddings of an LLMβ1,445Updated 2 months ago
- [ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddingsβ1,966Updated 5 months ago
- Port of MiniGPT4 in C++ (4bit, 5bit, 6bit, 8bit, 16bit CPU inference with GGML)β567Updated last year
- β2,965Updated 9 months ago