unum-cloud / UFormLinks
Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and π video, up to 5x faster than OpenAI CLIP and LLaVA πΌοΈ & ποΈ
β1,215Updated 2 months ago
Alternatives and similar repositories for UForm
Users that are interested in UForm are comparing it to the libraries listed below
Sorting:
- β717Updated last year
- CLIP inference in plain C/C++ with no extra dependenciesβ548Updated 7 months ago
- Automatically create Faiss knn indices with the most optimal similarity search parameters.β893Updated 2 months ago
- Fine-tune mistral-7B on 3090s, a100s, h100sβ724Updated 2 years ago
- Fast Open-Source Search & Clustering engine Γ for Vectors & Arbitrary Objects Γ in C++, C, Python, JavaScript, Rust, Java, Objective-C, Sβ¦β3,712Updated 3 weeks ago
- C++ implementation for BLOOMβ809Updated 2 years ago
- β1,283Updated 2 years ago
- The simplest way to serve AI/ML models in productionβ1,109Updated this week
- A FastAPI service for semantic text search using precomputed embeddings and advanced similarity measures, with built-in support for varioβ¦β1,043Updated 11 months ago
- This repository contains the official implementation of the research papers, "MobileCLIP" CVPR 2024 and "MobileCLIP2" TMLR August 2025β1,399Updated 3 months ago
- β1,026Updated 2 years ago
- TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbonesβ1,306Updated last year
- The implementation of "Prismer: A Vision-Language Model with Multi-Task Experts".β1,309Updated 2 years ago
- fastest vector database made in numpyβ767Updated 3 months ago
- Accelerate your Hugging Face Transformers 7.6-9x. Native to Hugging Face and PyTorch.β686Updated last year
- The repository for the code of the UltraFastBERT paperβ519Updated last year
- Training LLMs with QLoRA + FSDPβ1,536Updated last year
- NeurIPS 2025 Spotlight; ICLR2024 Spotlight; CVPR 2024; EMNLP 2024β1,803Updated 2 months ago
- Explore and interpret large embeddings in your browser with interactive visualization! πβ512Updated 5 months ago
- Scale LLM Engine public repositoryβ819Updated this week
- Train Models Contrastively in Pytorchβ772Updated 10 months ago
- π€ A PyTorch library of curated Transformer models and their composable componentsβ894Updated last year
- Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.β867Updated 2 years ago
- Effort to open-source NLLB checkpoints.β474Updated last year
- BentoDiffusion: A collection of diffusion models served with BentoMLβ380Updated 8 months ago
- Public repo for the NeurIPS 2023 paper "Unlimiformer: Long-Range Transformers with Unlimited Length Input"β1,065Updated last year
- Tune any FALCON in 4-bitβ463Updated 2 years ago
- LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructionsβ823Updated 2 years ago
- Collections of vector search related libraries, service and research papersβ1,542Updated last year
- A Python vector database you just need - no more, no less.β640Updated last year