unum-cloud / uform
Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and π video, up to 5x faster than OpenAI CLIP and LLaVA πΌοΈ & ποΈ
β1,094Updated last month
Alternatives and similar repositories for uform:
Users that are interested in uform are comparing it to the libraries listed below
- Fast Open-Source Search & Clustering engine Γ for Vectors & π Strings Γ in C++, C, Python, JavaScript, Rust, Java, Objective-C, Swift, Cβ¦β2,460Updated last week
- CLIP inference in plain C/C++ with no extra dependenciesβ476Updated 5 months ago
- Automatically create Faiss knn indices with the most optimal similarity search parameters.β833Updated 8 months ago
- β707Updated 11 months ago
- β1,271Updated last year
- π€ A PyTorch library of curated Transformer models and their composable componentsβ878Updated 9 months ago
- LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructionsβ817Updated last year
- BentoDiffusion: A collection of diffusion models served with BentoMLβ349Updated this week
- C++ implementation for BLOOMβ810Updated last year
- [NeurIPS 2023] MeZO: Fine-Tuning Language Models with Just Forward Passes. https://arxiv.org/abs/2305.17333β1,081Updated last year
- Effort to open-source NLLB checkpoints.β436Updated 8 months ago
- Inference code for Persimmon-8Bβ416Updated last year
- Fine-tune mistral-7B on 3090s, a100s, h100sβ705Updated last year
- ICLR2024 Spotlight: curation/training code, metadata, distribution and pre-trained models for MetaCLIP; CVPR 2024: MoDE: CLIP Data Expertβ¦β1,349Updated 2 months ago
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMsβ2,355Updated this week
- MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX.β828Updated last week
- Tune any FALCON in 4-bitβ466Updated last year
- Ungreedy subword tokenizer and vocabulary trainer for Python, Go & Javascriptβ565Updated 7 months ago
- An Open-source Toolkit for LLM Developmentβ2,756Updated last month
- TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbonesβ1,260Updated 9 months ago
- The implementation of "Prismer: A Vision-Language Model with Multi-Task Experts".β1,306Updated last year
- Python bindings for the Transformer models implemented in C/C++ using GGML library.β1,838Updated last year
- Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.β852Updated last year
- INT4/INT5/INT8 and FP16 inference on CPU for RWKV language modelβ1,468Updated 3 weeks ago
- Exact structure out of any language model completion.β506Updated last year
- A FastAPI service for semantic text search using precomputed embeddings and advanced similarity measures, with built-in support for varioβ¦β998Updated 4 months ago
- Chat language model that can use tools and interpret the resultsβ1,513Updated this week
- Customizable implementation of the self-instruct paper.β1,038Updated 11 months ago
- Code for the paper "ViperGPT: Visual Inference via Python Execution for Reasoning"β1,675Updated last year
- This repository contains the official implementation of the research paper, "MobileCLIP: Fast Image-Text Models through Multi-Modal Reinfβ¦β817Updated 2 months ago