vikhyat / moondream
tiny vision language model
☆8,917 Updated this week
Alternatives and similar repositories for moondream
Users who are interested in moondream are comparing it to the libraries listed below.
- Zero-Shot Speech Editing and Text-to-Speech in the Wild ☆8,441 Updated 8 months ago
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale. ☆12,918 Updated last week
- Go ahead and axolotl questions ☆10,798 Updated this week
- Inference and training library for high-quality TTS models. ☆5,480 Updated 11 months ago
- Large World Model -- Modeling Text and Video with Millions Context ☆7,371 Updated last year
- High-resolution models for human tasks. ☆5,211 Updated last year
- Foundational model for human-like, expressive TTS ☆4,195 Updated last year
- Blazingly fast LLM inference. ☆6,230 Updated this week
- Everything about the SmolLM and SmolVLM family of models ☆3,408 Updated 2 months ago
- Tools for merging pretrained large language models. ☆6,447 Updated 2 weeks ago
- Local AI API Platform ☆2,764 Updated 4 months ago
- Official Code for Stable Cascade ☆6,583 Updated last year
- Run PyTorch LLMs locally on servers, desktop and mobile ☆3,617 Updated 2 months ago
- streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL ☆2,640 Updated last week
- Ollama Python library ☆8,862 Updated this week
- A fast multimodal LLM for real-time voice ☆4,258 Updated 2 months ago
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models ☆6,063 Updated last year
- Local realtime voice AI ☆2,377 Updated 8 months ago
- OCR, layout analysis, reading order, table recognition in 90+ languages ☆18,888 Updated 3 weeks ago
- ☆3,038 Updated last year
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens. ☆8,799 Updated last year
- Python scraper based on AI ☆21,793 Updated last week
- PyTorch code and models for V-JEPA self-supervised learning from video. ☆3,270 Updated 8 months ago
- PyTorch native post-training library ☆5,595 Updated this week
- ☆8,735 Updated 3 weeks ago
- High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean. ☆6,934 Updated 10 months ago
- A vector search SQLite extension that runs anywhere! ☆6,412 Updated 9 months ago
- High-speed Large Language Model Serving for Local Deployment ☆8,388 Updated 3 months ago
- Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models" ☆3,327 Updated last year
- Distributed LLM inference. Connect home devices into a powerful cluster to accelerate LLM inference. More devices means faster inference. ☆2,743 Updated 2 weeks ago