vikhyat / moondreamLinks
tiny vision language model
☆9,000Updated last month
Alternatives and similar repositories for moondream
Users that are interested in moondream are comparing it to the libraries listed below
Sorting:
- streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL☆2,647Updated this week
- A fast multimodal LLM for real-time voice☆4,283Updated this week
- Examples in the MLX framework☆8,044Updated 3 weeks ago
- Inference and training library for high-quality TTS models.☆5,495Updated last year
- Zero-Shot Speech Editing and Text-to-Speech in the Wild☆8,437Updated 9 months ago
- Everything about the SmolLM and SmolVLM family of models☆3,448Updated 3 weeks ago
- Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audi…☆9,184Updated 3 weeks ago
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆13,017Updated last week
- Foundational model for human-like, expressive TTS☆4,197Updated last year
- Local realtime voice AI☆2,386Updated 3 weeks ago
- Local AI API Platform☆2,764Updated 5 months ago
- ☆3,049Updated 3 weeks ago
- Blazingly fast LLM inference.☆6,280Updated this week
- Llama-3 agents that can browse the web by following instructions and talking to you☆1,407Updated last year
- g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains☆4,220Updated 3 months ago
- Perplexity Inspired Answer Engine☆5,009Updated 5 months ago
- ML-powered speech recognition directly in your browser☆3,184Updated last year
- Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.☆49,366Updated last week
- a state-of-the-art-level open visual language model | 多模态预训练模型☆6,710Updated last year
- Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.☆3,998Updated 11 months ago
- Fast and accurate automatic speech recognition (ASR) for edge devices☆3,024Updated 3 weeks ago
- Run PyTorch LLMs locally on servers, desktop and mobile☆3,624Updated 3 months ago
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.☆8,835Updated last year
- Large Action Model framework to develop AI Web Agents☆6,215Updated 10 months ago
- Modeling, training, eval, and inference code for OLMo☆6,220Updated 3 weeks ago
- Devon: An open-source pair programmer☆3,463Updated 6 months ago
- Run Mixtral-8x7B models in Colab or consumer desktops☆2,325Updated last year
- The #1 open-source voice interface for desktop, mobile, and ESP32 chips.☆5,099Updated last year
- LLocalSearch is a completely locally running search aggregator using LLM Agents. The user can ask a question and the system will use a ch…☆5,959Updated last week
- Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"☆7,023Updated 9 months ago