vikhyat / moondream
tiny vision language model
☆8,838 Updated last month
Alternatives and similar repositories for moondream
Users who are interested in moondream are comparing it to the libraries listed below.
- Zero-Shot Speech Editing and Text-to-Speech in the Wild ☆8,420 Updated 7 months ago
- Inference and training library for high-quality TTS models. ☆5,452 Updated 10 months ago
- Foundational model for human-like, expressive TTS ☆4,191 Updated last year
- Local AI API Platform ☆2,758 Updated 3 months ago
- Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audi… ☆9,022 Updated 2 weeks ago
- Blazingly fast LLM inference. ☆6,171 Updated this week
- A fast multimodal LLM for real-time voice ☆4,236 Updated last month
- ☆8,654 Updated last year
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale. ☆12,861 Updated last week
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens. ☆8,780 Updated last year
- ☆3,037 Updated last year
- On-device AI across mobile, embedded and edge for PyTorch ☆3,374 Updated this week
- The #1 open-source voice interface for desktop, mobile, and ESP32 chips. ☆5,093 Updated 11 months ago
- High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean. ☆6,881 Updated 10 months ago
- a state-of-the-art-level open visual language model | multimodal pretrained model ☆6,679 Updated last year
- Python bindings for llama.cpp ☆9,678 Updated 2 months ago
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models ☆6,007 Updated last year
- Yes, it's another chat over documents implementation... but this one is entirely local! ☆1,798 Updated 7 months ago
- Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models" ☆3,323 Updated last year
- An open source AI wearable device that captures what you say and hear in the real world and then transcribes and stores it on your own se… ☆3,305 Updated last year
- Large Language Model Text Generation Inference ☆10,605 Updated last month
- Large Action Model framework to develop AI Web Agents ☆6,185 Updated 9 months ago
- [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond. ☆23,822 Updated last year
- Open Source AI Platform - AI Chat with advanced features that works with every LLM ☆15,589 Updated this week
- Clarity AI | AI Image Upscaler & Enhancer - free and open-source Magnific Alternative ☆4,829 Updated 7 months ago
- An Open Source text-to-speech system built by inverting Whisper. ☆4,513 Updated 4 months ago
- Run PyTorch LLMs locally on servers, desktop and mobile ☆3,617 Updated last month
- Letta is the platform for building stateful agents: open AI with advanced memory that can learn and self-improve over time. ☆18,971 Updated this week
- Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/ ☆9,327 Updated 5 months ago
- Recipes for shrinking, optimizing, customizing cutting edge vision models. 💜 ☆1,642 Updated last month