vikhyat / moondream
tiny vision language model
☆6,732Updated this week
Alternatives and similar repositories for moondream:
Users that are interested in moondream are comparing it to the libraries listed below
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆11,197Updated this week
- Go ahead and axolotl questions☆8,293Updated this week
- Run PyTorch LLMs locally on servers, desktop and mobile☆3,462Updated this week
- Composable building blocks to build Llama Apps☆6,036Updated this week
- Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sag…☆16,235Updated this week
- ☆7,156Updated this week
- Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks☆5,974Updated 2 months ago
- OCR, layout analysis, reading order, table recognition in 90+ languages☆15,474Updated this week
- Letta (formerly MemGPT) is a framework for creating LLM services with memory.☆13,996Updated this week
- Inference and training library for high-quality TTS models.☆4,910Updated last month
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.☆9,326Updated 6 months ago
- Zero-Shot Speech Editing and Text-to-Speech in the Wild☆8,011Updated 6 months ago
- Build multi-modal Agents with memory, knowledge, tools and reasoning. Chat with them using a beautiful Agent UI.☆17,869Updated this week
- Build a Perplexity-Inspired Answer Engine Using Next.js, Groq, Llama-3, Langchain, OpenAI, Upstash, Brave & Serper☆4,775Updated 3 months ago
- Blazingly fast LLM inference.☆4,826Updated this week
- 🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.☆21,693Updated this week
- Large Action Model framework to develop AI Web Agents☆5,807Updated 2 months ago
- A language model programming library.☆5,556Updated 3 weeks ago
- Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚☆18,680Updated this week
- A simple screen parsing tool towards pure vision based GUI agent☆5,509Updated last week
- A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.☆9,387Updated this week
- Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/☆7,430Updated this week
- We write your reusable computer vision tools. 💜☆24,649Updated this week
- Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI☆18,641Updated this week
- Accepted as [NeurIPS 2024] Spotlight Presentation Paper☆6,116Updated 3 months ago
- DSPy: The framework for programming—not prompting—language models☆21,018Updated this week
- Retrieval Augmented Generation (RAG) chatbot powered by Weaviate☆6,639Updated this week
- SGLang is a fast serving framework for large language models and vision language models.☆7,353Updated this week
- [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.☆21,096Updated 5 months ago
- a state-of-the-art-level open visual language model | 多模态预训练模型☆6,278Updated 7 months ago