lucataco / cog-moondream2Links
Cog wrapper for moondream2
☆13Updated 11 months ago
Alternatives and similar repositories for cog-moondream2
Users that are interested in cog-moondream2 are comparing it to the libraries listed below
Sorting:
- ☆12Updated last year
- ☆29Updated last year
- This repo lets you run mistral-7b in Google Colab.☆16Updated last year
- AI narrator☆15Updated last year
- Simple Video Summarization using Text-to-Segment Anything (Florence2 + SAM2) This project provides a video processing tool that utilizes…☆10Updated 4 months ago
- A function to do all☆36Updated last year
- huggingface chat-ui integration with mlx-lm server☆60Updated last year
- ☆58Updated 7 months ago
- ☆41Updated last year
- Small Multimodal Vision Model "Imp-v1-3b" trained using Phi-2 and Siglip.☆17Updated last year
- Use this code to access pipeline to Gemini from inside notebookLM☆29Updated last year
- Medical Mixture of Experts LLM using Mergekit.☆20Updated last year
- Chatbot for The Carbon Almanac book or a climate change related topic☆14Updated 2 years ago
- [WIP] AI Try-On plugin for Chrome☆27Updated last year
- This repository will guide you to create your Images via Stable Diffusion using a Smart Virtual Assistant like Google Assistant using Ope…☆35Updated 2 years ago
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆81Updated last year
- Submodule for Grounded-SAM☆12Updated 2 years ago
- BH hackathon☆14Updated last year
- AI-powered image editor☆46Updated 2 years ago
- ☆11Updated last year
- LoRA Explorer model to test with LoRAs using Flux.1[Dev] as the base model☆50Updated 9 months ago
- ☆20Updated last year
- On-device LLM Inference using Mediapipe LLM Inference API.☆22Updated last year
- Google's Gemini implemented with GPT-4 Vision, Whisper and Resemble AI☆26Updated last year
- NewsAgent is an enterprise-grade news aggregation agent designed to fetch, query, and summarize news from multiple sources at scale.☆18Updated 2 weeks ago
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an…☆13Updated 2 weeks ago
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆22Updated 9 months ago
- Cog wrapper for collabora/WhisperSpeech☆25Updated last year
- ☆46Updated last year
- 🚀 End-to-end examples and analysis of deploying LLMs serverless using Modal, Runpod, and Beam☆28Updated last year