tigrisdata-community / multi-modal-starter-kit
Multi-modal starter kit for AI video understanding and narration. Works with Ollama (Llava, bakllava) and GPT-4v
☆131 Updated 8 months ago
Alternatives and similar repositories for multi-modal-starter-kit
Users who are interested in multi-modal-starter-kit are comparing it to the repositories listed below
- ☆125 Updated last year
- AI agent to automatically check grammar and spelling on documentation files ☆87 Updated 8 months ago
- List of awesome projects powered by fal.ai ☆76 Updated 9 months ago
- Safely deploy OpenAI's Realtime APIs in less than 5 minutes! ☆156 Updated 8 months ago
- ☆28 Updated 6 months ago
- ☆108 Updated 4 months ago
- AI Device Template Featuring Whisper, TTS, Groq, Llama3, OpenAI and more ☆290 Updated 10 months ago
- A seamless matchmaking application that is programmed with Cohere Command R+, Stanford NLP DSPy framework, Weaviate Vector store and Crew… ☆59 Updated last year
- Demo of AI chatbot that predicts user message to generate response quickly. ☆101 Updated last year
- A feed of trending repos/models from GitHub, Replicate, HuggingFace, and Reddit. ☆132 Updated 8 months ago
- 🌸 The open framework for question answering fine-tuning LLMs on private data ☆69 Updated last year
- open-source browser extension that leverages the power of the AI to generate engaging replies for social media growth. ☆235 Updated last year
- Create and share chatbots with external knowledge ✨ ☆69 Updated 7 months ago
- A playground for creative exploration that uses SDXL Turbo. ☆222 Updated 2 weeks ago
- A couple scripts to grab stats from email ☆42 Updated 8 months ago
- Replicate Flux LoRA image editor. ☆51 Updated 9 months ago
- For LLMs to better code with Jina API ☆148 Updated last month
- The Moshi speech-to-speech model, deployed to Modal with a realtime CLI chat ☆57 Updated 8 months ago
- Useful resources for LLM-based Diarization and Transcription. ☆55 Updated 7 months ago
- olive-cli: an optionally 100% local lightweight agent system with batteries ☆13 Updated last week
- ☆47 Updated last year
- converts url content into JSON with a simple prefix ☆68 Updated last year
- ☆38 Updated 8 months ago
- Gradio UI for a Cog API ☆66 Updated last year
- A spotify playlist agent using CrewAI ☆81 Updated last year
- A relay server for OpenAI's realtime API, for Cloudflare Workers ☆140 Updated 6 months ago
- Turn text from websites into spoken audio with edge-tts, F5, etc. and save as mp3 files ☆47 Updated this week
- napkins.dev – from screenshot to app ☆86 Updated 8 months ago
- Extract information from any website by chatting with AI - Fork of Vercel AI Chatbot w/ Firecrawl Integrated ☆120 Updated 4 months ago
- Create keyboard shortcuts for an LLM using OpenAI GPT, Ollama, HuggingFace with Automator on macOS. ☆151 Updated last year