tigrisdata-community / multi-modal-starter-kitLinks
Multi-modal starter kit for AI video understanding and narration. Works with Ollama (Llava, bakllava), GPT-4v
☆140Updated last year
Alternatives and similar repositories for multi-modal-starter-kit
Users that are interested in multi-modal-starter-kit are comparing it to the libraries listed below
Sorting:
- Demo of AI chatbot that predicts user message to generate response quickly.☆104Updated last year
- AI agent to automatically check grammar and spelling on documentation files☆93Updated last month
- A feed of trending repos/models from GitHub, Replicate, HuggingFace, and Reddit.☆139Updated 6 months ago
- 🌸 The open framework for question answering fine-tuning LLMs on private data☆69Updated 2 years ago
- ☆30Updated last year
- ☆125Updated last year
- List of awesome projects powered by fal.ai☆104Updated 5 months ago
- AI agent workflow for generating profiles of clients and running research tasks for them. There is an agent for each part of the process:…☆82Updated last year
- [WIP] AI Try-On plugin for Chrome☆28Updated last year
- ☆47Updated last year
- A spotify playlist agent using CrewAI☆82Updated last year
- A browser extension that demos Gemini Nano via window.ai and Cartesia TTS ⚡️☆38Updated last year
- ☆107Updated 11 months ago
- converts url content into JSON with a simple prefix☆71Updated last year
- A seamless matchmaking application that is programmed with Cohere Command R+, Stanford NLP DSPy framework, Weaviate Vector store and Crew…☆59Updated last year
- A couple scripts to grab stats from email☆43Updated last year
- Turn text from websites into spoken audio with edge-tts, F5, etc. and save as mp3 files☆46Updated 6 months ago
- ☆22Updated last year
- Mobile web app for audio "push-to-talk" + TTS chat interface with OpenAI-like APIs☆43Updated last year
- Useful resources for LLM-based Diarization and Transcription.☆55Updated last year
- Create and share chatbots with external knowledge ✨☆70Updated last year
- ☆45Updated last year
- The next evolution of Agents☆48Updated 3 weeks ago
- AI Device Template Featuring Whisper, TTS, Groq, Llama3, OpenAI and more☆293Updated last year
- ☆171Updated last year
- ActBot is a prototype for an injectable chatbot to give any website agentic capabilities☆57Updated last year
- Summarize, Verify & Chat with any YouTube video in seconds.☆174Updated 10 months ago
- The Moshi speech-to-speech model, deployed to Modal with a realtime CLI chat☆59Updated last year
- Gradio based tool to run opensource LLM models directly from Huggingface☆96Updated last year
- Code Interpreter Replica☆26Updated 2 years ago