tigrisdata-community / multi-modal-starter-kit
Multi-modal starter kit for AI video understanding and narration. Works with Ollama (Llava, bakllava), GPT-4v
☆140 Updated last year
Alternatives and similar repositories for multi-modal-starter-kit
Users who are interested in multi-modal-starter-kit are comparing it to the libraries listed below.
- AI agent to automatically check grammar and spelling on documentation files ☆95 Updated 2 months ago
- A feed of trending repos/models from GitHub, Replicate, HuggingFace, and Reddit. ☆221 Updated last month
- List of awesome projects powered by fal.ai ☆108 Updated 7 months ago
- ☆30 Updated last year
- 🌸 The open framework for question answering fine-tuning LLMs on private data ☆69 Updated 2 years ago
- Demo of AI chatbot that predicts user message to generate response quickly. ☆105 Updated last year
- ☆124 Updated 2 years ago
- A spotify playlist agent using CrewAI ☆81 Updated last year
- ☆47 Updated last year
- A seamless matchmaking application that is programmed with Cohere Command R+, Stanford NLP DSPy framework, Weaviate Vector store and Crew… ☆59 Updated last year
- Create and share chatbots with external knowledge ✨ ☆69 Updated last year
- 🐝 Create powerful, collaborative AI applications. ☆65 Updated last year
- [WIP] AI Try-On plugin for Chrome ☆28 Updated last year
- AI agent workflow for generating profiles of clients and running research tasks for them. There is an agent for each part of the process:… ☆83 Updated last year
- ☆108 Updated last year
- converts url content into JSON with a simple prefix ☆73 Updated last year
- The very first artist assistant ☆23 Updated 2 years ago
- ☆82 Updated last year
- Useful resources for LLM-based Diarization and Transcription. ☆55 Updated last year
- The next evolution of Agents ☆48 Updated this week
- Summarize, Verify & Chat with any YouTube video in seconds. ☆174 Updated last year
- A couple scripts to grab stats from email ☆43 Updated last year
- A function to do all ☆34 Updated last year
- AI Device Template Featuring Whisper, TTS, Groq, Llama3, OpenAI and more ☆295 Updated last year
- open-source browser extension that leverages the power of the AI to generate engaging replies for social media growth. ☆242 Updated last year
- Gradio based tool to run opensource LLM models directly from Huggingface ☆97 Updated last year
- Gradio UI for a Cog API ☆70 Updated last year
- Replicate Flux LoRA image editor. ☆54 Updated last year
- https://narrateit.streamlit.app/ ☆39 Updated last year
- Record a sample of your own voice and let AI narrate the text in your own voice. ☆79 Updated 2 years ago