tigrisdata-community / multi-modal-starter-kitLinks
Multi-modal starter kit for AI video understanding and narration. Works with Ollama (Llava, bakllava), GPT-4v
β138Updated last year
Alternatives and similar repositories for multi-modal-starter-kit
Users that are interested in multi-modal-starter-kit are comparing it to the libraries listed below
Sorting:
- AI agent to automatically check grammar and spelling on documentation filesβ93Updated 4 months ago
- πΈ The open framework for question answering fine-tuning LLMs on private dataβ69Updated 2 years ago
- A feed of trending repos/models from GitHub, Replicate, HuggingFace, and Reddit.β135Updated 5 months ago
- Demo of AI chatbot that predicts user message to generate response quickly.β103Updated last year
- β30Updated 11 months ago
- Create and share chatbots with external knowledge β¨β70Updated last year
- β47Updated last year
- A function to do allβ35Updated last year
- Build Web Datasets with Easeβ33Updated last year
- AI Device Template Featuring Whisper, TTS, Groq, Llama3, OpenAI and moreβ293Updated last year
- converts url content into JSON with a simple prefixβ71Updated last year
- [WIP] AI Try-On plugin for Chromeβ28Updated last year
- A spotify playlist agent using CrewAI