tigrisdata-community / multi-modal-starter-kit
Multi-modal starter kit for AI video understanding and narration. Works with Ollama (LLaVA, BakLLaVA) and GPT-4V.
☆136Updated 11 months ago
Alternatives and similar repositories for multi-modal-starter-kit
Users who are interested in multi-modal-starter-kit are comparing it to the repositories listed below.
- AI agent to automatically check grammar and spelling on documentation files☆90Updated last month
- List of awesome projects powered by fal.ai☆91Updated last month
- ☆30Updated 8 months ago
- A seamless matchmaking application that is programmed with Cohere Command R+, Stanford NLP DSPy framework, Weaviate Vector store and Crew…☆59Updated last year
- A feed of trending repos/models from GitHub, Replicate, HuggingFace, and Reddit.☆133Updated 2 months ago
- Demo of an AI chatbot that predicts the user's message to generate responses quickly.☆104Updated last year
- Create and share chatbots with external knowledge ✨☆69Updated 10 months ago
- ☆125Updated last year
- A Spotify playlist agent using CrewAI☆82Updated last year
- ☆47Updated last year
- 🌸 The open framework for question answering fine-tuning LLMs on private data☆69Updated last year
- AI Device Template Featuring Whisper, TTS, Groq, Llama3, OpenAI and more☆293Updated last year
- A couple of scripts to grab stats from email☆43Updated 11 months ago
- Open-source browser extension that uses AI to generate engaging replies for social media growth.☆239Updated last year
- Chat with your git repo☆159Updated last year
- 🐝 Create powerful, collaborative AI applications.☆64Updated 9 months ago
- AI assistant that Intuitively Adapts to You☆81Updated last year
- Converts URL content into JSON with a simple prefix☆71Updated last year
- Create keyboard shortcuts for an LLM using OpenAI GPT, Ollama, HuggingFace with Automator on macOS.☆152Updated last year
- A function to do all☆35Updated last year
- Record voice notes & transcribe, summarize, and get tasks☆42Updated last year
- Summarize, Verify & Chat with any YouTube video in seconds.☆171Updated 6 months ago
- ☆11Updated last year
- Turn text from websites into spoken audio with edge-tts, F5, etc. and save as mp3 files☆47Updated 2 months ago
- ☆172Updated last year
- Useful resources for LLM-based Diarization and Transcription.☆55Updated 10 months ago
- ☆72Updated last year
- The very first artist assistant☆22Updated 2 years ago
- Gradio-based tool to run open-source LLMs directly from Hugging Face☆95Updated last year
- AI agent workflow for generating profiles of clients and running research tasks for them. There is an agent for each part of the process:…☆82Updated 10 months ago