tigrisdata-community / multi-modal-starter-kit
Multi-modal starter kit for AI video understanding and narration. Works with Ollama (Llava, bakllava), GPT-4v
☆129Updated 7 months ago
Alternatives and similar repositories for multi-modal-starter-kit:
Users that are interested in multi-modal-starter-kit are comparing it to the libraries listed below
- AI agent to automatically check grammar and spelling on documentation files☆86Updated 7 months ago
- ☆125Updated last year
- A feed of trending repos/models from GitHub, Replicate, HuggingFace, and Reddit.☆133Updated 7 months ago
- Demo of AI chatbot that predicts user message to generate response quickly.☆101Updated last year
- converts url content into JSON with a simple prefix☆68Updated 11 months ago
- List of awesome projects powered by fal.ai☆73Updated 8 months ago
- ☆29Updated 5 months ago
- 🌸 The open framework for question answering fine-tuning LLMs on private data☆69Updated last year
- open-source browser extension that leverages the power of the AI to generate engaging replies for social media growth.☆234Updated last year
- ☆47Updated last year
- AI Device Template Featuring Whisper, TTS, Groq, Llama3, OpenAI and more☆289Updated 9 months ago
- A spotify playlist agent using CrewAI☆81Updated 11 months ago
- A seamless matchmaking application that is programmed with Cohere Command R+, Stanford NLP DSPy framework, Weaviate Vector store and Crew…☆59Updated last year
- A function to do all☆36Updated last year
- Replicate Flux LoRA image editor.☆51Updated 8 months ago
- A couple scripts to grab stats from email☆42Updated 7 months ago
- Your automated SWE fleet to get your tickets from the Backlog to Prod!☆96Updated last year
- auto fine tune of models with synthetic data☆75Updated last year
- Choose a topic, a music genre and wait for the agents to generate a song☆55Updated 10 months ago
- ☆22Updated 10 months ago
- An Open Source Playground with Agent Datasets and APIs for building and testing your own Autonomous Web Agents☆191Updated last year
- Useful resources for LLM-based Diarization and Transcription.☆55Updated 6 months ago
- Extract information from any website by chatting with AI - Fork of Vercel AI Chatbot w/ Firecrawl Integrated☆116Updated 3 months ago
- For LLMs to better code with Jina API☆146Updated 2 weeks ago
- ☆171Updated 8 months ago
- A desktop for AI agents☆142Updated last week
- Turn text from websites into spoken audio with edge-tts, F5, etc. and save as mp3 files☆47Updated 2 months ago
- Safely deploy OpenAI's Realtime APIs in less than 5 minutes!☆155Updated 7 months ago
- Chat interface that searches the web for you real-time☆95Updated 6 months ago
- a minimalistic template for dynamic self-building AI agents☆97Updated 3 months ago