tigrisdata-community / multi-modal-starter-kitView on GitHub
Multi-modal starter kit for AI video understanding and narration. Works with Ollama (Llava, bakllava), GPT-4v
140Sep 9, 2024Updated last year

Alternatives and similar repositories for multi-modal-starter-kit

Users that are interested in multi-modal-starter-kit are comparing it to the libraries listed below

Sorting:

Are these results useful?