win4r / VideoFinder-Llama3.2-vision-Ollama

VideoFinder is a video analysis tool powered by multimodal AI, designed to help users locate and identify specific objects or people within video content. By combining the Llama 3.2 Vision model with a streamlined web interface, it enables real-time, frame-by-frame video analysis with natural language descriptions.
59 · Updated last week

Related projects

Alternatives and complementary repositories for VideoFinder-Llama3.2-vision-Ollama