xISSAx / Alpha-Co-Vision

A real-time video caption to conversation bot that captures frames generates captions and creates conversational responses using a Large Language Models base to create interactive video descriptions.
β˜†119Updated last year

Related projects β“˜

Alternatives and complementary repositories for Alpha-Co-Vision