semaj87 / image-to-text-to-speech

An app that uses Hugging Face AI models together with OpenAI and LangChain to generate text from an image, then generates audio from that text
13 · Updated 11 months ago

Related projects

Alternatives and complementary repositories for image-to-text-to-speech