shahizat / Vision2Audio_2

Vision2Audio - Giving the blind an understanding through AI. Utilizing the LLaVA through MLC LLM to describe the image using Nvidia Riva Speech AI SDK
11Updated last year

Alternatives and similar repositories for Vision2Audio_2:

Users that are interested in Vision2Audio_2 are comparing it to the libraries listed below