IDEA-CCNL / Real-Gemini

A framework for real-time video understanding and interaction built on a large multi-modal model, supporting question answering and conversation about the world through text, audio, image, and video.
22 · Updated last year

Alternatives and similar repositories for Real-Gemini:

Users who are interested in Real-Gemini are comparing it to the libraries listed below.