zyxcambridge / GPT4O
复现GPT4O的实时视频和音频理解
☆11Updated 10 months ago
Alternatives and similar repositories for GPT4O:
Users that are interested in GPT4O are comparing it to the libraries listed below
- An open-source chat text to control actions agentic workflow framework/showcase powered by Agently AI application development framework.☆27Updated 7 months ago
- EdgeInfer enables efficient edge intelligence by running small AI models, including embeddings and OnnxModels, on resource-constrained de…☆44Updated last year
- 视频理解:千问视频多模态模型 & Dify☆50Updated 7 months ago
- ☆26Updated 8 months ago
- ☆33Updated 4 months ago
- Sora 中文的提示词 | 短视频提示词(prompt)技巧 | 调教指南。各种场景使用指南。学习怎么让它听你的话。兼顾了 Sora 的多场景应用。☆38Updated this week
- GLM Series Edge Models☆136Updated 2 months ago
- ☆11Updated 2 years ago
- 一个基于Together AI的强大图像生成工具,支持文生图、图生图和提示词分析功能。☆24Updated 5 months ago
- TPO 是一个优化 LLM 输出文本的框架,通过迭代反馈和优化提示的方式来“微调模型”,而非直接调整模型的参数,使模型在推理过程中与人类偏好对齐以生成更好的结果。本项目提供了一个友好的 WebUI 来加载模型,实时优化基础模型并展示最佳结果。☆10Updated 2 months ago
- ☆19Updated last year
- WIP. Apps (100+) + AI.☆28Updated 7 months ago
- Real-time video understanding and interaction through text,audio,image and video with large multi-modal model. 利用多模态大模型的实时视频理解和交互框架,通过文本…☆23Updated last year
- EfficientSAM + YOLO World base model for use with Autodistill.☆10Updated last year
- Exploration of the multi modal fuyu-8b model of Adept. 🤓 🔍☆28Updated last year
- ☆27Updated 2 months ago
- Exploration: using technology to aid people who lack both the ability to speak and fine motor control.☆20Updated 6 months ago
- ☆13Updated last year
- app会常驻手机后台,你可以随时随地保持与Fay数字人的沟通。☆43Updated 4 months ago
- 如需体验textin文档解析,请点击https://cc.co/16YSIy☆22Updated 9 months ago
- GPT+神器,简单实用的一站式AGI架构,内置本地化,LLM模型,agent,矢量数据库,智能链chain☆48Updated last year
- A FastAPI service for text-to-speech synthesis using the F5-TTS model. Includes authentication token☆18Updated this week
- WhisperMesh is an advanced chatbot that integrates voice and text interactions, delivering personalized responses through LLM models and …☆14Updated this week
- Open source intent recognition framework powered by LLMs.☆18Updated 4 months ago
- Prompt 工程师利器,可同时比较多个 Prompts 在多个 LLM 模型上的效果☆95Updated last year
- Collect VLM models that can be tried online.☆13Updated last year
- ☆19Updated 2 years ago
- 用文本编辑器剪视频☆37Updated last year
- AGM阿格姆:AI基因图谱模型,从token-weight权重微粒角度,探索AI模型,GPT\LLM大模型的内在运作机制。☆28Updated last year
- 百度QA100万数据集☆47Updated last year