zyxcambridge / GPT4O
Reproducing GPT4O's real-time video and audio understanding
☆14Updated last year
Alternatives and similar repositories for GPT4O
Users who are interested in GPT4O are comparing it to the libraries listed below
- An open-source agentic workflow framework and showcase that turns chat text into control actions, powered by the Agently AI application development framework.☆29Updated last year
- ☆25Updated last year
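- Real-time video understanding and interaction through text, audio, image, and video with a large multimodal model. A framework for real-time video understanding and interaction using large multimodal models, via text…☆25Updated last year
- AI agent application: AI children's picture book generation based on GPT, langchain, function calling, Stable diffusion, and more☆24Updated 2 years ago
- Chinese prompts for Sora | Short-video prompt techniques | Tuning guide. Usage guides for a range of scenarios. Learn how to make it do what you ask. Also covers Sora's multi-scenario applications.☆96Updated this week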
- Big map for Google I/O 2025☆32Updated 6 months ago
- Turn Dify API into OpenAI API schema☆17Updated last year
- Video understanding: Qwen video multimodal model & Dify☆65Updated last year
- An AIGC application integrating LLM and SDXL☆29Updated last year
- EdgeInfer enables efficient edge intelligence by running small AI models, including embeddings and OnnxModels, on resource-constrained de…☆49Updated last year
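- Tampermonkey userscript that adds a button on GitHub for jumping to DeepWiki☆45Updated 7 months ago
- A powerful image generation tool based on Together AI, supporting text-to-image, image-to-image, and prompt analysis.☆24Updated last year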
- Qwen-TTS offers a robust voice synthesis service using FastAPI, supporting bilingual and dialect options. Explore seamless audio generati…☆87Updated this week
- Qwen prompt engineering & best practices☆37Updated last year
- Translate HTML to PPTX☆18Updated 2 years ago
- To try textin document parsing, visit https://cc.co/16YSIy ☆22Updated last year
- Collect VLM models that can be tried online.☆14Updated last year
- Eko Browser Extension Template☆36Updated 6 months ago
- [EMNLP 2025 Demo] PresentAgent: Multimodal Agent for Presentation Video Generation☆115Updated last week
- LLM API performance metric comparison - in-depth analysis of key metrics such as TTFT and TPS☆19Updated last year
- SPO | Self-Supervised Prompt Optimization☆29Updated 8 months ago
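- A multilingual AI model evaluation platform built with Next.js, supporting multi-model comparison and real-time streaming responses. A multilingual AI model evaluation platform built with Next.js, allowing users to compare …☆96Updated last year
- Training a PPT agent with reinforcement learning☆38Updated last month
- AI-StoryLab is an intelligent story creation platform based on Next.js, integrating audio production and AI image prompt generation.☆48Updated 10 months ago
- Y-Agent Studio is an agent development suite for enterprise-grade applications, with Y-Agent as its core module. It includes features much needed in vertical domains: agent orchestration, RAG, workflow logs, unit testing, workflow testing, and corpus production. Agent orchestration can support both multi-agent collaboration and hybrid workflow orchestration within the same workflow…☆20Updated last month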
- ☆54Updated 8 months ago
- ☆31Updated last year
- ☆33Updated last year
- Real time faster whisper gradio☆25Updated 3 months ago
- A small tool for verifying and counting the tokens a text consumes; it runs in the browser. Localized into Chinese from OpenAI Tokenizer.☆60Updated last year