zyxcambridge / GPT4O
复现GPT4O的实时视频和音频理解
☆11Updated 11 months ago
Alternatives and similar repositories for GPT4O
Users that are interested in GPT4O are comparing it to the libraries listed below
Sorting:
- EdgeInfer enables efficient edge intelligence by running small AI models, including embeddings and OnnxModels, on resource-constrained de…☆44Updated last year
- ☆26Updated 8 months ago
- 大模型API性能指标比较 - 深入分析TTFT、TPS等关键指标☆17Updated 8 months ago
- An open-source chat text to control actions agentic workflow framework/showcase powered by Agently AI application development framework.☆27Updated 7 months ago
- 中文原生文生图测评基准☆9Updated 10 months ago
- TPO 是一个优化 LLM 输出文本的框架,通过迭代反馈和优化提示的方式来“微调模型”,而非直接调整模型的参数,使模型在推理过程中与人类偏好对齐以生成更好的结果。本项目提供了一个友好的 WebUI 来加载模型,实时优化基础模型并展示最佳结果。☆10Updated 2 months ago
- A prompt set of ChatGLM-6B☆14Updated last year
- 一个基于Together AI的强大图像生成工具,支持文生图、图生图和提示词分析功能。☆24Updated 5 months ago
- Finetune Llama 3, Mistral & Gemma LLMs 2-5x faster with 80% less memory☆28Updated last year
- 基于bilibili视频构建大模型问答训练数据,输入bilibili视频地址等信息即可生成QA数据供videoQA_databuilder项目使用☆18Updated last year
- support BM25+vecetor☆29Updated 5 months ago
- 视频理解:千问视频多模态模型 & Dify☆53Updated 8 months ago
- WhisperMesh is an advanced chatbot that integrates voice and text interactions, delivering personalized responses through LLM models and …☆14Updated 3 weeks ago
- HayLM是专门为儿童训练的大模型,通过对InternLM的训练和微调,结合儿童心理学、教育学以及对话风格的数据训练,实现与儿童的智能互动,并在交流过程中不断学习和适应用户特性,成为一个伴随儿童成长的虚拟朋友。☆12Updated 3 months ago
- 如需体验textin文档解析,请点击https://cc.co/16YSIy☆22Updated 10 months ago
- ☆50Updated this week
- ☆12Updated last year
- Open-source project showcasing how single or multiple AI Agents interact with MCP in real-world tasks. Includes both frontend and backend…☆13Updated this week
- 机器学习基础☆8Updated 6 years ago
- ⚡ A LLM Prompt distribution tool ⚡☆25Updated last year
- 基于Roo Cline+DeepSeek的AI开发教程☆50Updated 2 months ago
- 百度QA100万数据集☆47Updated last year
- GPT+神器,简单实用的一站式AGI架构,内置本地化,LLM模型,agent,矢量数据库,智能链chain☆48Updated last year
- Real-time video understanding and interaction through text,audio,image and video with large multi-modal model. 利用多模态大模型的实时视频理解和交互框架,通过文本…☆23Updated last year
- Creating Interactive and Embedded Physics Simulations from Static Textbook Diagrams☆21Updated 2 months ago
- kimi-chat 测试数据☆7Updated last year
- 知乎大语言模型、ChatGPT、Transformers问答☆36Updated last year
- Real time faster whisper gradio☆26Updated 7 months ago
- Qimen表示的是奇门遁甲之术,用于抽取各种实体的工具。☆29Updated 5 years ago
- Implemented a script that automatically adjusts Qwen3's inference and non-inference capabilities, based on an OpenAI-like API. The infere…☆18Updated last week