byebyebruce / waduLinks
Translate the PDF into an online audiobook.
☆37Updated last month
Alternatives and similar repositories for wadu
Users that are interested in wadu are comparing it to the libraries listed below
Sorting:
- ☆102Updated 9 months ago
- RTC AIGC Demo☆217Updated last week
- Step-by-step Jupyter notebook tutorials for ChatTTS☆169Updated last year
- A python native agent framework☆461Updated 11 months ago
- 让算法工程化更简单☆94Updated 7 months ago
- A complete 7-layer intelligent memory system for AI Agents with multi-modal memory fusion also support context_engineering☆128Updated 3 months ago
- 基于Roo Cline+DeepSeek的AI开发教程☆70Updated 7 months ago
- Qwen 提示词工程 & 最佳实践☆36Updated last year
- 医疗问诊系统multi-agent框架☆88Updated 7 months ago
- 标书大模型(Proposal-LLM Chinese version )☆281Updated 11 months ago
- Sample Repository for the AlibabaCloud Bailian Speech SDK☆302Updated this week
- LLM大语言模型可视化3D演示 | Chinese translation of llm-viz☆142Updated 4 months ago
- AI virtual human bot framework(public)☆264Updated 3 months ago
- 模型 llava-Qwen2-7B-Instruct-Chinese-CLIP 增强中文文字识别能力和表情包内涵识别能力,接近gpt4o、claude-3.5-sonnet的识别水平!☆25Updated last year
- 可以实现按下 Option 按钮开始录制,抬起按钮就结束录制,并调用 Groq Whisper Large V3 Turbo 模型进行转译,由于 Groq 的速度非常快,所以大部分的语音输入都可以在 1-2s 内反馈。并且得益于 whisper 的强大能力,转译效果非常不错…☆579Updated 9 months ago
- 视频理解:千问视频多模态模型 & Dify☆65Updated last year
- 📝 针对文档类图像做内容提取,将文档类图像一比一输出到Word或者Txt中,便于进一步使用或处理。后续计划支持输入PDF/图像,输出对应json格式、Txt格式、Word格式和Markdown格式。☆206Updated last year
- The Python SDK for the Coze API☆422Updated last week
- API and websocket server for sensevoice. It has inherited some enhanced features, such as VAD detection, real-time streaming recognition,…☆516Updated last year
- 实时STT,连接OpenAI接口/智谱AI(流式LLM)和GPT-SOVITS/Edge-TTS,通过网页的方式,进行跨网络的服务调用,实现实时对话的效果☆421Updated 10 months ago
- ☆403Updated 5 months ago
- 该项目围绕 Coze 打造 AI 私人提效助理展开,整合实用 AI 工作流并做拆解,同时准备提示词手册和案例手册,旨在展示项目可行性,帮助学习者更好地理解和实操相关技能。☆170Updated 7 months ago
- 《机器学习工程》开源电子书,欢迎一起贡献完善《Machine Learning Engineering》中文版☆72Updated last year
- 读光中英文OCR onnx 版本模型使用 | Code for using the ONNX version of DuGuang OCR in both Chinese and English☆49Updated 5 months ago
- PatentWriterAgent Demo☆270Updated this week
- 🧠 世界上覆盖最全的优秀Qwen提示语大全,欢迎贡献你的提示词。🧠 The most comprehensive collection of excellent Qwen prompts in the world. Feel free to contribute you…☆309Updated 2 months ago
- ☆135Updated 8 months ago
- Delibird 是一个多合一大模型接口网关。主要针对国内的大模型,包括文心、百川、千问、星火、智谱等提供统一的接口调用。基于 Python 开发,容易集成。原生提供 Streaming 接口、多进程异步调度模式,性能较好、调用接口完全兼容 openai APi,方便集成。☆14Updated last year
- 百聆 是一个类似GPT-4o的语音对话机器人 ,通过ASR+LLM+TTS实现,集成DeepSeek R1等优秀大模型,时延低至800ms,Mac等低配置也可运行,支持打断☆1,485Updated 3 months ago
- Easy, fast, and cheap pretrain,finetune, serving for everyone☆315Updated 3 months ago