byebyebruce / waduLinks
Translate the PDF into an online audiobook.
☆37Updated 3 months ago
Alternatives and similar repositories for wadu
Users that are interested in wadu are comparing it to the libraries listed below
Sorting:
- ☆103Updated 10 months ago
- RTC AIGC Demo☆229Updated 3 weeks ago
- Sample Repository for the AlibabaCloud Bailian Speech SDK☆325Updated last month
- LabelFree☆60Updated 2 years ago
- 🧠 世界上覆盖最全的优秀Qwen提示语大全,欢迎贡献你的提示词。🧠 The most comprehensive collection of excellent Qwen prompts in the world. Feel free to contribute you…☆336Updated 3 months ago
- Step-by-step Jupyter notebook tutorials for ChatTTS☆172Updated last year
- 实时STT,连接OpenAI接口/智谱AI(流式LLM)和GPT-SOVITS/Edge-TTS,通过网页的方式,进行跨网络的服务调用,实现实时对话的效果☆428Updated 11 months ago
- Llama3-Chinese是以Meta-Llama-3-8B为底座,使用 DORA + LORA+ 的训练方法,在50w高质量中文多轮SFT数据 + 10w英文多轮SFT数据 + 2000单轮自我认知数据训练而来的大模型。☆295Updated last year
- 基于 faster-whisper 的伪实时语音转写服务☆232Updated 7 months ago
- 可用于深度思考和复杂流程的AI工具☆463Updated 9 months ago
- EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine☆45Updated last year
- A complete 7-layer intelligent memory system for AI Agents with multi-modal memory fusion also support context_engineering☆132Updated 5 months ago
- 读光中英文OCR onnx 版本模型使用 | Code for using the ONNX version of DuGuang OCR in both Chinese and English☆50Updated 3 weeks ago
- 视频理解:千问视频多模态模型 & Dify☆65Updated last year
- 📝 针对文档类图像做内容提取,将文档类图像一比一输出到Word或者Txt中,便于进一步使用或处理。后续计划支持输入PDF/图像,输出对应json格式、Txt格式、Word格式和Markdown格式。☆207Updated last year
- 官方推荐的 ChatTTS 最佳入门指南,整理和汇总了常见问题和相关资源☆101Updated last year
- API and websocket server for sensevoice. It has inherited some enhanced features, such as VAD detection, real-time streaming recognition,…☆530Updated last year
- AI virtual human bot framework(public)☆282Updated last week
- a open framework for blind navigation based on esp32☆990Updated last month
- 模型 llava-Qwen2-7B-Instruct-Chinese-CLIP 增强中文文字识别能力和表情包内涵识别能力,接近gpt4o、claude-3.5-sonnet的识别水平!☆27Updated last year
- 医疗问诊系统multi-agent框架☆91Updated 8 months ago
- ☆290Updated last year
- 《机器学习工程》开源电子书,欢迎一起贡献完善《Machine Learning Engineering》中文版☆72Updated last year
- UnderstandingDeepLearing中文翻译☆126Updated last year
- 从小说中提取对话数据集☆295Updated 3 months ago
- GOT-OCR的GUI版本,提供OCR、导出PDF、批处理等功能,但不提供训练功能☆179Updated last month
- ☆599Updated last year
- TTS☆49Updated last year
- MCP Server for the Bilibili API, supporting various operations.☆170Updated 7 months ago
- A python native agent framework☆461Updated last year