自用,语音到文本用的sencevoice,llm部分基于ollama的API调用,文本到语音用的cosyvoice,实时语音输入参考的https://github.com/ABexit/ASR-LLM-TTS。
☆12Dec 26, 2024Updated last year
Alternatives and similar repositories for voice-to-voice-llm-structure
Users that are interested in voice-to-voice-llm-structure are comparing it to the libraries listed below
Sorting:
- 内容审核及速率限制服务☆26May 18, 2025Updated 9 months ago
- ASR_LLM_TTS前端项目☆15Dec 3, 2024Updated last year
- 这是基于FunASR实现的区分说话人语音识别API | This is a speaker-diarization-based speech recognition API implemented using FunASR.☆23Feb 12, 2026Updated 3 weeks ago
- funasr语音转文字的简单api版本,funasr+fastapi,方便部署在服务器上☆13Aug 10, 2024Updated last year
- ☆18Apr 10, 2025Updated 11 months ago
- This is a web-based intelligent dialogue program built using ASR, LLM, and TTS.☆24Dec 3, 2024Updated last year
- ASR (Automatic Speech Recognition) for real-time streamed audio powered by Whisper and tranformers☆36Dec 12, 2024Updated last year
- 简单实现VAD+声纹锁+SenseVoice完成类语音实时转录的小项目☆41Sep 23, 2024Updated last year
- rag base on langchain☆11Mar 1, 2024Updated 2 years ago
- Dataset created for the Power Line Insulators Inspection Detections☆10Jul 2, 2020Updated 5 years ago
- 全网首发,mmdetection Co-DETR TensorRT端到端推理加速☆39Nov 27, 2024Updated last year
- AI-WordCards is an innovative project that leverages the power of GPT, StableDiffusion, and DALL-E3 to create educational and engaging wo…☆10May 16, 2024Updated last year
- ☆17Jan 11, 2025Updated last year
- datatochart.com☆12Mar 29, 2025Updated 11 months ago
- RoboTwin: Dual-Arm Robot Benchmark with Generative Digital Twins☆12Sep 20, 2024Updated last year
- ☆11Dec 24, 2024Updated last year
- Epub Highlighter highlights specified words in EPub w/o meaning.☆11Jul 26, 2017Updated 8 years ago
- TypeScript SDK for programmatic access to Google NotebookLM☆28Jan 14, 2026Updated last month
- ☆10Mar 28, 2024Updated last year
- ☆13Aug 18, 2022Updated 3 years ago
- cpp rotation album,基于cpp eigen实现的3d旋转相册,GAMES101复现内容☆12Jul 25, 2022Updated 3 years ago
- ☆12Oct 8, 2022Updated 3 years ago
- YOLOv12 TensorRT 端到端模型加速推理和INT8量化实现☆13Mar 5, 2025Updated last year
- One-click deploy Ubuntu development environment. 一键部署Ubuntu开发环境。☆10Jan 1, 2018Updated 8 years ago
- Data manipulation and transformation for audio signal processing, powered by PyTorch☆10Sep 30, 2024Updated last year
- 舆情分析系统前端☆11Jun 20, 2021Updated 4 years ago
- Leverage 3D video and Spatial Audio to deliver an immersive experience.☆11Oct 11, 2023Updated 2 years ago
- 这是一个深度学习的一个小项目,利用卷积神经网络识别猫狗图片☆21May 5, 2022Updated 3 years ago
- Algorithm for bird's-eye-view L-shape fitting in 3D LIDAR point clouds from traffic scenarios☆10Mar 5, 2020Updated 6 years ago
- A syscall hooking system for FreeBSD, NetBSD and also Linux.☆16Nov 14, 2021Updated 4 years ago
- 网络舆情监测系统☆16Aug 11, 2024Updated last year
- Towards practical change detection, including annotation, algorithms and deployment.☆12Dec 15, 2022Updated 3 years ago
- ☆12Nov 12, 2020Updated 5 years ago
- C++对接富途证券Open API☆14Oct 18, 2018Updated 7 years ago
- Complex Reinforcement Learning Simulation for PiH task used for M.Sc. degree.☆13Aug 15, 2023Updated 2 years ago
- FreeSWITCH ASR module fork from mod_audio_stream, use FunASR online cpu version☆16Jun 27, 2025Updated 8 months ago
- FunASR安卓端侧离线版本2pass全模式☆14Sep 4, 2023Updated 2 years ago
- WebRTC based video conferencing SDK for iOS (Swift / Objective C)☆13Jan 27, 2026Updated last month
- A Pytorch implementing of A Deep Learning approach to Template Matching. Usie Hypernet + VGG to match the templates.☆12Dec 18, 2021Updated 4 years ago