henttttai / voice-to-voice-llm-structureView external linksLinks
自用,语音到文本用的sencevoice,llm部分基于ollama的API调用,文本到语音用的cosyvoice,实时语音输入参考的https://github.com/ABexit/ASR-LLM-TTS。
☆12Dec 26, 2024Updated last year
Alternatives and similar repositories for voice-to-voice-llm-structure
Users that are interested in voice-to-voice-llm-structure are comparing it to the libraries listed below
Sorting:
- 内容审核及速率限制服务☆26May 18, 2025Updated 9 months ago
- ASR_LLM_TTS前端项目☆15Dec 3, 2024Updated last year
- funasr语音转文字的简单api版本,funasr+fastapi,方便部署在服务器上☆13Aug 10, 2024Updated last year
- ☆18Apr 10, 2025Updated 10 months ago
- This is a web-based intelligent dialogue program built using ASR, LLM, and TTS.☆24Dec 3, 2024Updated last year
- ASR (Automatic Speech Recognition) for real-time streamed audio powered by Whisper and tranformers☆35Dec 12, 2024Updated last year
- 简单实现VAD+声纹锁+SenseVoice完成类语音实时转录的小项目☆42Sep 23, 2024Updated last year
- TypeScript SDK for programmatic access to Google NotebookLM☆23Jan 14, 2026Updated last month
- rag base on langchain☆11Mar 1, 2024Updated last year
- Dataset created for the Power Line Insulators Inspection Detections☆10Jul 2, 2020Updated 5 years ago
- 全网首发,mmdetection Co-DETR TensorRT端到端推理加速☆39Nov 27, 2024Updated last year
- 网络舆情监测系统☆15Aug 11, 2024Updated last year
- ☆11Dec 24, 2024Updated last year
- A Swift implementation of Qwen3-ASR speech recognition model using MLX Swift for Apple Silicon.☆46Updated this week
- datatochart.com☆12Mar 29, 2025Updated 10 months ago
- AI-WordCards is an innovative project that leverages the power of GPT, StableDiffusion, and DALL-E3 to create educational and engaging wo…☆10May 16, 2024Updated last year
- ☆17Jan 11, 2025Updated last year
- 这是基于FunASR实现的区分说话人语音识别API | This is a speaker-diarization-based speech recognition API implemented using FunASR.☆17Updated this week
- RoboTwin: Dual-Arm Robot Benchmark with Generative Digital Twins☆12Sep 20, 2024Updated last year
- offline tts for react native on iOS☆18Jan 4, 2026Updated last month
- Epub Highlighter highlights specified words in EPub w/o meaning.☆11Jul 26, 2017Updated 8 years ago
- ☆10Mar 28, 2024Updated last year
- GLCM,logistic fuction , OTSU , matlab,horizon detection in maritime, sea and sky segmentation☆12Jan 5, 2020Updated 6 years ago
- WebRTC based video conferencing SDK for iOS (Swift / Objective C)☆13Jan 27, 2026Updated 3 weeks ago
- 一个基于trie树的具有联想功能的文本编辑器。采用python和pyqt☆10Sep 7, 2016Updated 9 years ago
- One command to start a streaming ASR server.☆12Oct 2, 2024Updated last year
- Algorithm for bird's-eye-view L-shape fitting in 3D LIDAR point clouds from traffic scenarios☆10Mar 5, 2020Updated 5 years ago
- 基于wenet的短时在线语音识别服务☆11Feb 25, 2023Updated 2 years ago
- ☆10Apr 18, 2022Updated 3 years ago
- Three examples using AutoGLM api to control mobile through esp32 and web server☆27Aug 21, 2025Updated 5 months ago
- ☆13Aug 18, 2022Updated 3 years ago
- Data manipulation and transformation for audio signal processing, powered by PyTorch☆10Sep 30, 2024Updated last year
- FreeSWITCH ASR module fork from mod_audio_stream, use FunASR online cpu version☆16Jun 27, 2025Updated 7 months ago
- C++对接富途证券Open API☆14Oct 18, 2018Updated 7 years ago
- cpp rotation album,基于cpp eigen实现的3d旋转相册,GAMES101复现内容☆12Jul 25, 2022Updated 3 years ago
- Towards practical change detection, including annotation, algorithms and deployment.☆12Dec 15, 2022Updated 3 years ago
- pytorch+bert实现的意图识别与槽位填充☆11May 30, 2023Updated 2 years ago
- 通过高德地图Api定位当前位置,并搜索周边POI☆11Mar 3, 2015Updated 10 years ago
- 开源语音识别自定义数据模型训练指南☆13Oct 8, 2023Updated 2 years ago