YanZiBuGuiCHunShiWan / RESTFUL_ASR
A short-utterance online speech recognition service based on wenet
☆11Feb 25, 2023Updated 2 years ago
Alternatives and similar repositories for RESTFUL_ASR
Users that are interested in RESTFUL_ASR are comparing it to the libraries listed below
- This project aims to add masks to the facial dataset, which is based on FMA-3D and constructs an effective, easy to operate, and efficient…☆18Oct 5, 2023Updated 2 years ago
- ☆11Dec 24, 2024Updated last year
- This is a speaker-diarization-based speech recognition API implemented using FunASR.☆17Updated this week
- A mini Photoshop-like application built with C++, OpenCV, and Qt☆10Jun 6, 2021Updated 4 years ago
- Data manipulation and transformation for audio signal processing, powered by PyTorch☆10Sep 30, 2024Updated last year
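- For personal use: speech-to-text uses SenseVoice, the LLM part calls the Ollama API, text-to-speech uses CosyVoice, and real-time voice input is adapted from https://github.com/ABexit/ASR-LLM-TTS.☆12Dec 26, 2024Updated last year
- FunASR on-device offline version for Android, 2pass, all modes☆14Sep 4, 2023Updated 2 years ago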
- One command to start a streaming ASR server.☆12Oct 2, 2024Updated last year
- FreeSWITCH ASR module forked from mod_audio_stream, using the FunASR online CPU version☆16Jun 27, 2025Updated 7 months ago
- [CVPR 2023] Better “CMOS” Produces Clearer Images: Learning Space-Variant Blur Estimation for Blind Image Super-Resolution☆10Mar 19, 2024Updated last year
- ☆14Aug 9, 2021Updated 4 years ago
- Hpyformer based on FunASR☆30Nov 5, 2024Updated last year
- [ICLR 2025] Code for the paper "Tuning Timestep-Distilled Diffusion Model Using Pairwise Sample Optimization"☆22Apr 21, 2025Updated 9 months ago
- Official code for 'Preserving Full Degradation Details for Blind Image Super-Resolution'☆12Jul 2, 2024Updated last year
- This is the official repository for "ConStyle v2: A Strong Prompter for All-in-One Image Restoration"☆12Jun 12, 2024Updated last year
- Front-end project for ASR_LLM_TTS☆15Dec 3, 2024Updated last year
- [IEEE TCSVT 2025] Ultra-High-Definition Image Restoration: New Benchmarks and A Dual Interaction Prior-Driven Solution☆13Sep 26, 2025Updated 4 months ago
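- <Integrated> An AI program that uses Funasr for speech recognition, calls the Qwen large model for answers, and outputs speech through GPTSovits; the model is currently called online, and an offline large model will be added later☆13Nov 30, 2024Updated last year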
- GitHub repository for the Bria 3.2 pipeline☆44Sep 10, 2025Updated 5 months ago
- This project is a reimplementation of the paper Progressive Growing of GANs for Improved Quality, Stability, and Variation☆12Aug 29, 2025Updated 5 months ago
- ☆12Jul 11, 2024Updated last year
- ☆15Oct 19, 2024Updated last year
- ☆16Updated this week
- ☆18Apr 28, 2025Updated 9 months ago
- ComfyUI help pages☆31Feb 7, 2026Updated last week
- [WACV 2024] Official PyTorch implementation of "UGPNet"☆12Jan 2, 2024Updated 2 years ago
- A simple API for CosyVoice speech synthesis☆14Nov 1, 2024Updated last year
- HippoRAG implementation using APIs☆15Jun 6, 2024Updated last year
- Eliminating sensitive information from monitoring data☆13Jul 6, 2024Updated last year
- Code for RFSR: Improving ISR Diffusion Models via Reward Feedback Learning☆18Dec 8, 2024Updated last year
- Visualization tools for audio-only and multi-modal speaker diarization datasets☆13Oct 27, 2023Updated 2 years ago
- Both audio-only and audio-visual speaker diarization datasets are listed here.☆14Feb 22, 2023Updated 2 years ago
- Unofficial pixabay python API client☆13Feb 6, 2023Updated 3 years ago
- Unconditional Geomodeling related work (codes, data, and results)☆16Jan 4, 2023Updated 3 years ago
- A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.☆12May 7, 2019Updated 6 years ago
- ☆13Jul 25, 2024Updated last year
- ☆16Nov 9, 2023Updated 2 years ago
- A simple API version of funasr speech-to-text, built with funasr + fastapi, easy to deploy on a server☆13Aug 10, 2024Updated last year
- Code repository for the paper "Adaptive Bounding Box Uncertainties via Two-Step Conformal Prediction" @ ECCV 2024 (Oral)☆13Apr 22, 2025Updated 9 months ago