该开源项目旨在提供一个能够自动检测并识别中文语音的模型,支持wav、mp4、m4a等格式的音频文件上传。无论是从录音设备中获取的wav文件,还是从视频中提取的mp4、m4a文件,我们的模型可以准确识别其中的中文文字内容。通过集成最先进的语音识别技术和深度学习算法,我们的模型能够快速、准确地将声音转换为文字,为用户提供便捷的语音识别体验。
☆46Jun 6, 2024Updated 2 years ago
Alternatives and similar repositories for voice_translation
Users that are interested in voice_translation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 抖音带货直播实时GMV Top5直播间评论抓取与情感分析☆20Mar 10, 2024Updated 2 years ago
- ☆13Mar 10, 2024Updated 2 years ago
- ☆16Apr 3, 2025Updated last year
- 一个强大的、由 AI 驱动的演示文稿(PPt)自动化生成工具,真正生产化的工具,全流程可控,帮助用户快速制作出符合需求的 PPt。☆29Sep 23, 2025Updated 8 months ago
- combine ASR, LLM and TTS in local development with python☆19Sep 21, 2024Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- “象棋学习助手”是一款智能化的中国象棋辅助工具。通过实时识别屏幕中的棋盘局面,结合高性能引擎分析,为在线对战平台的玩家提供精准走法建议与策略指导,帮助象棋爱好者快速提升棋艺水平。☆69Jan 28, 2026Updated 4 months ago
- 增加了较为详细的注释、一些自己的功能和封装代码方便嵌入☆17Apr 4, 2022Updated 4 years ago
- 中文文本摘要,基于pytorch,采用LCSTS数据集☆21Nov 11, 2021Updated 4 years ago
- KWS demo based on CTC prefix beam search.☆18Oct 21, 2023Updated 2 years ago
- 简单实现VAD+声纹锁+SenseVoice完成类语音实时转录的小项目☆42Sep 23, 2024Updated last year
- 小白可以学习,作者免费提供所有源代码和ui,大佬不喜互喷☆30Jan 10, 2023Updated 3 years ago
- This repository contains the Code for SOTA model on Google Speech Command V2 dataset.☆16Sep 28, 2023Updated 2 years ago
- In this repository, I implement a system for detecting specific spoken words in speech signal. When reading a speech signal, I detect not…☆19Sep 27, 2021Updated 4 years ago
- WP2AI可以将您的WordPress文章变成智能知识库,并通过AI智能匹配和解读,使其更准确的回答问题。☆15Mar 5, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆57Mar 28, 2025Updated last year
- ☆24May 12, 2024Updated 2 years ago
- 凌波微步,一款在线刷步神器(目前支持微信,支付宝,QQ,阿里体育,钉钉...)☆10Jan 4, 2023Updated 3 years ago
- ☆12May 24, 2022Updated 4 years ago
- Desktop App to Translate Sign Language from Voice to Sign and Vice Versa☆33Apr 25, 2021Updated 5 years ago
- thundernet ncnn☆43Feb 20, 2021Updated 5 years ago
- [CVPR 2025] Generalized Few-shot 3D Point Cloud Segmentation with Vision-Language Model☆63May 7, 2025Updated last year
- [CVPR'24] Aerial Lifting: Neural Urban Semantic and Building Instance Lifting from Aerial Imagery☆60Dec 16, 2024Updated last year
- Test Framework for few-shot open set KWS☆43Nov 8, 2024Updated last year
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- 基于gin+gorm+mysql+redis搭建的论坛系统☆31Nov 25, 2023Updated 2 years ago
- Research sources on graph-based anomaly detection☆13Nov 29, 2022Updated 3 years ago
- Car model classification☆57Mar 12, 2026Updated 3 months ago
- ☆13May 16, 2025Updated last year
- Code for running experiments and benchmarking on GNNExplainer: Generating Explanations for Graph Neural Networks☆15May 8, 2021Updated 5 years ago
- Automatic method for the recognition of hand gestures for the categorization of vowels and numbers in Colombian sign language based on Ne…☆15Nov 18, 2018Updated 7 years ago
- Recent papers on Graph Neural Networks-based Recommender System.☆12Aug 21, 2023Updated 2 years ago
- Code for the Interspeech 2024 paper "MM-KWS: Multi-modal Prompts for Multilingual User-defined Keyword Spotting"☆48Jan 24, 2026Updated 4 months ago
- ☆10Apr 5, 2023Updated 3 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- 本地检测 proxypool 节点的可用性☆18Dec 8, 2021Updated 4 years ago
- [ICLR 2025 Spotlight] Multimodality Helps Few-shot 3D Point Cloud Semantic Segmentation☆73May 7, 2025Updated last year
- ☆10Feb 21, 2023Updated 3 years ago
- ☆19Nov 26, 2023Updated 2 years ago
- ☆11Sep 19, 2022Updated 3 years ago
- 基于 SiliconFlow API 的语音转文字桌面工具,支持 PyQt5 图形界面、音频文件批量转录和结果编辑管理 | A PyQt5 desktop app for speech-to-text transcription using SiliconFlow API☆19May 22, 2026Updated 3 weeks ago
- Another implementation of the paper "Compound Word Transformer: Learning to Compose Full-Song Music over Dynamic Directed Hypergraphs" in…☆13Jun 30, 2021Updated 4 years ago