Audio tools for Python
☆16Dec 5, 2025Updated 6 months ago
Alternatives and similar repositories for YeAudio
Users that are interested in YeAudio are comparing it to the libraries listed below.
- ☆15Jul 23, 2024Updated last year
- demos using speex☆12Apr 20, 2018Updated 8 years ago
- ASLP Summer Inter@NPU☆12Jul 30, 2024Updated last year
- Re-implementation of SLAM-ASR paper's experiment, using Phi-2 and Hubert☆21Jun 14, 2024Updated 2 years ago
- ☆29Apr 17, 2023Updated 3 years ago
- ☆64Jul 5, 2025Updated 11 months ago
- The Neural-SRP method for DOA estimation☆36May 24, 2024Updated 2 years ago
- Wave U Net (NNabla)☆13Jul 1, 2020Updated 5 years ago
- DINet-based inference service for video streams and videos☆17Nov 8, 2023Updated 2 years ago
- ☆13Sep 20, 2023Updated 2 years ago
- IPA Phonetic dataset lexicon☆18May 26, 2026Updated 2 weeks ago
- This repository provides UNOFFICIAL Bunched LPCNet implementations with Pytorch.☆14Jun 17, 2021Updated 4 years ago
- The Pytorch implementation of sound classification supports EcapaTdnn, PANNS, TDNN, Res2Net, ResNetSE and other models, as well as a vari…☆597Dec 17, 2025Updated 5 months ago
- Cantonese Grapheme-to-Phoneme Converter based on GitYCC/g2pW☆15Dec 10, 2024Updated last year
- Unofficial Pytorch Lightning Implementation of "Real-time Speech Frequency Bandwidth Extension"☆41Oct 20, 2025Updated 7 months ago
- A python algorithm to change the pitch of the voice in real time☆13Dec 13, 2020Updated 5 years ago
- Coqui STT (🐸STT) based forced alignment tool☆13Feb 24, 2022Updated 4 years ago
- Paraformer (Chinese ASR) online ONNX Runtime for Python☆54Mar 27, 2024Updated 2 years ago
- Python bindings of speexdsp noise suppression library☆48Nov 18, 2022Updated 3 years ago
- Active noise controller (ANC) design: a practical primer☆14Jan 8, 2026Updated 5 months ago
- 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production☆37Mar 10, 2022Updated 4 years ago
- Splitting the ASR probability distribution results into Chinese pinyin, so as to extract more effective features for the Chinese speech…☆21Mar 16, 2023Updated 3 years ago
- Simple sinc interpolation in PyTorch.☆15Jul 8, 2023Updated 2 years ago
- 🗄 A Rust library designed as a specialized database for AI Agents, focusing on knowledge memory.☆30Updated this week
- Chinese and English Bilingual G2P☆22Jul 16, 2023Updated 2 years ago
- A python implementation of “Learning Deep Direct-Path Relative Transfer Function for Binaural Sound Source Localization” [TASLP 2021]☆28Feb 11, 2023Updated 3 years ago
- A python implementation of “Self-Supervised Learning of Spatial Acoustic Representation with Cross-Channel Signal Reconstruction and Mult…☆40Oct 11, 2024Updated last year
- FINALLY: Fast and universal speech enhancement model delivering studio-quality audio for a wide range of recordings.☆28Apr 1, 2026Updated 2 months ago
- This is the public repository for SALSA-Lite features for polyphonic sound event localization and detection using microphone arrays.☆15Dec 3, 2021Updated 4 years ago
- ☆23Aug 4, 2025Updated 10 months ago
- Implementation of CGMM-MVDR beamforming used for Clarity challenge☆14Jan 14, 2022Updated 4 years ago
- This tool displays tflite signatures and rewrites the input/output OP name to the name of the signature. There is no need to install Tens…☆14Dec 13, 2023Updated 2 years ago
- C++ version of Chinese character to pinyin conversion☆15Aug 31, 2018Updated 7 years ago
- The implementation of TaylorBeamformer, which is in submission to Interspeech2022☆48Jun 10, 2022Updated 4 years ago
- A thread-safe vector database for model inference inside LMDB.☆16Jun 6, 2026Updated last week
- Feedforward Sequential Memory Networks☆17Aug 2, 2022Updated 3 years ago
- Offline Speaker Diarization with SenseVoice by Sherpa ONNX.☆15Dec 23, 2024Updated last year
- VITS Inference using ONNX Runtime on C++☆13Dec 25, 2023Updated 2 years ago
- A solution to denoising and separating for two-speaker-mixed noisy speech, using a BSRNN inspired network.☆15Aug 22, 2023Updated 2 years ago