Wrapper for pydub AudioSegment objects
☆96Dec 27, 2022Updated 3 years ago
Alternatives and similar repositories for AudioSegment
Users that are interested in AudioSegment are comparing it to the libraries listed below
Sorting:
- Real-time melgan based on cpu !!!☆13Dec 3, 2019Updated 6 years ago
- A speech signal processing library in Python with emphasis on deep learning.☆31Jul 16, 2022Updated 3 years ago
- Repository of code for Speech emotion recognition using voiced speech and attention model, submitted to ICSigSys 2019☆13Jan 6, 2020Updated 6 years ago
- 这是一个简单的粤语注音工具,可对中文进行粤语的注音,查询词语的解析和查询某个发 音有哪些对应的字。☆11Jan 5, 2015Updated 11 years ago
- This repository contains code that was used as an example of how to use Python to download part of the AudioSet dataset and use Tensorflo…☆13Aug 24, 2017Updated 8 years ago
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis☆84May 23, 2023Updated 2 years ago
- ☆16Jul 25, 2016Updated 9 years ago
- Tensorflow implementation of pix2pix(cGAN) for audio source separation☆16Jun 30, 2018Updated 7 years ago
- Hackathon project to digitize your own voice and have it speak for you! Fully automated!☆12Oct 22, 2019Updated 6 years ago
- Mutiband version of HIFIGAN☆19Nov 6, 2020Updated 5 years ago
- Demo for DART, Audio Imagination workshop submission in NeurIPS 2024☆12Apr 15, 2025Updated 11 months ago
- C++ implementation of End to End TTS which combines both Tacatron2 and LPCNET Vocoder.☆32Oct 1, 2019Updated 6 years ago
- Manipulate audio with a simple and easy high level interface☆9,744Jul 26, 2025Updated 7 months ago
- A library that helps you to convert from one subtitle format to another☆19Jan 8, 2019Updated 7 years ago
- Official PyTorch inference code for the Interspeech 2025 paper: Efficient Speech Enhancement via Embeddings from Pre-trained Generative A…☆76Jun 16, 2025Updated 9 months ago
- Pushing the Limits of Zero-shot End-to-End Speech Translation☆26Dec 12, 2024Updated last year
- Lyrics Generation using LSTM , word2vec Analysis and more☆10Jun 7, 2018Updated 7 years ago
- ParallelWaveGAN adaptation for Mozilla TTS☆15May 23, 2020Updated 5 years ago
- Python 汉字到粤拼转换工具。☆35Feb 26, 2024Updated 2 years ago
- Automatic speech emotion recognition based on transfer learning from spectrograms using ResNET☆27Mar 11, 2022Updated 4 years ago
- noise reduction☆17Jul 3, 2024Updated last year
- A recursive forced aligner built on Gentle.☆16Mar 20, 2019Updated 6 years ago
- ☆14Jan 26, 2025Updated last year
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆33Oct 23, 2025Updated 4 months ago
- Group Gated Fusion on Attention-based Bidirectional Alignment for Multimodal Emotion Recognition☆14May 10, 2022Updated 3 years ago
- Easy handle DPlayer-Lite or DPlayer on WordPress. A shortcode for WordPress to using DPlayer.☆12Jan 3, 2020Updated 6 years ago
- Audio degradation toolbox in python, with a command-line tool. It is useful to apply controlled degradations to audio: e.g. data augmenta…☆55Jul 25, 2022Updated 3 years ago
- A Pytorch implementation of 'AUTOMATIC SPEECH EMOTION RECOGNITION USING RECURRENT NEURAL NETWORKS WITH LOCAL ATTENTION'☆41Aug 1, 2018Updated 7 years ago
- Live demo for speech emotion recognition using Keras and Tensorflow models☆39Aug 2, 2024Updated last year
- Evaluation and Benchmarking of Speech Super-resolution Methods☆152Jun 17, 2022Updated 3 years ago
- Real-time Voice Activity Detection in Noisy Eniviroments using Deep Neural Networks☆458Jun 3, 2020Updated 5 years ago
- Class project aimed at using ICA to implement Blind Source Separation on sound signals☆10Apr 23, 2015Updated 10 years ago
- Multimodal classification solution for the SIGIR eCOM using Co-attention and transformer language models☆19Aug 17, 2020Updated 5 years ago
- Reproduction of a paper"Small-footprint keyword spotting using deep neural networks"☆12Mar 11, 2019Updated 7 years ago
- PnG BERT: Augmented BERT on Phonemes and Graphemes for Neural TTS☆24Jan 29, 2022Updated 4 years ago
- Implementation of the paper "Exploiting Time-Frequency Conformers for Music Audio Enhancement"☆12Mar 21, 2025Updated 11 months ago
- FPGA based, Real-time processing of audio, including voiceprint recognition, adaptive noise suppression, et al.☆15May 8, 2025Updated 10 months ago
- Approximate and vectorized versions of common mathematical functions☆13Mar 1, 2017Updated 9 years ago
- ☆40Jul 7, 2016Updated 9 years ago