video cut powered by AI
☆24 · Nov 15, 2022 · Updated 3 years ago
Alternatives and similar repositories for wecut
Users that are interested in wecut are comparing it to the libraries listed below
- Open Source Speech/Text Data on AI ☆19 · Sep 13, 2022 · Updated 3 years ago
- Python wrapper for kaldi's arpa2fst ☆38 · Aug 27, 2025 · Updated 6 months ago
- ICASSP2022 TTS&VC Summary ☆14 · Jun 9, 2022 · Updated 3 years ago
- ☆25 · Nov 23, 2021 · Updated 4 years ago
- Production first, nn-based on-device signal processing toolkit. ☆65 · May 30, 2023 · Updated 2 years ago
- ☆61 · Jan 31, 2023 · Updated 3 years ago
- ☆16 · Dec 23, 2021 · Updated 4 years ago
- PPSpeech: Phrase based Parallel End-to-End TTS System ☆35 · Aug 31, 2020 · Updated 5 years ago
- Torch-based tool for quantizing high-dimensional vectors using additive codebooks ☆54 · May 25, 2022 · Updated 3 years ago
- streaming attention networks for end-to-end automatic speech recognition ☆55 · May 6, 2020 · Updated 5 years ago
- A torch implementation of a recursion which turns out to be useful for RNN-T. ☆150 · Aug 25, 2023 · Updated 2 years ago
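- Emotion-controllable speech synthesis model for all Genshin Impact and Honkai Impact 3rd characters, requiring no emotion annotations, based on VITS ☆22 · May 30, 2023 · Updated 2 years ago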
- Audio streaming transfer demo with google.api.HttpBody and grpc gateway for speech synthesis ☆20 · Jan 28, 2020 · Updated 6 years ago
- Data and code related to the ICASSP submission "A comparison of methods for OOV-word recognition" ☆17 · Nov 28, 2021 · Updated 4 years ago
- Interact with GPT-3 through speech ☆12 · Dec 12, 2022 · Updated 3 years ago
- Crystal TTVS engine is a real-time audio-visual Multilingual speech synthesizer with a 3D expressive avatar. ☆87 · Aug 17, 2020 · Updated 5 years ago
- Code and data recipes for the paper: Optimal Condition Training for Target Source Separation by Efthymios Tzinis, Gordon Wichern, Paris S… ☆14 · Feb 15, 2023 · Updated 3 years ago
- ☆11 · May 9, 2023 · Updated 2 years ago
- Decoders from Kaldi using OpenFst ☆34 · Jan 29, 2026 · Updated last month
- Memory efficient transducer loss computation ☆69 · Jun 10, 2022 · Updated 3 years ago
- Linear Prediction Coefficients estimation from mel-spectrogram implemented in Python based on Levinson-Durbin algorithm. ☆71 · Mar 19, 2021 · Updated 4 years ago
- A Unified Deep Learning Framework for ssTEM Image Restoration ☆12 · Jul 27, 2022 · Updated 3 years ago
- PyTorch implementation of Continuous Speech Separation ☆12 · Oct 5, 2022 · Updated 3 years ago
- ☆11 · May 7, 2022 · Updated 3 years ago
- ☆28 · Oct 7, 2025 · Updated 4 months ago
- A repository for benchmarking neural vocoders by their quality and speed. ☆211 · May 30, 2025 · Updated 9 months ago
- A repository comprising code for generation of noisy speech data from clean data using deep learning methods ☆16 · Jul 12, 2021 · Updated 4 years ago
- ☆12 · Apr 25, 2022 · Updated 3 years ago
- ☆88 · Nov 1, 2022 · Updated 3 years ago
- Spherical residual vector quantization (SRVQ) ☆31 · Aug 25, 2024 · Updated last year
- 3M: Multi-loss, Multi-path and Multi-level Neural Networks for speech recognition ☆118 · Jun 22, 2022 · Updated 3 years ago
- SERAB: a multi-lingual benchmark for speech emotion recognition ☆28 · Dec 16, 2022 · Updated 3 years ago
- Standalone implementation of the CUDA-accelerated WFST Decoder available in Riva ☆91 · Feb 18, 2025 · Updated last year
- ☆33 · Nov 29, 2022 · Updated 3 years ago
- Interspeech Tutorial - Resource Efficient and Cross-Modal Learning Toward Foundation Modeling ☆15 · Oct 9, 2023 · Updated 2 years ago
- This is the implementation of the manuscript "Learning General All-Neural Speech Enhancement based on Taylor's Approximation Theory", whi…