masyagin1998 / HSV
Hubbub Suppression for Voice
☆16Updated last year
Alternatives and similar repositories for HSV
Users who are interested in HSV compare it to the repositories listed below
- A brief guide to writing GCC plugins, in Russian☆18Updated 6 years ago
- Segmenting a given document using recursive xy-cut algorithm.☆12Updated 6 years ago
- RObust document image BINarization☆180Updated 10 months ago
- ☆14Updated 4 years ago
- Training BERT for punctuation task☆10Updated 4 years ago
- ⚡ Blazing fast audio augmentation in Python, powered by GPU for high-efficiency processing in machine learning and audio analysis tasks.☆33Updated last year
- ☆15Updated last year
- Lightweight knowledge distillation pipeline☆28Updated 3 years ago
- ncnn HiFi-GAN☆26Updated 8 months ago
- TriNet: stabilizing self-supervised learning from complete or slow collapse on ASR.☆26Updated 2 years ago
- This repository contains the source code for SoftCTC. The original paper can be found here: https://arxiv.org/abs/2212.02135☆19Updated 2 years ago
- T5-based (russian) text normalization☆21Updated last year
- Deep Learning framework with NVIDIA & AMD support☆59Updated 2 years ago
- Real-time speech enhancement based on spectral subtraction☆14Updated 7 years ago
- PyTorch speech-to-text inference script for the NVIDIA OpenSeq2Seq wav2letter model variant☆10Updated 5 years ago
- Use the DTLN real-time speech denoising model (https://github.com/breizhn/DTLN) on the web.☆13Updated 2 years ago
- Implementation of Automatic Speech Recognition inspired by "Listen, Attend and Spell" paper in PyTorch☆11Updated 5 years ago
- Geometric Augmentation for Text Image☆9Updated 5 years ago
- Implementation of SoundStream from the paper: "SoundStream: An End-to-End Neural Audio Codec"☆12Updated 4 months ago
- Unofficial Tensorflow 2 implementation of SINet: Extreme Lightweight Portrait Segmentation Networks with Spatial Squeeze Modules and Info…☆14Updated 2 years ago
- SpeechNAS-Better-Trade-off-between-Latency-and-Accuracy-for-Large-Scale-Speaker-Verification☆30Updated 2 years ago
- Normalize Text in Russian☆27Updated last year
- Using FCN to segment the book's content and background, then dewarping the pages☆21Updated 3 years ago
- Code for our ACML and INTERSPEECH papers: "Speaker Diarization as a Fully Online Bandit Learning Problem in MiniVox".☆27Updated 3 years ago
- Python implementation of CTC beam search decoder + agnostic LM scorer☆19Updated 4 years ago
- YACHT 🛳️: Smoothly riding the waves of C++ projects☆11Updated 2 years ago
- A PyTorch Dataset that caches samples in shared memory, accessible globally to all processes☆20Updated 3 years ago
- An implementation of Tiling and Corruption (TACo) Augmentations for OCR/HTR☆15Updated 3 years ago
- Russian phonetic transcription☆10Updated last year
- Supervoice Speaker Separation Network☆12Updated last year