An attempt to Vietnamese speech enhencement with U-net and Unet based ResNet
☆22Nov 6, 2021Updated 4 years ago
Alternatives and similar repositories for speech-enhancement
Users that are interested in speech-enhancement are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Wave U Net (NNabla)☆13Jul 1, 2020Updated 5 years ago
- PAGAN: a phase-adapted GAN for speech enhancement☆36Sep 17, 2020Updated 5 years ago
- A Deep Convolutional Neural Network (DCNN) designed for the task of localizing human speech to 168 location classes using binaural microp…☆10Dec 16, 2017Updated 8 years ago
- A Transformer-based Prediction Method for Depth of Anesthesia During Target-controlled Infusion of Propofol and Remifentanil.☆16Feb 17, 2025Updated last year
- A Robustly Optimized BERT Pretraining Approach for Vietnamese☆32Jul 25, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- End-to-End binaural sound localization☆17Feb 27, 2020Updated 6 years ago
- Self-Attention Generative Adversarial Network for Speech Enhancement using Tensorflow 2☆16Jan 30, 2021Updated 5 years ago
- Audio classification deep learning model using TensorFlow 2.0 to detect Gunshots. 97.5% test set accuracy and 99% training set accuracy w…☆22Feb 16, 2020Updated 6 years ago
- Complex-valued neural networks for DOA estimation☆30Jan 25, 2023Updated 3 years ago
- Speaker overlap-aware Neural Diarization☆12Feb 13, 2023Updated 3 years ago
- Graph Neural Networks for Sound Source Localization☆27Oct 31, 2023Updated 2 years ago
- Sherpa-onnx-tts-stt source for homeassisstant addon with Kroko Onnx Streaming STT integration.☆28Dec 18, 2025Updated 4 months ago
- ☆25Jul 20, 2021Updated 4 years ago
- Machine Reading Comprehension has attracted significant interest in research on natural language understanding, and large-scale datasets …☆10Aug 14, 2021Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ResNet-STFT Model for Sound Source Localization☆20Aug 25, 2022Updated 3 years ago
- Scripts to automate simple tasks throughout learning process at UET-VNU☆17Jun 8, 2021Updated 4 years ago
- ☆21Jun 13, 2019Updated 6 years ago
- Cải thiện Elasticsearch trong bài toán semantic search sử dụng phương pháp Sentence Embeddings☆25May 27, 2021Updated 4 years ago
- DNN-based speech enhancement using Tensorflow by Haoyu Li (Tokyo univ.)☆17Aug 31, 2017Updated 8 years ago
- Files for the paper: "Sound Source Localization using Deep Residual Learning"☆24Nov 13, 2017Updated 8 years ago
- A chrome extension to toggle subtitles using keyboard shortcut (C)☆10Jul 4, 2025Updated 10 months ago
- [ACL 2024] Novel reranking method to select the best solutions for code generation☆16Jun 9, 2024Updated last year
- Code for the paper: Deep Residual Networks with Auditory Inspired Features for Robust Speech Recognition.☆21Mar 22, 2017Updated 9 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆51Sep 3, 2025Updated 8 months ago
- This is a implementation of kaldi-plda.☆15Jun 9, 2018Updated 7 years ago
- Unofficial API to get MCQs from Sanfoundry, some results/answers may be incorrect☆13Aug 25, 2024Updated last year
- Deep Noise Suppression for Real Time Speech Enhancement in a Single Channel Wide Band Scenario☆27Jan 25, 2024Updated 2 years ago
- Machine Learning Approach to built a robust speaker recognition model using MFCC features and GMM universal background model.☆15May 30, 2020Updated 5 years ago
- [DEPRECATED] Vietnamese Handwriting Recognition with CRNN and CTC Loss☆32Apr 2, 2019Updated 7 years ago
- Parameter Estimation in Multi-standard Wideband Receivers via Deep Learning. (DOA - Direction of Arrival)☆27Apr 22, 2022Updated 4 years ago
- ☆20Mar 2, 2022Updated 4 years ago
- We provide benchmark datasets for evaluating Vietnamese processing models: UIT-ViQuAD, ViNewsQA, UIT-VSFC, UIT-ViIC, UIT-ViNames, UIT-VSM…☆24Jun 19, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Conformer-based Metric GAN for speech enhancement☆419May 3, 2024Updated 2 years ago
- GEBI: Global Explanations for Bias Identification. Open source code for discovering bias in data with skin lesion dataset☆18Feb 20, 2022Updated 4 years ago
- alm0n for UET's viewgrade☆16Feb 7, 2023Updated 3 years ago
- Implementation of MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications☆23Sep 4, 2021Updated 4 years ago
- Binary classification problem that aims to classify human voices from audio recordings. Implemented using PyTorch and Librosa.☆38Aug 30, 2021Updated 4 years ago
- Speech Denoising using RNNs in Tensorflow☆24Apr 20, 2018Updated 8 years ago
- MultiSV: scripts for data preparation☆30Jan 18, 2025Updated last year