A PyTorch implementation of the paper Voice Separation with an Unknown Number of Multiple Speakers, using Gradio and the NVIDIA NeMo ASR model.
☆37Dec 28, 2023Updated 2 years ago
Alternatives and similar repositories for svoice_demo
Users that are interested in svoice_demo are comparing it to the libraries listed below.
- We provide a PyTorch implementation of the paper Voice Separation with an Unknown Number of Multiple Speakers, in which we present a new …☆1,317Nov 16, 2023Updated 2 years ago
- Using SepFormer☆10Feb 2, 2023Updated 3 years ago
- The second generation of VoiceFixer, a toolkit for general speech restoration. *Not affiliated with the original VoiceFixer repo*☆23Nov 19, 2023Updated 2 years ago
- Named Entity (NER) annotations of the Hebrew Treebank (Haaretz newspaper) corpus, including: morpheme and token level NER labels, nested …☆11Dec 27, 2021Updated 4 years ago
- Speech Separation☆21Mar 7, 2024Updated 2 years ago
- The source code for target sound detection☆15Feb 26, 2022Updated 4 years ago
- This repository contains the code for the paper "Exploiting Foundation Models and Speech Enhancement for Parkinson's Disease Detection fr…☆12Dec 19, 2025Updated 4 months ago
- ☆10Jul 27, 2021Updated 4 years ago
- SLT 2024 Mandarin Stuttering Event Detection and Automatic Speech Recognition Challenge☆12Jun 11, 2024Updated last year
- Official Implementation of our Interspeech 2021 paper "An Empirical Study on Channel Effects for Synthetic Voice Spoofing Countermeasure …☆18Feb 15, 2022Updated 4 years ago
- Notebooks and tests for 🤗 Diffusers library☆10Aug 6, 2023Updated 2 years ago
- ☆14Feb 15, 2022Updated 4 years ago
- PyTorch implementation of DANet for speech separation☆21Jan 9, 2020Updated 6 years ago
- Cocktail party problem solution using deep learning☆16Jan 26, 2018Updated 8 years ago
- A comparative analysis of voice spoofing detection systems, based on a paper available at https://arxiv.org/abs/2210.00417.☆17Oct 24, 2022Updated 3 years ago
- Official code for Dense-TSNet☆12Sep 17, 2024Updated last year
- PyTorch implementation of TinyWASE described in our paper "Compressing Speaker Extraction Model with Ultra-low Precision Quantization and…☆11Jun 28, 2021Updated 4 years ago
- This repository is created on top of two repositories i.e., yolov7 face detection and yolov7 blurring object☆14Jan 21, 2023Updated 3 years ago
- Roboshaul☆21Dec 31, 2025Updated 4 months ago
- A comparative analysis of speech signal processing algorithms for Parkinson’s disease classification and the use of the tunable Q-factor …☆15Dec 8, 2022Updated 3 years ago
- 9-qubit quantum support vector machine to identify Parkinson's disease based upon speech indicators.☆22May 14, 2020Updated 5 years ago
- TriNeRFLet: A Wavelet Based Multiscale Triplane NeRF Representation Code☆26Jul 7, 2024Updated last year
- High-level Rust library that binds to Poppler to extract text from a PDF☆11Dec 16, 2020Updated 5 years ago
- Code for reproducing the Stanford Alpaca InstructLLaMA result on consumer hardware☆19Mar 16, 2023Updated 3 years ago
- Documentation about the Tympan☆14Jun 14, 2022Updated 3 years ago
- Quantitative investing: exploring strategies for regular fixed-amount investment in index funds☆11Oct 21, 2017Updated 8 years ago
- VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram☆270Jul 25, 2024Updated last year
- Simple Video Summarization using Text-to-Segment Anything (Florence2 + SAM2) This project provides a video processing tool that utilizes…☆10Feb 20, 2025Updated last year
- Detects scene change or cuts in a video file☆11Oct 23, 2017Updated 8 years ago
- ☆12Oct 14, 2020Updated 5 years ago
- An unofficial implementation of Lite-RTSE, a cost-effective lite model for real-time speech enhancement☆14Nov 19, 2023Updated 2 years ago
- Materials for Edcon 2019 webcast☆11Mar 31, 2019Updated 7 years ago
- ☆11Jun 14, 2024Updated last year
- An efficient speech separation method☆275Apr 11, 2024Updated 2 years ago
- A chatbot web UI that supports various multi-modal large language models☆11May 8, 2023Updated 2 years ago
- Reinforcement learning for self-driving in a 3D simulation☆20Dec 6, 2021Updated 4 years ago
- Code for paper: KNN-BERT: Fine-Tuning Pre-Trained Models with KNN Classifier☆26Dec 5, 2021Updated 4 years ago
- ☆13Jun 24, 2021Updated 4 years ago
- A simple AI/ML tool for non-technical creatives☆11May 5, 2023Updated 2 years ago