simple to use, pretrained/training-less models for speaker diarization
☆21Aug 23, 2023Updated 2 years ago
Alternatives and similar repositories for pydiar
Users that are interested in pydiar are comparing it to the libraries listed below
Sorting:
- Docker for building an environment for Dutch online and offline ASR.☆12Feb 2, 2021Updated 5 years ago
- Python3 code for the IEEE SPL paper "Auto-Tuning Spectral Clustering for SpeakerDiarization Using Normalized Maximum Eigengap"☆11Apr 6, 2020Updated 5 years ago
- Sing any popular song with your voice☆11Jul 10, 2022Updated 3 years ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Jun 2, 2023Updated 2 years ago
- Crawling and creating a German language model resource☆18Aug 23, 2022Updated 3 years ago
- EMPHASIS: An Emotional Phoneme-based Acoustic Model for Speech Synthesis System☆15Mar 31, 2019Updated 6 years ago
- Open Source Speech/Text Data on AI☆19Sep 13, 2022Updated 3 years ago
- singing voice conversion without f0☆23May 10, 2023Updated 2 years ago
- music semantic understanding evaluation benchmark☆25Aug 12, 2023Updated 2 years ago
- **ICASSP 2022** 《Toward Degradation-Robust Voice Conversion》Using speech enhancement and end-to-end denoising training to improve degrada…☆24Sep 27, 2022Updated 3 years ago
- Finally, some decent sample sentences☆23Dec 3, 2023Updated 2 years ago
- ☆29Feb 24, 2024Updated 2 years ago
- ☆55Aug 11, 2022Updated 3 years ago
- A collection of utilities for handling IPA phones.☆26Sep 24, 2023Updated 2 years ago
- A toolset for easy formant extraction and visualization from wav files and TTS models☆33Sep 2, 2022Updated 3 years ago
- ☆33Jan 14, 2023Updated 3 years ago
- This tool provide a way to build Django RESTful projects based on your database☆30Oct 21, 2021Updated 4 years ago
- edaSQL is a python library to bridge the SQL with Exploratory Data Analysis where you can connect to the Database and insert the queries.…☆10Nov 14, 2021Updated 4 years ago
- A lightweight implementation of shapes drawn across a geo-temporal plane.☆12Jan 27, 2026Updated last month
- A minimum inference engine for DiffSinger☆37Apr 5, 2024Updated last year
- A tool to paste Excel ranges to Reddit☆11Sep 20, 2025Updated 5 months ago
- ☆32Mar 15, 2022Updated 3 years ago
- Big Data Analysis of Tinder done at Universitat Rovira i Virgili and Universitat Politècnica de Catalunya · BarcelonaTech☆13Jan 3, 2023Updated 3 years ago
- GlottDNN vocoder and tools for training DNN excitation models☆32Feb 27, 2021Updated 5 years ago
- canvas-based talking head model using viseme data☆32Sep 4, 2023Updated 2 years ago
- On-device noise suppression powered by deep learning☆83Updated this week
- SpeechNAS-Better-Trade-off-between-Latency-and-Accuracy-for-Large-Scale-Speaker-Verification☆30Mar 24, 2023Updated 2 years ago
- VoiceBank-2023 is the speech corpus specially designed for constructing personalized Mandarin text-to-speech (TTS) systems.☆41Jan 4, 2026Updated last month
- ☆31Jul 13, 2023Updated 2 years ago
- ☆10Jul 6, 2023Updated 2 years ago
- This is a repository for georeferencing of pushbroom hyperspectral imagery and includes ray-intersection, orthorectification and a coregi…☆11Oct 23, 2024Updated last year
- [ARCHIVED] ✨ Full-stack school homepage / TypeScript, Remix (React), Prisma, CI/CD and more☆12Sep 26, 2022Updated 3 years ago
- Multiprocessing in python☆10Aug 20, 2021Updated 4 years ago
- ☆11Jul 3, 2020Updated 5 years ago
- The first OpenSource Mafia Bot!☆10Oct 5, 2023Updated 2 years ago
- Coqui Inference Engine☆40Aug 3, 2021Updated 4 years ago
- A python-tabulate wrapper for producing tables from generators☆57Nov 5, 2022Updated 3 years ago
- Simple Kaldi model server for chain (nnet3) models in online recognition mode directly from a local microphone☆35Feb 18, 2022Updated 4 years ago
- A data management platform for the web☆11Feb 2, 2026Updated last month