jfgonsalves / parakeet-diarizedView external linksLinks
Parakeet 0.6b V2 + Pyannote diarization behind a Whisper API
☆63Nov 4, 2025Updated 3 months ago
Alternatives and similar repositories for parakeet-diarized
Users that are interested in parakeet-diarized are comparing it to the libraries listed below
Sorting:
- Transcribing audio files on Modal with open source ASR models is fast, cheap, and easy!☆18Jul 25, 2025Updated 6 months ago
- Text Match Cut Video Generator Web App☆36Aug 26, 2025Updated 5 months ago
- Official Implementation and Dataset of paper - DFADD: The Diffusion and Flow-matching based Audio Deepfake Dataset☆15Apr 7, 2025Updated 10 months ago
- ☆24Mar 29, 2025Updated 10 months ago
- ☆19Jan 8, 2025Updated last year
- Glow-TTS with Stochastic Duration Predictor and Stochastic Pitch Predictor☆18Jun 5, 2023Updated 2 years ago
- A transcription text editor with respeak module☆14Jan 24, 2026Updated 3 weeks ago
- Timed-transcript editor component built using Draft.js.☆45Aug 20, 2018Updated 7 years ago
- LinkedIn Lead Scraper - Automated Profile Discovery & Lead Generation Tool☆25Jan 21, 2026Updated 3 weeks ago
- WebAMS is an Open Source web application for reporting and resolving incidents or tickets☆10Dec 11, 2022Updated 3 years ago
- Supervoice diffusion enhance☆28Jul 15, 2024Updated last year
- This repository is the official implementation of our paper "Improving Generalization for AI-Synthesized Voice Detection", which has been…☆22Jan 13, 2026Updated last month
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆26Aug 5, 2024Updated last year
- ☆30Oct 29, 2024Updated last year
- Example: Micro speech for TensorFlow Lite☆34Dec 18, 2023Updated 2 years ago
- ☆29Jan 6, 2026Updated last month
- Official implementation of paper: Shallow Flow Matching for Coarse-to-Fine Text-to-Speech Synthesis☆50Sep 20, 2025Updated 4 months ago
- ☆29Jun 30, 2025Updated 7 months ago
- DiTTo-TTS: Diffusion Transformers for Scalable Text-to-Speech without Domain-Specific Factors☆36Feb 11, 2025Updated last year
- PitchVC: Pitch Conditioned Any-to-Many Voice Conversion☆36Jun 6, 2024Updated last year
- ☆28Dec 14, 2021Updated 4 years ago
- ☆38Apr 3, 2025Updated 10 months ago
- ☆36Oct 15, 2024Updated last year
- JATTS: A modern, research-oriented Japanese Text-to-speech Open-sourced Toolkit☆44May 26, 2025Updated 8 months ago
- iOS ViewStates☆14Aug 27, 2018Updated 7 years ago
- ☆38Apr 15, 2024Updated last year
- Speech-to-text transcription VST3/ARA plugin☆53Feb 2, 2026Updated 2 weeks ago
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR☆11Oct 21, 2022Updated 3 years ago
- Transform messy HTML from Google Docs into well-structured HTML!☆13Jul 10, 2025Updated 7 months ago
- A single node.js script to automatically inject user/password to http proxy server via a local forwarder☆11Nov 21, 2019Updated 6 years ago
- convert formatted text to markdown☆13Dec 29, 2025Updated last month
- ☆11Aug 11, 2023Updated 2 years ago
- Russian phonetical transcription☆11Nov 19, 2025Updated 2 months ago
- Manage a Google Drive Service Account visually☆12Oct 17, 2024Updated last year
- Whisper finetuning☆15Apr 9, 2025Updated 10 months ago
- Official repository of the work "Low-complexity Unsupervised Audio Anomaly Detection exploiting Separable Convolutions and Angular Loss" …☆10Nov 6, 2024Updated last year
- A tool to collect/validate audio recordings from workers on Amazon Mechanical Turk. Written in Python/Flask. (originally hosted on github…☆14Dec 19, 2022Updated 3 years ago
- Fast Event Video Annotation Tool☆14Oct 29, 2023Updated 2 years ago
- ☆14Feb 19, 2024Updated last year