Parakeet 0.6b V2 + Pyannote diarization behind a Whisper API
☆65Feb 21, 2026Updated 2 weeks ago
Alternatives and similar repositories for parakeet-diarized
Users who are interested in parakeet-diarized are comparing it to the libraries listed below.
- Transcribing audio files on Modal with open source ASR models is fast, cheap, and easy!☆18Jul 25, 2025Updated 7 months ago
- Official Implementation and Dataset of paper - DFADD: The Diffusion and Flow-matching based Audio Deepfake Dataset☆15Apr 7, 2025Updated 11 months ago
- Text Match Cut Video Generator Web App☆36Feb 19, 2026Updated 2 weeks ago
- ☆25Mar 29, 2025Updated 11 months ago
- ☆19Jan 8, 2025Updated last year
- Glow-TTS with Stochastic Duration Predictor and Stochastic Pitch Predictor☆18Jun 5, 2023Updated 2 years ago
- A transcription text editor with respeak module☆14Jan 24, 2026Updated last month
- ETL pipeline, graphical explorer and general toolbox for investigations with follow the money data☆25Jul 15, 2025Updated 7 months ago
- Timed-transcript editor component built using Draft.js.☆45Aug 20, 2018Updated 7 years ago
- WebAMS is an Open Source web application for reporting and resolving incidents or tickets☆10Dec 11, 2022Updated 3 years ago
- This repository is the official implementation of our paper "Improving Generalization for AI-Synthesized Voice Detection", which has been…☆23Jan 13, 2026Updated last month
- My hybrid TTS network that combines VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆26Aug 5, 2024Updated last year
- Supervoice diffusion enhance☆28Jul 15, 2024Updated last year
- Example: Micro speech for TensorFlow Lite☆35Dec 18, 2023Updated 2 years ago
- ☆31Oct 29, 2024Updated last year
- LinkedIn Lead Scraper - Automated Profile Discovery & Lead Generation Tool☆27Jan 21, 2026Updated last month
- ☆29Jan 6, 2026Updated 2 months ago
- Official implementation of paper: Shallow Flow Matching for Coarse-to-Fine Text-to-Speech Synthesis☆51Sep 20, 2025Updated 5 months ago
- ☆30Jun 30, 2025Updated 8 months ago
- DiTTo-TTS: Diffusion Transformers for Scalable Text-to-Speech without Domain-Specific Factors☆37Feb 11, 2025Updated last year
- PitchVC: Pitch Conditioned Any-to-Many Voice Conversion☆36Jun 6, 2024Updated last year
- ☆38Apr 3, 2025Updated 11 months ago
- ☆28Dec 14, 2021Updated 4 years ago
- ☆36Oct 15, 2024Updated last year
- JATTS: A modern, research-oriented Japanese Text-to-speech Open-sourced Toolkit☆44May 26, 2025Updated 9 months ago
- ☆39Apr 15, 2024Updated last year
- Speech-to-text transcription VST3/ARA plugin☆54Feb 2, 2026Updated last month
- eForms is the new notification standard for public procurement procedures in the EU. The TED XML Data Converter is an XSLT project to con…☆12Oct 10, 2024Updated last year
- ☆14Feb 19, 2024Updated 2 years ago
- Learning an Interpretable End-to-End Network for Real-Time Acoustic Beamforming☆15Aug 20, 2024Updated last year
- Russian phonetic transcription☆11Nov 19, 2025Updated 3 months ago
- ☆13Oct 9, 2025Updated 5 months ago
- ☆46Apr 16, 2023Updated 2 years ago
- Convert formatted text to Markdown☆13Dec 29, 2025Updated 2 months ago
- ☆11Aug 11, 2023Updated 2 years ago
- Code for the paper "IFFNeRF: 6D Pose Estimation from a Single Image and a 3D Gaussian Splatting Model"☆12May 26, 2024Updated last year
- Whisper finetuning☆16Apr 9, 2025Updated 11 months ago
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR☆11Oct 21, 2022Updated 3 years ago
- Fixed it, so that years actually make sense, instead of AD and BC nonsense☆14Mar 21, 2025Updated 11 months ago