☆493Sep 10, 2025Updated 5 months ago
Alternatives and similar repositories for nlp
Users that are interested in nlp are comparing it to the libraries listed below
- Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker…☆9,274Feb 20, 2026Updated 2 weeks ago
- ☆663Sep 24, 2025Updated 5 months ago
- Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper☆5,395Feb 23, 2026Updated last week
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with…☆252Feb 10, 2026Updated 3 weeks ago
- How to use OpenAI's Whisper to transcribe and diarize audio files☆373Oct 12, 2022Updated 3 years ago
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)☆20,368Feb 22, 2026Updated last week
- Build real time speech2text web apps using OpenAI's Whisper https://openai.com/blog/whisper/☆833Sep 12, 2025Updated 5 months ago
- Minimal extension of OpenAI's Whisper adding speaker diarization with special tokens☆539Nov 6, 2023Updated 2 years ago
- ☆18Nov 8, 2022Updated 3 years ago
- Streaming transcriber with whisper☆696May 1, 2023Updated 2 years ago
- Multilingual Automatic Speech Recognition with word-level timestamps and confidence☆2,769Sep 9, 2025Updated 5 months ago
- Transcription, forced alignment, and audio indexing with OpenAI's Whisper☆2,169Oct 29, 2025Updated 4 months ago
- A python package to build AI-powered real-time audio applications☆1,938Feb 12, 2025Updated last year
- ☆357Mar 17, 2024Updated last year
- My girlfriend wants me to stop swearing. Let's ask Whisper for some help.☆16Oct 12, 2022Updated 3 years ago
- openvino version of openai/whisper☆182Nov 6, 2023Updated 2 years ago
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts☆348Nov 12, 2024Updated last year
- Faster Whisper transcription with CTranslate2☆21,289Nov 19, 2025Updated 3 months ago
- Podalize: Podcast Transcription and Analysis☆160Sep 8, 2024Updated last year
- ☆19Nov 4, 2022Updated 3 years ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆119Feb 4, 2023Updated 3 years ago
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech☆11May 14, 2025Updated 9 months ago
- JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.☆4,686Apr 3, 2024Updated last year
- Speaker prediction for captions on the Lex Fridman podcast☆27Feb 14, 2024Updated 2 years ago
- ☆8,818Oct 25, 2025Updated 4 months ago
- Clustering-based methods for overlapping diarization☆82Jan 12, 2024Updated 2 years ago
- Port of OpenAI's Whisper model in C/C++☆47,262Updated this week
- Transcription and Diarization based on OpenAI's Whisper☆25Sep 9, 2025Updated 5 months ago
- Non Parallel Voice Conversion based on VITS☆24Mar 31, 2023Updated 2 years ago
- ☆38Dec 26, 2022Updated 3 years ago
- ☆13Dec 7, 2022Updated 3 years ago
- Accelerate Whisper tasks such as transcription by multiprocessing through parallelization☆25Oct 29, 2022Updated 3 years ago
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks☆17Aug 18, 2023Updated 2 years ago
- Convert a directory of .vtt or json transcripts into a fast searchable database☆19Oct 7, 2024Updated last year
- Robust Speech Recognition via Large-Scale Weak Supervision☆19Dec 1, 2022Updated 3 years ago
- generate granular word-level captions in srt format☆57Sep 26, 2022Updated 3 years ago
- A curated list of awesome OpenAI's Whisper☆104Sep 17, 2023Updated 2 years ago
- Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.☆4,049Jan 8, 2025Updated last year
- A composition of offline tools to achieve high quality multilingual speech to text transcription☆23Feb 2, 2026Updated last month