NbAiLab / nb-whisperLinks
β49Updated last year
Alternatives and similar repositories for nb-whisper
Users that are interested in nb-whisper are comparing it to the libraries listed below
Sorting:
- An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engineβ540Updated last year
- π¬ ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.β217Updated last year
- Python bindings for whisper.cppβ321Updated last month
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated textsβ348Updated last year
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts withβ¦β250Updated this week
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detectionβ892Updated 8 months ago
- β657Updated 4 months ago
- whisper.cpp bindings for pythonβ110Updated 2 years ago
- How to use OpenAIs Whisper to transcribe and diarize audio filesβ373Updated 3 years ago
- Real-time Voice Activity Detection (VAD) with some example use case like simple voice bot and live transcription (realtime transcription)β104Updated 5 months ago
- β497Updated last year
- β323Updated last year
- Minimal extension of OpenAI's Whisper adding speaker diarization with special tokensβ535Updated 2 years ago
- Efficient approach to speaker diarization using voice characteristics extractionβ106Updated 7 months ago
- Create an LJSpeech structured voice dataset on wave inputβ37Updated last year
- Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.β358Updated 2 years ago
- A testing repo to share code and thoughts on diarisationβ57Updated last year
- Real time speech to text transcription app.β434Updated 3 years ago
- G2Pβ403Updated 6 months ago
- Repository for the EM German Modelβ112Updated 2 years ago
- β556Updated last year
- openvino version of openai/whisperβ182Updated 2 years ago
- Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker Diarization (Dockerfile, CI image build and β¦β418Updated last week
- Whisper from OpenAi and diarization with Pyannoteβ51Updated 2 years ago
- Blueprint by Mozilla.ai for finetuning a Speech-To-Text model in your own languageβ61Updated 3 months ago
- β357Updated last year
- this master thesis project is based on OpenAI Whisper with the goal to transcibe interviewsβ48Updated last year
- Run different pipelines of WhisperX - Transcription, Diarization, VAD, Alignment completely OFFLINE.β45Updated 10 months ago
- Thorsten-Voice: A free to use, offline working, high quality german TTS voice should be available for every project without any license sβ¦β694Updated last week
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.β137Updated 2 years ago