luigisaetta / whisper-appLinks

This repository contains all the work I have done (and I'm doing) in developing a web app for speech-to-text, based on OpenAI Whisper

☆9

Alternatives and similar repositories for whisper-app

Users that are interested in whisper-app are comparing it to the libraries listed below

Sorting:

ccoreilly / wav2vec2-service
☆38Updated 3 years ago
huggingface / open_asr_leaderboard
☆104Updated 3 weeks ago
efeslab / LiteASR
LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation
☆110Updated last month
ylacombe / finetune-hf-vits
Finetune VITS and MMS using HuggingFace's tools
☆156Updated last year
revdotcom / speech-datasets
Various speech datasets made available to the public
☆122Updated 6 months ago
huggingface / diarizers
☆296Updated last year
jasonppy / PromptingWhisper
Promting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation
☆146Updated last year
jumon / zac
Zero-shot Audio Classification using Whisper
☆79Updated 2 years ago
egorsmkv / asr-corpus-creator
This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.
☆27Updated last year
Open-Speech-EkStep / indic-punct
☆43Updated 2 years ago
FrenchKrab / IS2023-powerset-diarization
Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.
☆84Updated last year
roatienza / efficientspeech
PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023.
☆170Updated last year
patrickvonplaten / Wav2Vec2_PyCTCDecode
Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode
☆111Updated 2 years ago
clement-pages / gryannote
Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.
☆62Updated 3 weeks ago
bayartsogt-ya / whisper-multiple-hf-datasets
Whisper fine-tuning event script to use multiple hf datasets
☆32Updated 2 years ago
Open-Speech-EkStep / ULCA-asr-dataset-corpus
☆46Updated 2 years ago
krylm / whisper-event-tuning
Final training script from HuggingFace Whisper Fine tuning event - to get best results on finetuned model.
☆12Updated 2 years ago
mogwai / nanodrz
Speaker Diarization with Transformers
☆68Updated 2 weeks ago
mesolitica / vllm-whisper
A high-throughput and memory-efficient inference and serving engine for Whisper, https://mesolitica.com/blog/vllm-whisper
☆28Updated 11 months ago
NeuralVox / OpenPhonemizer
An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…
☆99Updated 8 months ago
egorsmkv / optimized-whisper
Use quantized versions of Whisper to speed up inference
☆12Updated 8 months ago
scart97 / thunder-speech
A Hackable speech recognition library.
☆25Updated 8 months ago
ANonEntity / WhisperWithVAD
Whisper combined with Silero VAD, for improved long-form transcriptions
☆52Updated 2 years ago
hlt-mt / mosel
Collection of Open Source Speech Data
☆159Updated 7 months ago
NVIDIA / RAD-MMM
A TTS model that makes a speaker speak new languages
☆76Updated last year
skrbnv / javad
☆56Updated 5 months ago
kurianbenoy / whisper_normalizer
A python package for whisper normalizer
☆62Updated last week
shivammehta25 / OverFlow
Putting flows on top of neural transducers for better TTS
☆62Updated this week
besacier / ASR2022
☆56Updated 2 years ago
huggingface / speechbox
☆359Updated last year