projectlucas/efficient_whisper

Robust Speech Recognition via Large-Scale Weak Supervision

☆19

Alternatives and similar repositories for efficient_whisper

Users who are interested in efficient_whisper are comparing it to the libraries listed below.

litagin02 / Aivis-Dataset
💠 Aivis: AI Voice Imitation System
☆27 · Feb 25, 2024 · Updated 2 years ago
sarulab-speech / whisper-asr-finetune
☆32 · Dec 4, 2022 · Updated 3 years ago
Hypotheses-Paradise / UADF
☆17 · May 5, 2024 · Updated 2 years ago
Hiroshiba / openjtalk-label-getter
☆10 · Dec 10, 2021 · Updated 4 years ago
h-munakata / Lighthouse-Wrapper-for-Audio-Moment-Retrieval
☆13 · Mar 23, 2026 · Updated 4 months ago
duyichao / NPDA-KNN-ST
Official implementation of EMNLP'2022 paper "Non-Parametric Domain Adaptation for End-to-End Speech Translation"
☆11 · Oct 26, 2022 · Updated 3 years ago
apple-yinhan / Noise-robust-SED
☆14 · Jan 2, 2025 · Updated last year
FairyDevicesRD / thinklet.squid.run
Stream directly from THINKLET to YouTube Live
☆10 · Dec 10, 2024 · Updated last year
satoshin21 / JUNSON
decoding and encoding JSON library for Swift3 - more easily, or more strictly.
☆11 · Dec 15, 2023 · Updated 2 years ago
fakerybakery / simpletts
A lightweight Python library for running TTS models with a unified API.
☆20 · Feb 18, 2025 · Updated last year
audiodemo / voice-conversion
Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks
☆17 · Aug 18, 2023 · Updated 2 years ago
ArenAcikgoz / Whisper-Alignment
Forced alignment decoder for Whisper.
☆16 · Mar 13, 2024 · Updated 2 years ago
thamquocdung / eCMU
eCMU: An Efficient Phase-aware Framework for Music Source Separation with Conformer (IEEE RIVF23)
☆10 · Oct 30, 2024 · Updated last year
alexisdmacintyre / SpeechBreathingToolbox
Tools for the automatic detection of speech-related inhalation events and characterisation of the speech respiratory cycle.
☆11 · Feb 17, 2024 · Updated 2 years ago
ductuantruong / speaker_age_estimation_ssl_study
[APSIPA'22] Exploring Speaker Age Estimation on Different Self-Supervised Learning Models
☆14 · Oct 19, 2022 · Updated 3 years ago
boson-ai / WildASR-public
Revisiting ASR in the Age of Voice Agents [COLM26]
☆24 · Apr 13, 2026 · Updated 3 months ago
rithiksachdev / PostASR-Correction-SLT2024
☆18 · Jul 22, 2024 · Updated 2 years ago
ORI-Muchim / Efficient-Speech
Lightweight Korean TTS Model based on FastSpeech2
☆15 · Mar 4, 2026 · Updated 4 months ago
huangyz0918 / kws-continual-learning
[ICASSP'22] Continual Learning Benchmark for Spoken Keyword Spotting
☆17 · Jun 7, 2022 · Updated 4 years ago
tonnetonne814 / PITS-44100-Ja
A version of PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor that supports 44100 Hz Japanese audio sources.
☆21 · May 2, 2023 · Updated 3 years ago
NTIA / alignnet
Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.
☆18 · Aug 1, 2025 · Updated 11 months ago
isaacOnline / whisper
Robust Speech Recognition via Large-Scale Weak Supervision
☆13 · Oct 28, 2023 · Updated 2 years ago
TUM-Dev / gocast-voice-service
Microservice that generates subtitles for TUM-Live
☆19 · Apr 24, 2026 · Updated 3 months ago
xmos / sln_voice
XCORE-VOICE Solution
☆20 · Apr 8, 2026 · Updated 3 months ago
Speech-Arena / speech_df_arena
☆40 · Feb 26, 2026 · Updated 4 months ago
nadare881 / voice-changer-vector-search
This is a repository for comparing voice changer results and searching datasets and trained models.
☆30 · May 21, 2023 · Updated 3 years ago
JosefAlbers / e2tts-mlx
Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS (E2 TTS) in MLX
☆29 · Oct 15, 2024 · Updated last year
adefossez / audio_mod_idessai
Repo for the IDESSAI 2024 course on modeling audio with discrete tokens.
☆13 · Sep 13, 2024 · Updated last year
YUCHEN005 / RATS-Channel-A-Speech-Data
This is a public repository for RATS Channel-A Speech Data, which is a chargeable noisy speech dataset under LDC. Here we release its Log…
☆16 · Oct 22, 2022 · Updated 3 years ago
R1ckShi / FrontEnd-AEC
Acoustic echo cancellation (AEC) is a key algorithm in the pipeline of acoustic devices with KWS or ASR. FNLMS is used.
☆19 · Apr 22, 2019 · Updated 7 years ago
ANonEntity / WhisperWithVAD
Whisper combined with Silero VAD, for improved long-form transcriptions
☆55 · Dec 11, 2022 · Updated 3 years ago
Syuparn / TextGridConverter
convert .lab files to .TextGrid files, which can be used in Praat
☆14 · Nov 2, 2018 · Updated 7 years ago
unilight / jatts
JATTS: A modern, research-oriented Japanese Text-to-speech Open-sourced Toolkit
☆43 · Mar 13, 2026 · Updated 4 months ago
lukaszliniewicz / breath-removal
Detect and remove or lower the volume of breathing in speech recordings.
☆17 · May 14, 2025 · Updated last year
kjw11 / Speaker-Aware-CTC
Speaker-aware CTC (SACTC) for multi-talker overlapped speech recognition.
☆22 · May 26, 2025 · Updated last year
Sreyan88 / ReCLAP
☆33 · Dec 23, 2025 · Updated 7 months ago
tarun360 / SpeakerProfiling
Estimating the Age, Height, and Gender of a speaker with their speech signal.
☆15 · Sep 19, 2022 · Updated 3 years ago
Zhongxu-Wang / ArtSpeech
ArtSpeech: Adaptive Text-to-Speech Synthesis with Articulatory Representations
☆22 · Sep 21, 2025 · Updated 10 months ago
yoyolicoris / variational-diffwave
☆32 · Jul 27, 2022 · Updated 3 years ago