Robust Speech Recognition via Large-Scale Weak Supervision
β19Dec 1, 2022Updated 3 years ago
Alternatives and similar repositories for efficient_whisper
Users that are interested in efficient_whisper are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- π Aivis: AI Voice Imitation Systemβ27Feb 25, 2024Updated 2 years ago
- β32Dec 4, 2022Updated 3 years ago
- β39Feb 26, 2026Updated 4 months ago
- β17May 5, 2024Updated 2 years ago
- Detect and remove or lower the volume of breathing in speech recordings.β16May 14, 2025Updated last year
- Managed Database hosting by DigitalOcean β’ AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- β13Mar 23, 2026Updated 3 months ago
- β14Jan 2, 2025Updated last year
- β15Jan 26, 2025Updated last year
- β15Feb 18, 2022Updated 4 years ago
- Code for the paper: MACE: Leveraging Audio for Evaluating Audio Captioning Systemsβ13Jan 16, 2025Updated last year
- THINKLETγγη΄ζ₯ Youtube Live γ«γΉγγͺγΌγγ³γ°ι δΏ‘γγγβ10Dec 10, 2024Updated last year
- ESPNet TTS with Streamlit GUIβ14Apr 30, 2023Updated 3 years ago
- decoding and encoding JSON library for Swift3 - more easily, or more strictly.β11Dec 15, 2023Updated 2 years ago
- A lightweight Python library for running TTS models with a unified API.β20Feb 18, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networksβ17Aug 18, 2023Updated 2 years ago
- Forced alignment decoder for Whisper.β16Mar 13, 2024Updated 2 years ago
- Tutorials for FLAVA model https://arxiv.org/abs/2112.04482β12Jun 22, 2022Updated 4 years ago
- β10Dec 10, 2021Updated 4 years ago
- eCMU: An Efficient Phase-aware Framework for Music Source Separation with Conformer (IEEE RIVF23)β10Oct 30, 2024Updated last year
- Tools for the automatic detection of speech-related inhalation events and characterisation of the speech respiratory cycle.β11Feb 17, 2024Updated 2 years ago
- [APSIPA'22] Exploring Speaker Age Estimation on Different Self-Supervised Learning Modelsβ14Oct 19, 2022Updated 3 years ago
- Robust Speech Recognition via Large-Scale Weak Supervisionβ13Oct 28, 2023Updated 2 years ago
- β18Jul 22, 2024Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer β’ AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- 44100Hzζ₯ζ¬θͺι³ζΊγ«ε―ΎεΏγγ PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor γ§γγβ21May 2, 2023Updated 3 years ago
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.β18Aug 1, 2025Updated 11 months ago
- β14Jul 8, 2020Updated 5 years ago
- convert .lab files to .TextGrid files, which can be used in Praatβ14Nov 2, 2018Updated 7 years ago
- Microservice that generates subtitles for TUM-Liveβ18Apr 24, 2026Updated 2 months ago
- Attention-Enhanced Short-Time Wiener Solution for Acoustic Echo Cancellationβ30Nov 12, 2025Updated 7 months ago
- This is a repository for comparing voice changer results and searching datasets and trained models.β30May 21, 2023Updated 3 years ago
- This is a public repository for RATS Channel-A Speech Data, which is a chargeable noisy speech dataset under LDC. Here we release its Logβ¦β16Oct 22, 2022Updated 3 years ago
- Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS (E2 TTS) in MLXβ29Oct 15, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Whisper combined with Silero VAD, for improved long-form transcriptionsβ55Dec 11, 2022Updated 3 years ago
- Acoustic echo cancelation(AEC) is a main algorithm in the pipe line of acoustic devices with KWS or ASR. FNLMS is used.β19Apr 22, 2019Updated 7 years ago
- Repo for the IDESSAI 2024 course on modeling audio with discrete tokens.β13Sep 13, 2024Updated last year
- Reimplementation of speech decoding 2022 paper by MetaAIβ14Oct 17, 2023Updated 2 years ago
- JATTS: A modern, research-oriented Japanese Text-to-speech Open-sourced Toolkitβ43Mar 13, 2026Updated 3 months ago
- Speaker-aware CTC (SACTC) for multi-talker overlapped speech recognition.β22May 26, 2025Updated last year
- Estimating the Age, Height, and Gender of a speaker with their speech signal.β15Sep 19, 2022Updated 3 years ago