sanchit-gandhi/whisper-flash-attention

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/sanchit-gandhi/whisper-flash-attention)

sanchit-gandhi / whisper-flash-attention

☆21

Alternatives and similar repositories for whisper-flash-attention

Users that are interested in whisper-flash-attention are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

huhailinguist / ChineseNLIProbing
View on GitHub
☆10Oct 17, 2021Updated 4 years ago
30stomercury / hmm-backprop
View on GitHub
Fast and differentiable hidden Markov model in C++
☆19Jan 20, 2023Updated 3 years ago
Yolanda-Gao / VoiceGANmodel
View on GitHub
☆19Feb 28, 2018Updated 8 years ago
rhoposit / icassp2021
View on GitHub
☆15May 8, 2021Updated 5 years ago
sholiday / cppBKTree
View on GitHub
A BKTree written in C++
☆11Jul 8, 2011Updated 15 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
yangdongchao / Omni-AutoThink
View on GitHub
Adaptive Multimodal Reasoning via Reinforcement Learning
☆23Jan 11, 2026Updated 6 months ago
hfutami / distill-bert-for-seq2seq-asr
View on GitHub
☆24Jun 17, 2020Updated 6 years ago
sooftware / jasper
View on GitHub
PyTorch implementation of "Jasper: An End-to-End Convolutional Neural Acoustic Model" (INTERSPEECH 2019)
☆32Mar 4, 2021Updated 5 years ago
sooftware / tacotron2
View on GitHub
Pytorch implementation of "Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions", ICASSP, 2018.
☆19Jan 21, 2021Updated 5 years ago
open-speech / kaldi-io
View on GitHub
c++ Kaldi IO lib (static and dynamic).
☆25Nov 26, 2018Updated 7 years ago
smallflyingpig / speech-to-image-translation-without-text
View on GitHub
Code for paper "direct speech-to-image translation"
☆26Jun 8, 2020Updated 6 years ago
daandouwe / ngram-lm
View on GitHub
A simple n-gram language model.
☆12Sep 11, 2018Updated 7 years ago
sooftware / speech-paper-review
View on GitHub
Review of papers I read
☆14Dec 11, 2020Updated 5 years ago
zzw922cn / LPC_for_TTS
View on GitHub
Linear Prediction Coefficients estimation from mel-spectrogram implemented in Python based on Levinson-Durbin algorithm.
☆72Mar 19, 2021Updated 5 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
cpdu / vallt
View on GitHub
☆36Mar 14, 2025Updated last year
jefflai108 / Semi-Supervsied-Spoken-Language-Understanding-PyTorch
View on GitHub
Semi-supervised spoken language understanding (SLU) via self-supervised speech and language model pretraining
☆12Mar 23, 2021Updated 5 years ago
fangfm / lcnn
View on GitHub
A TensorFlow implementation of light convolutional neural network (LCNN)
☆12Dec 27, 2018Updated 7 years ago
TakHemlata / T-EER
View on GitHub
Official PyTorch implementation of "t-EER: Parameter-Free Tandem Evaluation Metric of Countermeasures and Biometric Comparators"
☆14Sep 25, 2023Updated 2 years ago
speech-paper-reading / speech-paper-reading
View on GitHub
Repository for speech paper reading
☆33Aug 19, 2021Updated 4 years ago
tunib-ai / transformers
View on GitHub
🚀 Implementation of easy-to-use 3D parallelism based on Huggingface Transformers & Microsoft DeepSpeed
☆31Feb 5, 2022Updated 4 years ago
robin1001 / vad
View on GitHub
simple energy vad
☆19Jun 3, 2017Updated 9 years ago
TUT-ARG / DCASE2016-baseline-system-matlab
View on GitHub
☆13Jan 10, 2017Updated 9 years ago
lstrgar / ss-phoneme-seg
View on GitHub
Code for "Phoneme Segmentation Using Self-Supervised Speech Models", Strgar & Harwath, Proceedings of the IEEE Spoken Language Technology…
☆55Nov 4, 2022Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
ymoslem / MT-Tools
View on GitHub
Collection of Common Machine Translation Tools
☆11Jul 26, 2022Updated 4 years ago
npuichigo / tarzan
View on GitHub
High-level API for tar-based dataset
☆12Feb 3, 2024Updated 2 years ago
kmbmjn / search_conference_name_of_paper
View on GitHub
☆11Jun 4, 2021Updated 5 years ago
muskang48 / Speaker-Diarization
View on GitHub
This project is about performing Speaker diarization for Hindi Language.
☆58Mar 21, 2021Updated 5 years ago
Raghvender1205 / AI_From_Scratch
View on GitHub
Into the depths of some concepts of Artificial Intelligence and Machine Learning
☆10Apr 4, 2026Updated 3 months ago
MokkeMeguru / TFGENZOO
View on GitHub
Library about construction helper for Generative models e.g. Flow-based Model with Tensorflow 2.x.
☆12Feb 16, 2023Updated 3 years ago
Yuan-ManX / audio-ai-agent
View on GitHub
Here we will track the latest Audio AI Agent, including speech, music, sound effects, etc.
☆16Dec 8, 2023Updated 2 years ago
huaxiuyao / KGML
View on GitHub
KGML for EMNLP 2021
☆10Feb 2, 2022Updated 4 years ago
SeongokRyu / my-study-materials
View on GitHub
☆13Jul 4, 2020Updated 6 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
SVDDChallenge / CtrSVDD_Utils
View on GitHub
☆18Jan 10, 2024Updated 2 years ago
gillesdegottex / pulsemodel
View on GitHub
Pulse Model vocoder
☆42Dec 5, 2018Updated 7 years ago
Kazuhito00 / onnx-model-encrypt-sample
View on GitHub
ONNXモデルをpyca/cryptographyを用いて暗号化/復号化するサンプル
☆16Mar 19, 2022Updated 4 years ago
miquelindia90 / DoubleAttentionSpeakerVerification
View on GitHub
Pytorch implemenation of the model proposed in the paper: Double Multi-Head Attention for Speaker Verification
☆19Jul 25, 2024Updated 2 years ago
cpii-cai / PunCantonese
View on GitHub
A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts
☆15Dec 3, 2024Updated last year
zhengmidon / singaligner
View on GitHub
a compact audio-to-phoneme aligner for singing voice
☆12Jan 17, 2024Updated 2 years ago
warnikchow / kosp2e
View on GitHub
Korean Speech to English Translation Corpus
☆45Sep 3, 2021Updated 4 years ago