retkowsky / audio_embeddingsLinks

Audio search using Azure Cognitive Search

☆24

Alternatives and similar repositories for audio_embeddings

Users that are interested in audio_embeddings are comparing it to the libraries listed below

Sorting:

mogwai / nanodrz
Speaker Diarization with Transformers
☆69Updated 2 months ago
jasonppy / PromptingWhisper
Promting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation
☆148Updated last year
Vaibhavs10 / translate-with-whisper
☆158Updated 2 years ago
sanchit-gandhi / codesnippets
☆10Updated last year
miguelvalente / whisperer
Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.
☆137Updated last year
EndlessReform / smoltts
Open TTS models, built for streaming on the edge
☆43Updated 4 months ago
Vaibhavs10 / ml-with-audio
HF's ML for Audio study group
☆194Updated 2 years ago
ducanhdt / openai_whisper_finetuning
☆49Updated 2 years ago
masakhane-io / lafand-mt
MAFAND-MT
☆57Updated last year
sanchit-gandhi / seq2seq-speech
Repository for fine-tuning Transformers 🤗 based seq2seq speech models in JAX/Flax.
☆36Updated 2 years ago
jumon / zac
Zero-shot Audio Classification using Whisper
☆79Updated 2 years ago
knoriy / CLARA
☆62Updated last year
apple / pytorch-speech-features
☆85Updated last year
neulab / AfricanVoices
Hosts text-to-speech corpus and speech synthesizers for African languages.
☆17Updated 2 years ago
parambharat / whisper-finetuning
Repository contains code to fine-tune WhisperASR model
☆23Updated 2 years ago
huggingface / speechbox
☆359Updated last year
Open-Speech-EkStep / ULCA-asr-dataset-corpus
☆47Updated 2 years ago
clement-pages / gryannote
Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.
☆65Updated 2 weeks ago
prateekralhan / OpenAI_Whisper_ASR
A minimalistic automatic speech recognition streamlit based webapp powered by OpenAI's Whisper "State of the Art" models
☆66Updated 2 years ago
egorsmkv / asr-corpus-creator
This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.
☆27Updated last year
argmaxinc / OpenBench
Open-source reproducible benchmarks from Argmax
☆45Updated this week
huggingface / open_asr_leaderboard
☆116Updated 2 weeks ago
ylacombe / finetune-hf-vits
Finetune VITS and MMS using HuggingFace's tools
☆161Updated last year
krylm / whisper-event-tuning
Final training script from HuggingFace Whisper Fine tuning event - to get best results on finetuned model.
☆12Updated 2 years ago
german-asr / megs
A merged version of multiple open-source German speech datasets.
☆32Updated last year
indri-voice / audiotoken
Audio tokenization, in the fastest way possible!
☆52Updated 11 months ago
PranavPutsa1006 / Speaker-Diarization
Identifying individual speakers in an audio stream based on the unique characteristics found in individual voices using Python
☆18Updated 2 years ago
AI4Bharat / vistaar
Vistaar: Diverse Benchmarks and Training Sets for Indian Language ASR
☆60Updated 2 months ago
linto-ai / linto-diarization
Speaker diarization service
☆23Updated last month
google-research-datasets / Hinglish-TOP-Dataset
Consists of the largest (10K) human annotated code-switched semantic parsing dataset & 170K generated utterance using the CST5 augmentati…
☆41Updated 2 years ago