huggingface / audio-transformers-course
The Hugging Face Course on Transformers for Audio
☆359Updated last week
Alternatives and similar repositories for audio-transformers-course:
Users that are interested in audio-transformers-course are comparing it to the libraries listed below
- ☆266Updated 7 months ago
- ☆348Updated 10 months ago
- HF's ML for Audio study group☆191Updated last year
- ☆325Updated 4 months ago
- Learning audio concepts from natural language supervision☆517Updated 3 months ago
- Code, Dataset, and Pretrained Models for Audio and Speech Large Language Model "Listen, Think, and Understand".☆404Updated 8 months ago
- MuAViC: A Multilingual Audio-Visual Corpus for Robust Speech Recognition and Robust Speech-to-Text Translation☆374Updated last year
- Implementation of Voicebox, new SOTA Text-to-speech network from MetaAI, in Pytorch☆629Updated 3 months ago
- Place where folks can contribute to 🤗 community events☆407Updated last year
- Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.☆275Updated last year
- NeMo text processing for ASR and TTS☆297Updated last week
- The Open Source Code of UniAudio☆537Updated 5 months ago
- Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.☆638Updated 5 months ago
- 🐸 - A general purpose model trainer, as flexible as it gets☆202Updated 10 months ago
- Promting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation☆139Updated last year
- Audio Large Language Models☆308Updated this week
- Keep track of big models in audio domain, including speech, singing, music etc.☆467Updated 3 months ago
- EMNLP 23 - Integrating Whisper Encoder to LLaMA Decoder for Generative ASR Error Correction☆244Updated 7 months ago
- PyTorch implementation of Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities.☆220Updated 3 months ago
- Official implementation of the paper "Acoustic Music Understanding Model with Large-Scale Self-supervised Training".☆332Updated 8 months ago
- ☆179Updated 2 years ago
- An Audio Language model for Audio Tasks☆298Updated 8 months ago
- Finetune VITS and MMS using HuggingFace's tools☆130Updated 9 months ago
- Audio Dataset for training CLAP and other models☆657Updated 11 months ago
- A collection of useful audio datasets and transforms for PyTorch.☆137Updated last year
- open-source audio datasets☆147Updated last year
- ☆62Updated last month
- A list of speech recognition learning resources including courses, books, tutorials, papers and toolkits.☆49Updated 7 months ago
- Implementation of Meta-Voicebox : The first generative AI model for speech to generalize across tasks with state-of-the-art performance.☆571Updated last year
- Library for Textless Spoken Language Processing☆530Updated last year