jonnor / brewing-audio-event-detection
Tracking beer/wine using Audio Event Detection with Machine Learning
☆15 Updated last year
Alternatives and similar repositories for brewing-audio-event-detection
Users who are interested in brewing-audio-event-detection are comparing it to the repositories listed below.
- Phoneme alignment representation compatible with multiple forced aligners ☆21 Updated last year
- Forced alignment decoder for Whisper. ☆14 Updated last year
- A wrapper for Audeering's wav2vec-based dimensional speech emotion recognition ☆18 Updated 2 years ago
- ☆13 Updated 3 years ago
- ☆11 Updated 4 years ago
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts ☆15 Updated 11 months ago
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments. ☆18 Updated 3 months ago
- ☆15 Updated last year
- Repository for reproducing result in journal "Self-supervised learning for Speech Emotion Recognition" ☆10 Updated 2 years ago
- A Weakly Supervised Forced Alignment for disfluent speech ☆15 Updated 2 years ago
- Code for the winning solution in the SE&R 2022 Challenge - SER track. ☆16 Updated 2 years ago
- ☆28 Updated last year
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices" ☆21 Updated 5 months ago
- Training code for kokoro tts model ☆25 Updated 2 weeks ago
- Automatic Dialect Detection Repository ☆39 Updated 3 years ago
- Using YouTube to prepare a speech recognition dataset for any language ☆10 Updated 4 years ago
- A simple command line tool to calculate WER for ASR. ☆14 Updated last year
- ☆19 Updated last year
- Official PyTorch implementation of "AutoStyle-TTS: Retrieval-Augmented Generation based Automatic Style Matching Text-to-Speech Synthesis… ☆16 Updated 8 months ago
- S3PRL for Speech Emotion Recognition (see s3prl > downstream) ☆15 Updated 9 months ago
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper. ☆31 Updated last month
- Clean and modernized implementation of FastSpeech2/LightSpeech using IPA ☆16 Updated last year
- Overlapped Speech detection in Multi-party Conversations ☆22 Updated 7 years ago
- MSP-Podcast Challenge Baseline Code ☆28 Updated last year
- ☆14 Updated last year
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech ☆10 Updated 6 months ago
- Source code of paper <End-to-End Language Diarization for Bilingual Code-switching Speech> ☆19 Updated 3 years ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech. ☆13 Updated 2 years ago
- SERAB: a multi-lingual benchmark for speech emotion recognition ☆28 Updated 2 years ago
- ☆17 Updated 4 years ago