jonnor / brewing-audio-event-detection
Tracking beer/wine using Audio Event Detection with Machine Learning
☆15 · Updated last year
Alternatives and similar repositories for brewing-audio-event-detection
Users who are interested in brewing-audio-event-detection are comparing it to the repositories listed below
- eCMU: An Efficient Phase-aware Framework for Music Source Separation with Conformer (IEEE RIVF23) ☆10 · Updated last year
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts ☆16 · Updated last year
- Deploy Kaldi models using grpc for bidirectional streaming. ☆17 · Updated last year
- Steps to perform text-based speaker diarization with the Kaldi toolkit ☆12 · Updated 7 years ago
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices" ☆21 · Updated 6 months ago
- Transfer learning approach to pronunciation scoring ☆11 · Updated last year
- Forced alignment decoder for Whisper. ☆14 · Updated last year
- A simple command line tool to calculate WER for ASR. ☆14 · Updated last year
- Repository for reproducing result in journal "Self-supervised learning for Speech Emotion Recognition" ☆10 · Updated 2 years ago
- ☆15 · Updated 5 years ago
- Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering ☆22 · Updated 2 years ago
- ☆11 · Updated 4 years ago
- ☆13 · Updated 4 years ago
- ☆17 · Updated 4 years ago
- MnTTS: An Open-Source Mongolian Text-to-Speech Synthesis Dataset and Accompanied Baseline. (Accepted by IALP'2022) ☆22 · Updated 3 years ago
- ☆13 · Updated 3 years ago
- Wake-up word emotion recognition [APSIPA 2022] ☆17 · Updated 3 years ago
- Evaluation of STT models for the German language ☆15 · Updated 3 years ago
- ☆15 · Updated 4 years ago
- ☆12 · Updated 4 years ago
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks ☆17 · Updated 2 years ago
- Deepspeech ASR Model for the Catalan Language ☆17 · Updated 4 years ago
- Official PyTorch implementation of "AutoStyle-TTS: Retrieval-Augmented Generation based Automatic Style Matching Text-to-Speech Synthesis… ☆16 · Updated 9 months ago
- Voice activity detection and speaker gender segmentation audiovisual corpus ☆16 · Updated 11 months ago
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper. ☆33 · Updated 2 months ago
- Open Source Crimean Tatar Text-to-Speech datasets ☆14 · Updated 10 months ago
- PyTorch implementation of "Nextformer: A ConvNeXt Augmented Conformer For End-To-End Speech Recognition" ☆11 · Updated 3 years ago
- ☆10 · Updated last year
- Code for the winning solution in the SE&R 2022 Challenge - SER track. ☆16 · Updated 2 years ago
- An upgrade framework for train and validate compare with icefall using Lightning. ☆13 · Updated 9 months ago