jonnor / brewing-audio-event-detectionLinks
Tracking beer/wine using Audio Event Detection with Machine Learning
☆15Updated last year
Alternatives and similar repositories for brewing-audio-event-detection
Users that are interested in brewing-audio-event-detection are comparing it to the libraries listed below
Sorting:
- A Study of Low-Resource Speech Commands Recognition Based on Adversarial Reprogramming☆19Updated 2 years ago
- Forced alignment decoder for Whisper.☆14Updated last year
- CLASP: Contrastive Language-Speech Pretraining for Multilingual Multimodal Information Retrieval☆13Updated 7 months ago
- S3PRL for Speech Emotion Recognition (see s3prl > downstream)☆15Updated 11 months ago
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices"☆21Updated 7 months ago
- ☆18Updated last year
- Transfer learning approach to pronunciation scoring☆11Updated 2 years ago
- Mason-Alberta Phonetic Segmenter☆15Updated last month
- Official PyTorch implementation of (ICME2025) "AutoStyle-TTS: Retrieval-Augmented Generation based Automatic Style Matching Text-to-Speec…☆17Updated 10 months ago
- IPA Phonetic dataset lexicon☆18Updated 2 weeks ago
- ☆17Updated 2 years ago
- A tool to collect/validate audio recordings from workers on Amazon Mechanical Turk. Written in Python/Flask. (originally hosted on github…☆14Updated 3 years ago
- PyTorch implementation of "Nextformer: A ConvNeXt Augmented Conformer For End-To-End Speech Recognition"☆11Updated 3 years ago
- eCMU: An Efficient Phase-aware Framework for Music Source Separation with Conformer (IEEE RIVF23)☆10Updated last year
- This is the experimental description of MnTTS2.☆11Updated last year
- a very simple vocal tract model, few tube model. generate vowel sound by it☆18Updated 2 years ago
- A family of efficient speech models for multilingual phone recognition☆37Updated 3 months ago
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts☆16Updated last year
- CleanUMamba: A Compact Mamba Network for Speech Denoising using Channel Pruning [Official PyTorch implementation]☆22Updated 7 months ago
- One command to start a streaming ASR server.☆12Updated last year
- Implementation of the DIVA model of speech acquisition and production using PyTorch☆21Updated 3 years ago
- DST is a Decoder-only simultaneous machine translation model, which can conduct policy decision and translation concurrently☆11Updated last year
- Getting confidences from any end-to-end systems☆11Updated 2 years ago
- A Weakly Supervised Forced Alignment for disluent speech☆15Updated 2 years ago
- ☆12Updated 4 years ago
- Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering☆24Updated 2 years ago
- Sequence to sequence model for Arabic punctuation prediction.☆12Updated 5 years ago
- steps to perform text-based speaker diarization with kaldi toolkit☆12Updated 7 years ago
- MnTTS: An Open-Source Mongolian Text-to-Speech Synthesis Dataset and Accompanied Baseline. (Accepted by IALP'2022)☆22Updated 3 years ago
- Deepspeech ASR Model for the Catalan Language☆17Updated 4 years ago