ubclaunchpad / minutes
Speaker diarization via transfer learning
☆27 · Updated 5 years ago
Alternatives and similar repositories for minutes:
Users who are interested in minutes are comparing it to the repositories listed below:
- A web app to play, visualize, and annotate your audio files for machine learning ☆119 · Updated 4 years ago
- This module aims to extract emotions from audio. The input argument is either an uploaded audio/video file to the server or a URL. The o… ☆21 · Updated 6 years ago
- Project to learn about speech recognition - both Speaker Diarization and other Speech Recognition applications. ☆47 · Updated 8 years ago
- Learning embeddings for laughter categorization ☆34 · Updated 6 years ago
- Segment speech sequences based on speaker transitions, using ML and DSP. ☆17 · Updated 6 years ago
- Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networks ☆64 · Updated 4 years ago
- Text independent speaker recognition algorithm based on CNN ☆23 · Updated 2 years ago
- ☆84 · Updated 4 years ago
- ☆65 · Updated 11 years ago
- Tools for parsing the audio track in television news programs ☆19 · Updated 3 years ago
- [deprecated] Pretrained models for pyannote-audio 1.x ☆72 · Updated 2 years ago
- A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library. ☆15 · Updated 4 years ago
- Speaker diarization python system based on binary key speaker modelling ☆61 · Updated 3 years ago
- Speaker Diarization is the problem of separating speakers in an audio. There could be any number of speakers and final result should stat… ☆65 · Updated 4 years ago
- Live demo for speech emotion recognition using Keras and Tensorflow models ☆39 · Updated 7 months ago
- Automatic Speech Recognition Dataset Generation ☆37 · Updated 6 years ago
- 24-hour Automatic Speech Recognition ☆27 · Updated 3 years ago
- Won't it be cool to build a speech assistant like Alexa or Siri yourself without voice API and network connection? ☆34 · Updated 6 years ago
- Paper: https://arxiv.org/abs/1702.02285 ☆63 · Updated 6 years ago
- Speaker diarization based on Kaldi x-vectors, tuned for 16k microphone data ☆96 · Updated last year
- Compute useful transcriptions metrics (CER, WER, SER, ...) ☆27 · Updated 10 years ago
- Speaker diarization scripts, based on AaltoASR ☆190 · Updated 6 years ago
- Running Mozilla's implementation of Baidu DeepSpeech on Google Colaboratory ☆16 · Updated 6 years ago
- Adaptive and Focusing Neural Layers for Multi-Speaker Separation Problem ☆51 · Updated 6 years ago
- An end-to-end Python pipeline for performing sentiment analysis on audio files of call-center conversations. ☆36 · Updated 7 years ago
- Reproducible experimental protocols for multimedia (audio, video, text) database ☆98 · Updated last month
- A deep learning framework for Speech-Music discrimination of continuous audio streams ☆68 · Updated 6 years ago
- automatically align transcribed audio and generate a wav2letter training corpus ☆36 · Updated last year
- ☆25 · Updated 7 years ago
- End to End Dialect Identification using Convolutional Neural Network ☆52 · Updated 5 years ago