ubclaunchpad / minutes
Speaker diarization via transfer learning
☆27 Updated 6 years ago
Alternatives and similar repositories for minutes
Users that are interested in minutes are comparing it to the libraries listed below
- Speaker diarization scripts, based on AaltoASR ☆190 Updated 6 years ago
- A web app to play, visualize, and annotate your audio files for machine learning ☆119 Updated 5 years ago
- ☆65 Updated 11 years ago
- Project to learn about speech recognition - both Speaker Diarization and other Speech Recognition applications. ☆50 Updated 8 years ago
- Learning embeddings for laughter categorization ☆34 Updated 6 years ago
- Speech-to-text based on wav2letter built for transfer learning ☆98 Updated 2 years ago
- Speaker diarization python system based on binary key speaker modelling ☆60 Updated 3 years ago
- Jupyter Notebooks for creating Speech datasets ☆46 Updated 6 years ago
- Tool for creation, manipulation and maintenance of voice corpora ☆82 Updated last year
- Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networks ☆65 Updated 5 years ago
- Tools for parsing the audio track in television news programs ☆19 Updated 4 years ago
- Trained models for automatic speech recognition (ASR). A library to quickly build applications that require speech to text conversion. ☆130 Updated 4 years ago
- [deprecated] Pretrained models for pyannote-audio 1.x ☆71 Updated 3 years ago
- End to End Dialect Identification using Convolutional Neural Network ☆52 Updated 5 years ago
- Automatic Speech Recognition Dataset Generation ☆37 Updated 7 years ago
- A simple audio feature extraction library ☆80 Updated 6 years ago
- automatically align transcribed audio and generate a wav2letter training corpus ☆36 Updated 2 years ago
- Segment speech sequences based on speaker transitions, using ML and DSP. ☆17 Updated 7 years ago
- A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library. ☆15 Updated 5 years ago
- This module aims to extract emotions from audio. The input argument is either an uploaded audio/video file to the server or a URL. The o… ☆22 Updated 7 years ago
- Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments ☆103 Updated 5 years ago
- 24-hour Automatic Speech Recognition ☆27 Updated 4 years ago
- A collection of basic python modules for spoken natural language processing ☆55 Updated 5 years ago
- Paper: https://arxiv.org/abs/1702.02285 ☆64 Updated 6 years ago
- A large, free audio sample database (10M words pronounced), a test bed for voice activity detection algorithms and for single-syllable wo… ☆69 Updated 7 years ago
- Live demo for speech emotion recognition using Keras and Tensorflow models ☆39 Updated last year
- ☆84 Updated 5 years ago
- The Additive Margin MobileNet1D is a new light weight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi… ☆30 Updated 2 years ago
- Speech noise reduction which was generated using existing post-production techniques implemented in Python ☆180 Updated 3 years ago
- An opensource speech-to-text software written in tensorflow ☆160 Updated 2 years ago