ruslantau / media-annotator
Web-based annotation tool for media data. The easiest way to create your own media dataset.
☆16May 12, 2023Updated 2 years ago
Alternatives and similar repositories for media-annotator
Users that are interested in media-annotator are comparing it to the libraries listed below
- Audio Demo for "FastSVC: Fast Cross-Domain Singing Voice Conversion with Feature-wise Linear Modulation"☆21Apr 7, 2021Updated 4 years ago
- Speech Recognition implementation using Artificial Neural Networks☆10Sep 7, 2015Updated 10 years ago
- Simple implementation of TDOA localization algorithm.☆13Oct 12, 2016Updated 9 years ago
- A signal processing library, currently sufficient for basic speech recognition stuff like mel frequency cepstrum☆19Mar 15, 2012Updated 13 years ago
- Research_speech_speaker_verification_nist_sre2010☆12Mar 1, 2016Updated 9 years ago
- This is now the official location of the Kaldi project.☆10Aug 22, 2019Updated 6 years ago
- ☆10Jun 11, 2021Updated 4 years ago
- Using acceleration and heart rate data to classify awake, deep, and light sleep☆10Dec 21, 2017Updated 8 years ago
- Hadoop-based tool for extraction of large scale synchronous grammars for paraphrasing and machine translation☆15Dec 2, 2016Updated 9 years ago
- ☆14Mar 15, 2022Updated 3 years ago
- Code for the Paper Speech Recognition and Multi-Speaker Diarization of Long Conversations☆38Jun 12, 2023Updated 2 years ago
- This is the home directory of the speaker diarization module being developed for Heterogeneous News data in RedHen Labs as a GSOC Project☆10Sep 11, 2015Updated 10 years ago
- Audio source separation using CASA approaches in Python.☆11Apr 2, 2015Updated 10 years ago
- Visualization for hidden Markov model computations☆14Dec 19, 2014Updated 11 years ago
- code for paper "learning to fool the speaker recognition"☆10Jun 12, 2020Updated 5 years ago
- EditEvo is a browser-based video editor designed to provide users with editing capabilities directly within their web browser. The appli…☆11May 7, 2024Updated last year
- Multistream CNN for Robust Acoustic Modeling☆40Jun 17, 2021Updated 4 years ago
- Implementation of joint bayesian model, written in python.☆11Aug 2, 2021Updated 4 years ago
- Music segmentation by ordinal linear discriminant analysis☆18Nov 10, 2017Updated 8 years ago
- Voice Music Separation competing for 6th Huawei Cup in ZJU☆11Jun 2, 2015Updated 10 years ago
- An application that helps people with dysarthria improve their pronunciation using deep learning☆10Dec 29, 2020Updated 5 years ago
- 夏目悠李 / Latest label data for the male singing voice database☆11Sep 2, 2020Updated 5 years ago
- Expected edit distance implementation using OpenFst tools☆11May 13, 2015Updated 10 years ago
- ☆17Jul 29, 2018Updated 7 years ago
- Cordova/Phonegap plugin for Android SpeechRecognizer feature.☆11Apr 17, 2015Updated 10 years ago
- ☆12Oct 7, 2020Updated 5 years ago
- Scripts for recreating the Replication Dataset for Fundamental Frequency Estimation. Part of the dissertation "Pitch of Voiced Speech in …☆11Mar 29, 2021Updated 4 years ago
- Tutorial on {Deep} Phonetic Tools given in BigPhon @ LabPhon15☆12Apr 17, 2017Updated 8 years ago
- A python implementation of the neural network joint language model and an extension of it using global source context.☆11May 17, 2017Updated 8 years ago
- a sequential tagger for NLP using Maximum Entropy Learning and Hidden Markov Models☆22Jan 18, 2016Updated 10 years ago
- Source code for "Unsupervised Lexicon Discovery from Acoustic Input ", Lee et al, 2015 TACL☆10Aug 11, 2016Updated 9 years ago
- Python based tool to use text to speech to read books or study material quickly.☆10Sep 22, 2021Updated 4 years ago
- Android Photo/Video Recording/Capture/Effects via OpenGL☆10Feb 21, 2021Updated 4 years ago
- Speech Dereverberation using weighted prediction error☆11Dec 22, 2019Updated 6 years ago
- JSGF Deducer based on JSGF grammar and WFST☆11Jan 11, 2018Updated 8 years ago
- Symbolic Graphics Programming with Large Language Models☆37Sep 14, 2025Updated 4 months ago
- Cython implementation of Moattar and Homayounpour's Voice Activity Detection (VAD) algorithm fast enough for real-time on an RPi 3.☆12Aug 18, 2018Updated 7 years ago
- Text-Dependent Speaker Recognition System with Machine Learning Techniques☆10Dec 31, 2017Updated 8 years ago
- Voice synthesis library for Text-to-Speech applications (Currently HTS Engine rewrite in Rust language)☆13Updated this week