eellak/gsoc2021-audio-annotation-tool

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/eellak/gsoc2021-audio-annotation-tool)

eellak / gsoc2021-audio-annotation-tool

Creation of a multi user audio first annotation tool - GSoC 2021

☆29

Alternatives and similar repositories for gsoc2021-audio-annotation-tool

Users that are interested in gsoc2021-audio-annotation-tool are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

streichgeorg / autosing
View on GitHub
☆18Jan 20, 2025Updated last year
CODEJIN / PWGAN_for_HiFiSinger
View on GitHub
☆11Mar 20, 2021Updated 5 years ago
colaudiolab / AudioSet-R
View on GitHub
Official implementation: "AudioSet-R: A Refined AudioSet with Multi-Stage LLM Label Reannotation"
☆19Oct 9, 2025Updated 9 months ago
mozilla / murmur
View on GitHub
DEPRECATED - A webapp for collecting speech samples for voice recognition testing and training
☆20May 23, 2019Updated 7 years ago
xushengyuan / VocalnetOpenDataset
View on GitHub
一个开源的中文歌声合成数据集。An open-source Chinese singing synthesizing dataset.
☆24Jul 13, 2019Updated 7 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
drscotthawley / fad_pytorch
View on GitHub
Frechet Audio Distance evaluation in PyTorch
☆36Jun 9, 2023Updated 3 years ago
thuhcsi / SnakeGAN
View on GitHub
Please visit https://thuhcsi.github.io/SnakeGAN/
☆37Apr 25, 2023Updated 3 years ago
burrmill / burrmill
View on GitHub
BurrMill core
☆22Nov 2, 2021Updated 4 years ago
migperfer / AutoMashupper
View on GitHub
Tool to aid in the creation of mashups
☆21Apr 7, 2020Updated 6 years ago
fabianostermann / ArtificialSongGenerator
View on GitHub
The ArtificialSongGenerator automatically composes and compiles the Artifical Audio Multitrack dataset (AAM).
☆27Nov 17, 2025Updated 8 months ago
BlaiMelendezCatalan / BAT
View on GitHub
☆62Feb 2, 2023Updated 3 years ago
cpii-cai / PunCantonese
View on GitHub
A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts
☆15Dec 3, 2024Updated last year
charlesliucn / LanMIT
View on GitHub
📖 LanMIT: A Toolkit for Improving Language Models in Low-resourced Speech Recognition based on Kaldi.
☆22Jul 12, 2019Updated 7 years ago
BiometricVox / DAE_SpeakerID
View on GitHub
Denoising autoencoders for speaker identification on MCE 2018 challenge
☆12Nov 8, 2018Updated 7 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
michaelneri / unsupervised-audio-anomaly-detection
View on GitHub
Official repository of the work "Low-complexity Unsupervised Audio Anomaly Detection exploiting Separable Convolutions and Angular Loss" …
☆11Nov 6, 2024Updated last year
hymanhsu / JSGFDeducer
View on GitHub
JSGF Deducer based on JSGF grammar and WFST
☆11Jan 11, 2018Updated 8 years ago
madhavlab / wav2tok
View on GitHub
Codebase for ICLR' 23 paper- ''wav2tok: Deep Sequence Tokenizer for Audio Retrieval"
☆36Jun 30, 2026Updated 3 weeks ago
crystal0913 / merlin-tts
View on GitHub
c++ code for merlin tts
☆22Oct 19, 2019Updated 6 years ago
haoheliu / ontology-aware-audio-tagging
View on GitHub
☆14Nov 22, 2022Updated 3 years ago
Koziev / StressModel
View on GitHub
Neural model for prediction of stress position in Russian words
☆13Jun 22, 2025Updated last year
TTS-Research / PEL-TTS
View on GitHub
☆14Aug 16, 2023Updated 2 years ago
jackyyy0228 / Chinese-ASR
View on GitHub
Chinese-ASR built on kaldi
☆14Jan 21, 2019Updated 7 years ago
PanagiotisP / svs-multiband
View on GitHub
Code for the paper "MULTI-BAND MASKING FOR WAVEFORM-BASED SINGING VOICE SEPARATION" that was accepted on EUSIPCO2022
☆15Jun 18, 2022Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
jjunak-yun / FLowHigh_code
View on GitHub
[ICASSP 2025] "FLowHigh: Towards efficient and high-quality audio super-resolution with single-step flow matching"
☆118Jan 17, 2025Updated last year
yoongi43 / VRVQ
View on GitHub
Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"
☆11Apr 10, 2025Updated last year
tommy-fox / streaming-source-separation
View on GitHub
Streaming source separation for music and speech files, using the Open-Unmix LSTM architecture.
☆21Dec 8, 2022Updated 3 years ago
MingjieChen / EasyVC
View on GitHub
A toolkit for any-to-any encoder-decoder voice conversion systems
☆83Aug 10, 2023Updated 2 years ago
alisonbma / aiSFX
View on GitHub
Representation Learning for the Automatic Indexing of Sound Effects Libraries (ISMIR 2022): Deep audio embeddings pre-trained on UCS & No…
☆49Jun 21, 2023Updated 3 years ago
Rongjiehuang / Multi-Singer
View on GitHub
PyTorch Implementation of Multi-Singer (ACM-MM'21)
☆139May 8, 2022Updated 4 years ago
seungheondoh / msu-benchmark
View on GitHub
music semantic understanding evaluation benchmark
☆24Aug 12, 2023Updated 2 years ago
seungheondoh / music-text-representation-pp
View on GitHub
Enriching Music Descriptions with a Finetuned-LLM and Metadata for Text-to-Music Retrieval (TTMR++) [ICASSP24]
☆43Oct 7, 2024Updated last year
zxxwxyyy / sonique
View on GitHub
Video Background Music Generation Using Unpaired Audio-Visual Data
☆33Oct 8, 2024Updated last year
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
hainan-xv / PASM
View on GitHub
Pronunciation-assisted Subword Modeling
☆31May 30, 2019Updated 7 years ago
qiuqiangkong / mini_music_tagging
View on GitHub
☆13Jul 14, 2024Updated 2 years ago
amazon-science / unsupervised-melody-to-lyrics-generation
View on GitHub
This repository provides the materials used in "Unsupervised Melody-to-Lyric Generation" by Yufei Tian, Anjali Narayan-Chen, Shereen Orab…
☆11Jul 6, 2023Updated 3 years ago
huaidanquede / Dense-TSNet
View on GitHub
offical code for Dense-TSNet
☆12Sep 17, 2024Updated last year
idnavid / speech_activity_detection
View on GitHub
Unsupervised speech activity detection system.
☆11Jul 2, 2018Updated 8 years ago
samsad35 / code-ancogen
View on GitHub
[ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder
☆14Mar 11, 2025Updated last year
mdx-tutorial / mdx-tutorial.github.io
View on GitHub
Tutorial covering Open Source tools for Source Separation.
☆15Nov 12, 2021Updated 4 years ago