york135 / CTC_CE_for_ASTView external linksLinks
The official repo/implementation of the paper "Training a Singing Transcription Model Using Connectionist Temporal Classification Loss and Cross-entropy Loss"
☆12Mar 25, 2025Updated 10 months ago
Alternatives and similar repositories for CTC_CE_for_AST
Users that are interested in CTC_CE_for_AST are comparing it to the libraries listed below
Sorting:
- Code accompayning ISMIR23 paper; TriAD: Capturing harmonics with 3D convolutions☆19Jul 19, 2024Updated last year
- We introduce the LLAMA1 Test Set, a comprehensive open-domain world knowledge QA dataset for evaluating question-answering systems. We pr…☆23Mar 14, 2024Updated last year
- This repository presents FSD dataset for song deepfake detection.☆25Aug 18, 2025Updated 5 months ago
- ONSETS&VELOCITIES real-time piano detection - PyTorch training [EUSIPCO2023]☆28Aug 31, 2023Updated 2 years ago
- Official implementation of MelHuBERT☆68Oct 26, 2024Updated last year
- ☆33Sep 16, 2022Updated 3 years ago
- ☆28Aug 8, 2024Updated last year
- ☆17Jan 31, 2023Updated 3 years ago
- SSR-Speech: Towards Stable, Safe and Robust Zero-shot Speech Editing and Synthesis☆145Jan 1, 2025Updated last year
- AdvSV stands as the first dataset developed specifically for evaluating Speaker Verification (SV) systems against adversarial attacks. I…☆11Nov 21, 2023Updated 2 years ago
- Bark frequency filterbank + SPL differential envelope follower transient shaper☆11Dec 11, 2020Updated 5 years ago
- ☆13Sep 25, 2024Updated last year
- This is the accompanying repository to the paper - Automatic Estimation of Singing Voice Musical Dynamics☆15Oct 28, 2024Updated last year
- The project for speech translation☆12Sep 28, 2023Updated 2 years ago
- ☆10Sep 6, 2020Updated 5 years ago
- [KDD24-ADS] R-Eval: A Unified Toolkit for Evaluating Domain Knowledge of Retrieval Augmented Large Language Models☆11Apr 9, 2024Updated last year
- The demo for "Discretization and Re-synthesis: an alternative method to solve the Cocktail Party Problem".☆12Oct 25, 2021Updated 4 years ago
- Homemade LightGBM and VGG-net experiment setup for DCASE2017 task 1☆11Aug 8, 2017Updated 8 years ago
- ☆10Apr 8, 2024Updated last year
- Accompanying repository for the DAFx24 paper "Interpolation Filters for Antiderivative Antialiasing"☆12Sep 5, 2024Updated last year
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated 10 months ago
- ☆11Sep 19, 2025Updated 4 months ago
- awesome-audio-visual-robustness☆11Jan 27, 2024Updated 2 years ago
- Official implementation of DGP-based multi-speaker speech synthesis with PyTorch☆24Mar 23, 2021Updated 4 years ago
- PyTorch implementation of "Nextformer: A ConvNeXt Augmented Conformer For End-To-End Speech Recognition"☆11Dec 15, 2022Updated 3 years ago
- An evolutionary algorithm that generates an accompaniment to a given melody that consists of triad chords while following music theory ru…☆10Sep 19, 2022Updated 3 years ago
- This repository includes the code to reproduce our paper [Explainable deepfake and spoofing detection: an attack analysis using SHapley A…☆12Jan 24, 2024Updated 2 years ago
- The official implementation of "TONet: Tone-Octave Network for Singing Melody Extraction from Polyphonic Music"☆43Oct 25, 2022Updated 3 years ago
- ☆15May 16, 2024Updated last year
- An upgrade framework for train and validate compare with icefall using Lightning.☆15Mar 26, 2025Updated 10 months ago
- Experiment with JNI access to some Kaldi functions.☆12Dec 31, 2018Updated 7 years ago
- Hung-Yi Lee Linear Algebra 2018 Fall Homework☆10May 5, 2019Updated 6 years ago
- ☆14Aug 31, 2015Updated 10 years ago
- Python launcher of animated MIDI player by @cifkao & @magenta☆22Dec 13, 2023Updated 2 years ago
- Counterpoint by convolution☆14Mar 23, 2018Updated 7 years ago
- Dynamic vision-guided speaker embedding for audio-visual speaker diarization☆12Jul 5, 2022Updated 3 years ago
- Chinese Prosodic Structure Prediction☆10May 18, 2019Updated 6 years ago
- TG-CRITIC: A TIMBRE-GUIDED MODEL FOR REFERENCE-INDEPENDENT SINGING EVALUATION☆15May 26, 2023Updated 2 years ago
- 求取语音的MFCC参数和GFCC参数,可用于语音信号特征提取☆10Jul 19, 2021Updated 4 years ago