m-koichi/ConformerSED


☆31

Alternatives and similar repositories for ConformerSED

Users that are interested in ConformerSED are comparing it to the libraries listed below.

DCASE-REPO / DESED_task
Domestic environment sound event detection task
☆157 · Jun 11, 2024 · Updated 2 years ago
seungheondoh / hi_kia
Wake-up word emotion recognition [APSIPA 2022]
☆17 · Nov 11, 2022 · Updated 3 years ago
qiuqiangkong / sound_event_detection_dcase2017_task4
☆55 · Jun 3, 2020 · Updated 6 years ago
JaesungHuh / VoxSRC2022
VoxSRC2022 workshop development kit
☆19 · Jul 21, 2022 · Updated 4 years ago
frednam93 / FilterAugSED
☆68 · Sep 13, 2024 · Updated last year
frednam93 / FDY-SED
☆96 · Jun 22, 2023 · Updated 3 years ago
turpaultn / dcase20_task4
Baseline of DCASE 2020 task 4
☆42 · Oct 24, 2022 · Updated 3 years ago
turpaultn / DESED
Repo associated with the DESED dataset: download and creation of data
☆154 · Jul 16, 2024 · Updated 2 years ago
peak1995 / tacotron-chinese
☆15 · Apr 17, 2019 · Updated 7 years ago
Kazuhito00 / onnx-model-encrypt-sample
Sample that encrypts and decrypts ONNX models using pyca/cryptography
☆16 · Mar 19, 2022 · Updated 4 years ago
apple / ml-nvas3d
☆49 · Jul 20, 2024 · Updated 2 years ago
yangdongchao / DCASE2021Task5
The code for DCASE2021 task5 submission.
☆20 · Feb 21, 2022 · Updated 4 years ago
denfed / wave-spec-fusion
Code for the submitted 2021 DCASE Workshop paper: "Waveforms and Spectrograms: Enhancing Acoustic Scene Classification Using Multimodal F…
☆16 · Aug 9, 2021 · Updated 4 years ago
YoshikiMas / madeon-asr
[SLT'24] Mamba-based Decoder-Only Approach for Speech Recognition
☆19 · Dec 1, 2024 · Updated last year
fgnt / sed_scores_eval
☆41 · Feb 18, 2026 · Updated 5 months ago
VSydorskyy / hubmap_2022_htt_solution
Codebase for HuBMAP + HPA - Hacking the Human Body: Human Torus Team solution (3rd place)
☆16 · Sep 27, 2022 · Updated 3 years ago
jefflai108 / Semi-Supervsied-Spoken-Language-Understanding-PyTorch
Semi-supervised spoken language understanding (SLU) via self-supervised speech and language model pretraining
☆12 · Mar 23, 2021 · Updated 5 years ago
CPJKU / cpjku_dcase24
☆29 · Oct 17, 2024 · Updated last year
multitel-ai / urban-sound-classification-and-comparison
Urban Sound Classification: striving towards a fair comparison
☆17 · Dec 11, 2020 · Updated 5 years ago
slSeanWU / beats-conformer-bart-audio-captioner
PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…
☆41 · Jan 6, 2024 · Updated 2 years ago
toni-heittola / dcase2020_task1_baseline
DCASE2020 Challenge Task 1 baseline system
☆25 · Jun 22, 2020 · Updated 6 years ago
marmoi / dcase2023_task4b_baseline
Baseline code for DCASE 2023 task 4 B
☆15 · Apr 21, 2023 · Updated 3 years ago
ierolsen / Object-Detection-with-OpenCV
This repo contains some object detection algorithms and techniques (not ML algorithms). It aims to get coordinates, width, height, …
☆12 · Nov 26, 2020 · Updated 5 years ago
JaesungHuh / VoxSRC2021
Development Toolkit for the VoxCeleb Speaker Recognition Challenge 2021
☆19 · Jul 21, 2021 · Updated 5 years ago
StevenHickson / CreateNormals
☆11 · Nov 22, 2019 · Updated 6 years ago
SIY1121 / HRTFSimulator
☆10 · Aug 28, 2019 · Updated 6 years ago
longxiang92 / Flash-MNIST
☆17 · Mar 14, 2018 · Updated 8 years ago
migperfer / TriAD-ISMIR2023
Code accompanying the ISMIR23 paper "TriAD: Capturing harmonics with 3D convolutions"
☆20 · Jul 19, 2024 · Updated 2 years ago
stoneMo / OneAVM
Official Codebase of "A Unified Audio-Visual Learning Framework for Localization, Separation, and Recognition" (ICML 2023)
☆12 · Jun 1, 2023 · Updated 3 years ago
NISTEP / minutes
Metadata set for meeting minutes
☆12 · Jun 10, 2018 · Updated 8 years ago
Kikyo-16 / Sound_event_detection
This code targets weakly-labeled semi-supervised sound event detection. It includes two methods we proposed to solve this task: sp…
☆129 · Jul 24, 2020 · Updated 6 years ago
VoxBlink / ScriptsForVoxBlink
A repo containing download guidance and corresponding scripts for the VoxBlink dataset.
☆30 · Apr 16, 2024 · Updated 2 years ago
zjysteven / MixOE
[WACV'23] Mixture Outlier Exposure for Out-of-Distribution Detection in Fine-grained Environments
☆26 · Apr 12, 2023 · Updated 3 years ago
yamathcy / ISMIR2022J-POP
Supplementary Materials of ISMIR 2022 paper "Analysis and detection of singing techniques in repertoires of J-POP solo singers" by Yuya Y…
☆23 · Apr 23, 2024 · Updated 2 years ago
MihawkHu / DCASE2020_task1
Code for DCASE 2020 task 1a and task 1b.
☆88 · Jan 20, 2022 · Updated 4 years ago
nii-yamagishilab / SpeechSPC-mini
Speech Security and Privacy Compendium - Mini
☆10 · Jun 18, 2024 · Updated 2 years ago
Andong-Li-speech / RTNet
Implementation of "Monaural Speech Enhancement with Recursive Learning in the Time Domain"
☆47 · Nov 4, 2020 · Updated 5 years ago
swagshaw / ASC-CL
Official PyTorch Implementation for Continual Learning For On-Device Environmental Sound Classification
☆14 · Jul 19, 2022 · Updated 4 years ago
wangyu09 / exkaldi-rt
An online speech recognition extension toolkit of Kaldi
☆55 · Jun 23, 2021 · Updated 5 years ago