daisukelab / sound-clf-pytorch
Sound classifier tutorials/examples in PyTorch
☆61 Updated 2 years ago
Alternatives and similar repositories for sound-clf-pytorch:
Users who are interested in sound-clf-pytorch are comparing it to the libraries listed below.
- Code for STFT Transformer used in BirdCLEF 2021 competition. ☆78 Updated 3 years ago
- ☆43 Updated last month
- BirdCLEF 2021 - Birdcall Identification 4th place solution ☆50 Updated 3 years ago
- ☆29 Updated 3 years ago
- A handy tool for quickly checking the spectrograms of audio files in a directory ☆43 Updated last year
- xvector model on jtubespeech ☆43 Updated last year
- SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition ☆73 Updated 4 years ago
- Audio classification using Keras with ESC-50 dataset. ☆15 Updated 6 years ago
- context labels and pronunciation data for JSUT corpus ☆68 Updated 3 years ago
- Training code of Cornell Birdcall Identification Challenge 6th place solution ☆50 Updated 4 years ago
- ☆20 Updated 5 years ago
- ☆49 Updated 3 years ago
- An audio classification system for learning with out-of-distribution data ☆33 Updated 2 years ago
- A real-time implementation of Voice Activity Projection (VAP) is aimed at controlling behaviors of spoken dialogue systems, such as turn-… ☆53 Updated last week
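- Working toward an n-exercise drill set ("n knocks") for speech information processing ☆128 Updated 8 months ago
- Deep Learning × Music Information Processing study group @ University of Tsukuba, Human and Sound Informatics Laboratory ☆19 Updated last year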
- Machine Learning Sound Classifier ☆134 Updated 5 years ago
- GSoC'2021 | TensorFlow implementation of Wav2Vec2 ☆91 Updated 3 years ago
- Code repo for "Multi-Task Learning for Interpretable Weakly Labelled Sound Event Detection" ☆16 Updated 2 years ago
- A Python package of the dynamic compressive gammachirp filterbank (dcGC-FB) ☆28 Updated 9 months ago
- ☆482 Updated 7 months ago
- A PyTorch implementation of Meta-TasNet from "Meta-learning Extractors for Music Source Separation" ☆137 Updated 6 months ago
- ☆89 Updated 10 months ago
- PyTorch implementation of WaveFit [2022, Google], one of the SOTA lightweight/fast speech vocoders. ☆50 Updated 4 months ago
- 6th place solution to Freesound Audio Tagging 2019 kaggle competition ☆25 Updated 4 years ago
- Composing General Audio Representation by Fusing Multilayer Features of a Pre-trained Model ☆26 Updated last year
- ☆215 Updated last year
- ☆32 Updated 2 years ago
- A repository of Japanese Phoneme-Level BERT ☆21 Updated last year
- Chainer implementation of between-class learning for sound recognition https://arxiv.org/abs/1711.10282 ☆91 Updated 6 years ago