Master Thesis
☆13Feb 20, 2017Updated 9 years ago
Alternatives and similar repositories for Classifying-Environmental-Sounds-with-Image-Networks
Users that are interested in Classifying-Environmental-Sounds-with-Image-Networks are comparing it to the libraries listed below
Sorting:
- 基于Swin-Transformer改进_YOLOv7电力杆塔识别系统☆13Nov 27, 2023Updated 2 years ago
- Tool to identify domains containing Pinyin language☆12Oct 18, 2014Updated 11 years ago
- Most Complete Pytorch Imeplementation "GENERALIZED END-TO-END LOSS FOR SPEAKER VERIFICATION"☆10Mar 11, 2020Updated 5 years ago
- Implementation in Python/Cython of the algorithm VMD_CVM for signal denoising☆11Jul 29, 2022Updated 3 years ago
- Repository of files shared during OpenPlanetary Data Cafés☆11Sep 15, 2022Updated 3 years ago
- Baseline system for Language-based Audio Retrieval (Task 6B) in DCASE 2023 Challenge☆10Aug 8, 2023Updated 2 years ago
- [DEPRECATED] The website for the Application Development Initiative, built on Eventum.☆11Sep 13, 2017Updated 8 years ago
- Skin Lesion Detector using HAM10000 dataset with Chainer / ChainerCV☆12Jan 7, 2019Updated 7 years ago
- ☆10Jun 26, 2018Updated 7 years ago
- pytorch implementation of grok☆12Jan 19, 2026Updated last month
- ☆18Dec 9, 2020Updated 5 years ago
- ☆12Mar 25, 2024Updated last year
- "FORB: A Flat Object Retrieval Benchmark for Universal Image Embedding", NeurIPS 2023 Datasets and Benchmarks Track☆12Jun 20, 2024Updated last year
- Download AudioSet for Vision-Audio-Text Pre-training☆13May 16, 2022Updated 3 years ago
- A packaged convolutional voice activity detector for noisy environments.☆14Jun 15, 2019Updated 6 years ago
- ☆14Mar 25, 2023Updated 2 years ago
- Sound classification using neural networks☆12Jun 6, 2018Updated 7 years ago
- Aquila is a digital signal processing library for C++11.☆15Nov 14, 2022Updated 3 years ago
- wenet_LLM_from_ASLP☆15Nov 26, 2024Updated last year
- radiomixer☆14Feb 16, 2022Updated 4 years ago
- Code of the paper "Low-Latency Speech Separation Guided Diarization for Telephone Conversations"☆15Dec 22, 2022Updated 3 years ago
- Feedforward Sequential Memory Networks☆16Aug 2, 2022Updated 3 years ago
- wechatApp h5Game demo:eat moon☆12Nov 9, 2018Updated 7 years ago
- The official implementation for IEEE-ICASSP 2024 paper "Flare-Free Vision: Empowering Uformer with Depth Insights"☆15Aug 27, 2024Updated last year
- Tools for the evaluation of audio captioning.☆18May 23, 2020Updated 5 years ago
- ☆23Aug 2, 2021Updated 4 years ago
- Voice Alignment and Conversion with Neural Networks and the WORLD codec.☆20Apr 27, 2019Updated 6 years ago
- The official PyTorch implementation of DeeDSR☆20Feb 28, 2025Updated last year
- 51报名管家微信小程序☆16Aug 21, 2017Updated 8 years ago
- ☆17Mar 14, 2018Updated 7 years ago
- Repository for augmenting data in forms, invoices and receipts for document image understanding☆17May 6, 2021Updated 4 years ago
- Code of our recently published attack FDA: Feature Disruptive Attack. Colab Notebook: https://colab.research.google.com/drive/1WhkKCrzFq5…☆21Nov 11, 2019Updated 6 years ago
- handwritten text recognition on IAM handwriting dataset☆14Mar 15, 2020Updated 5 years ago
- Token-Level Ensemble Distillation for Grapheme-to-Phoneme Conversion☆20Jul 9, 2019Updated 6 years ago
- magicspeech competition recipe☆18Jun 29, 2020Updated 5 years ago
- Speech command recognition DenseNet transfer learning from UrbanSound8k in keras tensorflow☆17Jan 19, 2018Updated 8 years ago
- The code and pre-trained models of the paper "Masked Autoencoders as Image Processors" will be released in this repository.☆22Mar 31, 2023Updated 2 years ago
- ASR project with pytorch-lightning☆20Mar 21, 2025Updated 11 months ago
- We design a spectral compression mapping (SCM) for full-band speech enhancement, and propose a two-stage stream named MHA-DPCRN☆24Jul 4, 2022Updated 3 years ago