This study converts piano recordings to mel spectrogram and classifies them by SOTA pre-trained neural network backbones in CV. Comparative experiments show that SqueezeNet achieves a best classification accuracy of 92.37%.|该项目将钢琴录音转为为mel频谱图,使用微调后的前沿计算机视觉领域预训练深度学习骨干网络对其进行分类,对比实验可知SqueezeNet作为最优网络正确率可达92.37%
☆25Oct 31, 2025Updated 7 months ago
Alternatives and similar repositories for pianos
Users that are interested in pianos are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Sharer developed in VB.NET framework can turn a Windows notebook equipped with a shareable wireless network card into a WiFi transmitter;…☆20Apr 11, 2025Updated last year
- This is a TPS 3D game developed on Unity: our protagonist is trapped in a mysterious crypt with monsters and machines. He faces all kinds…☆21Nov 3, 2025Updated 7 months ago
- QPoisson is not only an image editor for implementing conventional image transformations such as mirroring, rotation, inversion, grayscal…☆27Nov 3, 2025Updated 7 months ago
- This project is a PyTorch implementation that uses deep CNN to recognize multi-digit numbers using the SVHN dataset derived from Google S…☆24Updated this week
- Using deep reinforcement learning to play Snake game. The used algorithm is PPO for discrete! It has the brilliant performance in the fi…☆35Nov 3, 2025Updated 7 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- It uses Navier-Stokes equation as the physical model, the numerical solution obtained by real-time calculation includes scalar and veloci…☆22Nov 3, 2025Updated 7 months ago
- This repository provides LaTeX templates for academic papers, you can select the appropriate template for your target conference or journ…☆50Jan 29, 2026Updated 4 months ago
- CCMusic, an open Chinese music database, integrates diverse datasets. It ensures data consistency via cleaning, label refinement and stru…☆28Oct 31, 2025Updated 7 months ago
- ☆13Oct 24, 2023Updated 2 years ago
- ☆13Nov 11, 2024Updated last year
- Piano Skills Assessment [IEEE MMSP 2021]☆19Apr 25, 2025Updated last year
- Extract the stems (piano, bass, drums, vocals, etc.) of any audio/songs from YouTube.☆15Feb 25, 2021Updated 5 years ago
- logWMSE, an audio quality metric & loss function with support for digital silence target. Useful for training and evaluating audio source…☆47Apr 29, 2026Updated last month
- Repo for the IDESSAI 2024 course on modeling audio with discrete tokens.☆13Sep 13, 2024Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- A Quick and Effective Camera-IMU Calibration for MODD2 dataset☆15Feb 10, 2020Updated 6 years ago
- Demo URL: https://genius-society-eluvletter.static.hf.space, the heartbeat animation indicates that the BGM is loading, please be patient…☆32May 11, 2026Updated last month
- This repository contains the annotations and download scripts for the audio files of the GiantSteps Key data set. This data set was publi…☆25Mar 19, 2025Updated last year
- Rendering of ImZero GUI library commands, generating of ImZero input commands.☆33Nov 13, 2025Updated 7 months ago
- Framework for estimating harmonic properties of music tracks.☆31Mar 24, 2023Updated 3 years ago
- A CNN which converts piano audio to a simplified MIDI format☆39May 26, 2018Updated 8 years ago
- Build a level 1 coding agent.☆17Jan 28, 2025Updated last year
- ☆26Feb 17, 2026Updated 3 months ago
- MIDI Piano synthesizer using DDSP.☆98May 24, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Usable implementation of Mogrifier, a circuit for enhancing LSTMs and potentially other networks, from Deepmind☆22Jun 9, 2024Updated 2 years ago
- 基于esp32 pico v3 02的手表设计,并将云端模型和本地语音识别模型组合使用,提供智能化控制能力☆33Jan 18, 2026Updated 4 months ago
- 音乐频谱进度条☆27Jul 12, 2019Updated 6 years ago
- Prototype for a super simple authentication token service generator to support a mobile API.☆51Jul 21, 2022Updated 3 years ago
- 使用NAFNet进行图像去模糊☆23Nov 16, 2022Updated 3 years ago
- A collections of audio codecs with a standardized API☆41Apr 15, 2026Updated 2 months ago
- CLaMP 2: Multimodal Music Information Retrieval Across 101 Languages Using Large Language Models [NAACL 2025]☆65Feb 28, 2025Updated last year
- ☆19Feb 3, 2025Updated last year
- Raspberry Pi 4のCPU動作を想定した人検出モデルとデモスクリプト☆39May 20, 2022Updated 4 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- 使用python分析音频文件,转换为乐谱☆23May 9, 2019Updated 7 years ago
- Official page of "DeFTAN-II: Efficient multichannel speech enhancement with subgroup processing", IEEE/ACM Transactions on Audio, Speech,…☆33Nov 21, 2024Updated last year
- Official Repository for "Music Source Restoration"☆31Jun 1, 2025Updated last year
- ☆28May 12, 2026Updated last month
- Generate osu! standard beatmap object coordinates using a diffusion model with a transformer backbone☆46Mar 16, 2025Updated last year
- Based on Neural Amp Modeler 0.7.1 with some enhanced features☆12Apr 18, 2023Updated 3 years ago
- "Fx-Encoder++: Extracting Instrument-wise Audio Effect Representations from Mixtures"☆52Aug 23, 2025Updated 9 months ago