Subtitle-Synchronizer/jlibrosa



A Java equivalent of librosa for processing audio files and extracting features from them.

☆121

Alternatives and similar repositories for jlibrosa

Users that are interested in jlibrosa are comparing it to the libraries listed below.


ShoYamanishi / AndroidMFCC
26-Point MFCC & 512-Point FFT Generator & Visualizer in Java, C++, and NEON intrinsics
☆15 · Dec 26, 2019 · Updated 6 years ago
VVasanth / Android_Tensorflow_AudioClassifier
Android app that classifies audio with TensorFlow models, applying MFCC audio processing techniques.
☆15 · Oct 27, 2020 · Updated 5 years ago
chiachunfu / speech
TensorFlow on mobile with speech-to-text DL models.
☆165 · Nov 21, 2017 · Updated 8 years ago
diaoenmao / Speech-Emotion-Recognition-with-Dual-Sequence-LSTM-Architecture
[ICASSP 2020] Speech Emotion Recognition with Dual-Sequence LSTM Architecture
☆12 · Jan 17, 2025 · Updated last year
olix20 / google_keyword_detection_challenge
https://www.kaggle.com/c/tensorflow-speech-recognition-challenge/
☆21 · Mar 1, 2018 · Updated 8 years ago
danijel3 / KaldiJava
Java interfaces and tools for Kaldi speech recognition.
☆20 · Oct 2, 2016 · Updated 9 years ago
hangtingchen / MFCC
C code to extract mfcc or fbank features from wav files
☆17 · Oct 25, 2019 · Updated 6 years ago
dhrebeniuk / RosaKit
librosa port to Swift, for using the same preprocessing logic on iOS/macOS platforms
☆94 · Nov 14, 2022 · Updated 3 years ago
xiaominfc / melspectrogram_cpp
C/C++ implementation of the melspectrogram computation from the Python audio processing library librosa
☆31 · Jan 14, 2022 · Updated 4 years ago
tinoucas / spleeter-tflite-convert
☆47 · Sep 27, 2020 · Updated 5 years ago
pigzach / MagicSpeechASR
magicspeech competition recipe
☆18 · Jun 29, 2020 · Updated 6 years ago
DarianHarrison / simple_mlp
a simple MLP Network written in Rust with C++ torch bindings
☆10 · Sep 29, 2021 · Updated 4 years ago
ewan-xu / LibrosaCpp
LibrosaCpp is a C++ implementation of librosa to compute short-time Fourier transform coefficients, mel spectrogram, or MFCC
☆239 · Dec 28, 2020 · Updated 5 years ago
andyweiqiu / SpeechRecognition
An iOS speech recognition demo based on Kaldi
☆28 · Mar 4, 2019 · Updated 7 years ago
bolajixi / Mulitimodal-Speech-Emotion-Recognition
A Tensorflow implementation of Speech Emotion Recognition using Audio signals and Text Data
☆12 · May 16, 2022 · Updated 4 years ago
PhilippeRo / gst-vosk
Gstreamer plugin for VOSK voice recognition engine
☆14 · Oct 2, 2022 · Updated 3 years ago
finecodekr / addresskr
☆11 · May 27, 2026 · Updated last month
zjyyyy / HGFM
HGFM: A Hierarchical Grained and Feature Model for Acoustic Emotion Recognition
☆11 · Oct 30, 2020 · Updated 5 years ago
andrewcsmith / tf_infinite_ramble
the infinite ramble in rust, powered by tensorflow. (mfcc cosine similarity matching)
☆13 · Apr 30, 2018 · Updated 8 years ago
deepspike / snn-for-asr
Pytorch-Kaldi implementation of SNN-based ASR systems
☆18 · Feb 1, 2020 · Updated 6 years ago
google-research / leaf-audio
LEAF is a learnable alternative to audio features such as mel-filterbanks, that can be initialized as an approximation of mel-filterbanks…
☆528 · Mar 1, 2022 · Updated 4 years ago
Gabeiscool420 / SoundSage---LLM-Audio-Processing
Open source Python program for automating gain staging. part 1 of a series for automating audio processing tasks, end goal is to create a…
☆50 · Oct 4, 2023 · Updated 2 years ago
ttslr / StrengthNet
[INTERSPEECH'2022] Accurate Emotion Strength Assessment for Seen and Unseen Speech Based on Data-Driven Deep Learning
☆83 · Nov 4, 2022 · Updated 3 years ago
deeplyinc / Nonverbal-Vocalization-Dataset
☆44 · Jan 13, 2022 · Updated 4 years ago
julianyulu / icassp2021-mscnn-spu
Code for our paper "Efficient Speech Emotion Recognition Using Multi-Scale CNN and Attention" (ICASSP 2021, co-first authorship)
☆28 · Jun 8, 2021 · Updated 5 years ago
LeeYongHyeok / DCM_vgg_transformer
Dual cross modality attention audio-visual speech recognition model based on vgg transformer with hybrid CTC/attention architecture using…
☆14 · Jul 2, 2020 · Updated 6 years ago
Moplast / TasNet-tensorflow
A tensorflow implementation of TasNet (ICASSP 2018)
☆16 · Nov 27, 2018 · Updated 7 years ago
beehive-lab / kfusion-tornadovm
🎥 A Java implementation of Kinect Fusion running on Tornado VM.
☆28 · Jul 16, 2026 · Updated last week
AnkushMalaker / pretrained-dcnn-attention-ser
Tensorflow Implementation for "Pre-trained Deep Convolution Neural Network Model With Attention for Speech Emotion Recognition"
☆10 · Dec 19, 2021 · Updated 4 years ago
JunhoKim94 / ASR_project
This repository was created for the NHN ASR hackathon competition.
☆11 · Sep 20, 2023 · Updated 2 years ago
Raptor007 / AutoDJ
Analyze music to detect beats, and play shuffled songs with beat-matched crossfade. Uses SDL for UI, WaveOut or SDL_audio for playback, …
☆14 · Apr 6, 2025 · Updated last year
jonnor / ESC-CNN-microcontroller
Environmental Sound Classification on Microcontrollers using Convolutional Neural Networks
☆107 · Aug 2, 2024 · Updated last year
SarthakYadav / leaf-pytorch
PyTorch implementation of the LEAF audio frontend
☆79 · Mar 29, 2023 · Updated 3 years ago
warnikchow / coaudiotext
A short tutorial on Keras for the co-utilization of audio and text data (multi-modal analysis)
☆16 · Nov 21, 2022 · Updated 3 years ago
Seeed-Studio / OSHW-RPi-Series
Presented by Seeed Studio: our series of open source hardware based on Raspberry Pi
☆22 · Jan 8, 2025 · Updated last year
athena-team / athena-transform
☆21 · Jan 13, 2020 · Updated 6 years ago
yujiacheng333 / Speech-Experiment
Integrates dataset preprocessing and model loading/interaction for speaker recognition and speech separation (based on the TIMIT dataset)
☆17 · Apr 22, 2021 · Updated 5 years ago
ppfliu / emotion-recognition
Group Gated Fusion on Attention-based Bidirectional Alignment for Multimodal Emotion Recognition
☆15 · May 10, 2022 · Updated 4 years ago
rosinality / melgan-pytorch
MelGAN and Tacotron 2 in PyTorch
☆11 · Oct 22, 2019 · Updated 6 years ago