Read audio with FFmpeg into NumPy/PyTorch via ctypes (standard library module)
☆11Aug 12, 2020Updated 5 years ago
Alternatives and similar repositories for readaudio
Users that are interested in readaudio are comparing it to the libraries listed below.
- A library of speech gadgets.☆14Oct 15, 2022Updated 3 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- A Playground for Variational Autoencoders☆12Feb 11, 2018Updated 8 years ago
- This repository provides data and code for the paper "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription".☆16Jul 22, 2021Updated 4 years ago
- Repository for subjective and objective evaluation of source separation algorithms☆12Apr 18, 2018Updated 7 years ago
- A recipe for creating a Speaker Identification system built on Kaldi.☆15Jan 2, 2020Updated 6 years ago
- Permutation invariant training in PyTorch☆13Oct 2, 2020Updated 5 years ago
- Creating audio preprocessing features in TensorFlow Keras layers☆14Jul 13, 2021Updated 4 years ago
- Codebase and utilities for using models trained by multiple music related tasks☆12Jul 6, 2023Updated 2 years ago
- Filter Bank Implementation as Convolutional Neural Network using Python Keras☆17Dec 18, 2024Updated last year
- Web-based tool for straightforward class annotation of audio files☆11Aug 19, 2020Updated 5 years ago
- ☆21Updated this week
- Collection of Python scripts to demonstrate asynchronous programming in Python☆11May 22, 2022Updated 3 years ago
- Gamma Agreement in Python☆45Mar 4, 2024Updated 2 years ago
- Resources for "Simple Speech Representation Learning from Perceptual Data".☆11Sep 18, 2023Updated 2 years ago
- BlockCIrculantRNN (LSTM and GRU) using TensorFlow☆14Oct 30, 2018Updated 7 years ago
- Phonograms and syntagms: processing tools☆21Jun 21, 2025Updated 9 months ago
- GPU-accelerated AES encryption project☆11Feb 13, 2015Updated 11 years ago
- ☆13Oct 20, 2021Updated 4 years ago
- Model acceleration / model compression (all labs completed)☆11Dec 24, 2023Updated 2 years ago
- Convert words to numbers☆21Apr 13, 2022Updated 3 years ago
- Consistent dictionary learning algorithm for signal declipping (Python code)☆20Oct 24, 2018Updated 7 years ago
- Using YouTube to prepare a speech recognition dataset for any language☆10Mar 30, 2021Updated 4 years ago
- PyTorch speech2text inference script for the NVIDIA openseq2seq wav2letter model variant☆10Aug 12, 2019Updated 6 years ago
- Google closure-library wrapper for node.js☆23Mar 17, 2013Updated 13 years ago
- Nano vLLM☆13Jun 26, 2025Updated 9 months ago
- Finetuning MiniCPM-V-2_6 for Object Detection Task☆14Aug 27, 2024Updated last year
- ☆22Aug 29, 2019Updated 6 years ago
- This repo contains the baseline model recipes and pre-trained model for the GramVanni Hindi ASR challenge☆15Mar 26, 2022Updated 4 years ago
- Collection of models and extensions for deployment in PyTorch☆24Nov 20, 2022Updated 3 years ago
- CMake helper for cross-platform binary Python packages 🐍.☆12Jan 26, 2026Updated 2 months ago
- ☆14Mar 25, 2023Updated 3 years ago
- ☆10Apr 24, 2024Updated last year
- API Server for storing and graphing real-time time-series data in MongoDB☆18Nov 3, 2014Updated 11 years ago
- Java Bindings for the C++ library DeepSpeech☆10Jun 4, 2020Updated 5 years ago
- Scalable radix top-k selection on GPUs.☆21Jan 27, 2025Updated last year
- [ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…☆32Apr 8, 2022Updated 3 years ago
- Control of a pendulum in real-time. The script simulates a real-time test of a control algorithm, allowing the user to change online the …☆13Feb 19, 2023Updated 3 years ago
- Next generation graph processing platform☆12Aug 26, 2016Updated 9 years ago