☆30Aug 8, 2024Updated last year
Alternatives and similar repositories for conditional-flow-matching
Users that are interested in conditional-flow-matching are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Template for creating audio encoders compatible with X-ARES☆19Feb 11, 2026Updated 2 months ago
- A library of speech gadgets.☆14Oct 15, 2022Updated 3 years ago
- A streaming audio reader, processor, and writer built on top of soundfile, and PyAV (bindings for FFmpeg)☆38Mar 31, 2026Updated 2 weeks ago
- Scripts for computing common lyrics-to-audio alignment evaluation metrics. Usable evaluation for any token-based alignment (e.g. if tok…☆18Oct 27, 2020Updated 5 years ago
- Production first, nn-based on-device signal processing toolkit.☆65May 30, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- faster inference☆28Jan 20, 2025Updated last year
- ☆28Oct 7, 2025Updated 6 months ago
- python wrapper for kaldi's native I/O☆27Jan 9, 2025Updated last year
- Decoders from Kaldi using OpenFst☆34Updated this week
- c# library for decoding K2 transducer Models,used in speech recognition (ASR)☆13Aug 20, 2025Updated 7 months ago
- [NeurIPS 2025] Benchmark data and code for MMAR: A Challenging Benchmark for Deep Reasoning in Speech, Audio, Music, and Their Mix☆202Feb 25, 2026Updated last month
- ☆16Apr 8, 2025Updated last year
- Python wrapper for kaldi's arpa2fst☆38Aug 27, 2025Updated 7 months ago
- List of Large Lanugage Model Papers☆59Jun 5, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆11Mar 4, 2026Updated last month
- Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant …☆15Mar 11, 2024Updated 2 years ago
- Reference-aware automatic speech evaluation toolkit☆181Dec 5, 2024Updated last year
- ☆11Oct 20, 2022Updated 3 years ago
- Wenet speech to text for react native☆10Nov 1, 2022Updated 3 years ago
- The official repo/implementation of the paper "Training a Singing Transcription Model Using Connectionist Temporal Classification Loss an…☆12Mar 25, 2025Updated last year
- (WIP)long form speech generatoins☆31Apr 2, 2025Updated last year
- Java Implementation of the Sonopy Audio Feature Extraction Library by MycroftAI☆16Feb 10, 2020Updated 6 years ago
- Metrics for evaluating Automated Audio Captioning systems, designed for PyTorch.☆71Mar 22, 2026Updated 3 weeks ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- asr2k☆52Jun 2, 2024Updated last year
- ☆15Jul 4, 2024Updated last year
- Torch-based tool for quantizing high-dimensional vectors using additive codebooks☆54May 25, 2022Updated 3 years ago
- Official code for SongEcho☆55Mar 3, 2026Updated last month
- 来自于文章Paraformer-v2: An improved non-autoregressive transformer for noise-robust speech recognition☆27Nov 20, 2024Updated last year
- PyTorch reimplementation of "Keyword Transformer: A Self-Attention Model for Keyword Spotting"☆16Jul 23, 2021Updated 4 years ago
- ☆13Mar 30, 2023Updated 3 years ago
- My solution to course E6870 (Speech Recognition) of Columbia University.☆37May 13, 2018Updated 7 years ago
- it's ASR decoder and make graph project☆33May 26, 2022Updated 3 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Audio Codec Speech processing Universal PERformance Benchmark☆301Apr 1, 2026Updated 2 weeks ago
- Free ACELP vocoder☆17Sep 20, 2024Updated last year
- Conversion of recurrent neural network language models to weighted finite state transducers☆58Jun 1, 2018Updated 7 years ago
- ☆11Oct 14, 2023Updated 2 years ago
- Differentiable implementation of MSBG hearing loss model and MBSTOI intelligibility metric for Clarity Enhancement challenge.☆21Nov 19, 2021Updated 4 years ago
- Kaldi-compatible online fbank extractor without external dependencies☆146Oct 9, 2025Updated 6 months ago
- Tools for the evaluation of audio captioning.☆19May 23, 2020Updated 5 years ago