soniox/soniox-compare

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/soniox/soniox-compare)

soniox / soniox-compare

Soniox Compare. Compare real-time voice AI side by side. No glossy charts, just results.

☆34

Alternatives and similar repositories for soniox-compare

Users that are interested in soniox-compare are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

goodmike31 / pl-asr-speech-data-survey
View on GitHub
Survey of available speech datasets for Polish ASR development
☆17Jan 1, 2025Updated last year
danijel3 / ClarinStudioKaldi
View on GitHub
A baseline Automatic Speech Recognition system for Polish based on Kaldi.
☆18Dec 21, 2021Updated 4 years ago
frankyoujian / Edge-Punct-Casing
View on GitHub
☆33Feb 4, 2025Updated last year
robmsmt / SpeechLoop
View on GitHub
Many ASRs under one roof. With Benchmarking... answering the question. What is the best ASR for my dataset?
☆19Oct 5, 2022Updated 3 years ago
poteboy / emox
View on GitHub
immediate visibility of the applied styles without back-and-forth between files
☆13Jun 24, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
gladiaio / normalization
View on GitHub
A lightweight library for normalizing speech transcripts before computing WER
☆27Jul 14, 2026Updated last week
talhanai / wer-sigtest
View on GitHub
Script to perform statistical significance test between ASR hypotheses.
☆23Aug 13, 2017Updated 8 years ago
cpii-cai / PunCantonese
View on GitHub
A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts
☆15Dec 3, 2024Updated last year
daanzu / wav2vec2_stt_python
View on GitHub
Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recogni…
☆23Aug 16, 2021Updated 4 years ago
YoshikiMas / madeon-asr
View on GitHub
[SLT'24] Mamba-based Decoder-Only Approach for Speech Recognition
☆19Dec 1, 2024Updated last year
iamanigeeit / present
View on GitHub
☆14Aug 19, 2024Updated last year
voxeet / voxeet-uxkit-ios
View on GitHub
☆11Jun 5, 2023Updated 3 years ago
lars76 / fastspeech2-clean
View on GitHub
Clean and modernized implementation of FastSpeech2/LightSpeech using IPA
☆18Aug 16, 2024Updated last year
reppy4620 / convnext_tts
View on GitHub
Unofficial implementation of ConvNeXt-TTS powered by lightning
☆18Oct 20, 2024Updated last year
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
5Hyeons / StyleTTS2-Vocos
View on GitHub
StyleTTS2 + Vocos as a Decoder
☆13Mar 24, 2025Updated last year
ArenAcikgoz / Whisper-Alignment
View on GitHub
Forced alignment decoder for Whisper.
☆16Mar 13, 2024Updated 2 years ago
karamouche / noisekit
View on GitHub
Generate degraded speech datasets for noise-robust ASR benchmarking
☆45Jun 9, 2026Updated last month
DDATT / Vits2-onnx-cpp
View on GitHub
Simple inference for Vits2 TTS Using ONNXRUNTIME and espeak-ng on C++
☆19Apr 17, 2024Updated 2 years ago
ictnlp / DST
View on GitHub
DST is a Decoder-only simultaneous machine translation model, which can conduct policy decision and translation concurrently
☆11Jun 6, 2024Updated 2 years ago
Brand24-AI / mms_benchmark
View on GitHub
The most extensive open massively multilingual corpus of datasets for training sentiment models. The corpus consists of 79 manually selec…
☆16Nov 14, 2023Updated 2 years ago
AndrejGajdos / leaflet-markercluster-vs-supercluster
View on GitHub
Performance comparison of Leaflet markercluster and supercluster
☆12Apr 17, 2021Updated 5 years ago
MurageKabui / AutoIT-OCRSpace-UDF
View on GitHub
A AutoIT 3 wrapper library around the OCRSpace API.
☆14Apr 26, 2024Updated 2 years ago
huggingface / open_asr_leaderboard
View on GitHub
☆229Updated this week
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
TigreGotico / chatterbox-onnx
View on GitHub
chatterbox TTS + Voice Clone using onnx
☆28Updated this week
omelchert / optfrog
View on GitHub
Analytic signal spectrograms with optimized time-frequency resolution
☆10Oct 6, 2020Updated 5 years ago
socialfoundations / benchbench
View on GitHub
BenchBench is a Python package to evaluate multi-task benchmarks.
☆23Oct 12, 2025Updated 9 months ago
alphacep / openfst
View on GitHub
Openfst mirror with some fixes
☆16Aug 23, 2024Updated last year
ORI-Muchim / Efficient-Speech
View on GitHub
Lightweight Korean TTS Model based on FastSpeech2
☆15Mar 4, 2026Updated 4 months ago
ex3ndr / supervoice-hybrid
View on GitHub
My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one
☆26Aug 5, 2024Updated last year
tarnowski-git / Audio_Spectrum_Analyzer
View on GitHub
Desktop GUI applications to show audio waveform and spectrogram which is visual representation of sound using the amplitude of the freque…
☆12Jul 21, 2023Updated 3 years ago
p1an-lin-jung / wv_tts
View on GitHub
☆19Mar 22, 2024Updated 2 years ago
akq / Leaflet.DonutCluster
View on GitHub
Display donut statistic information instead of only a circle with marker cluster and leaflet.
☆14Apr 8, 2019Updated 7 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
deepgram-devs / voice-agent-medical-assistant-demo
View on GitHub
A Medical / Clinical Note Taking Demo Application using Deepgram Voice Agent API
☆17Jul 9, 2025Updated last year
techpro-studio / MetalAudioShaders
View on GitHub
MPS like shaders for audio processing. Conv1d, Spectrogram.
☆19Apr 3, 2021Updated 5 years ago
KEY60228 / reviewthem.nvim
View on GitHub
A Neovim plugin for streamlining code reviews directly in your editor. Inspired by ReviewIt.
☆19Updated this week
pipecat-ai / stt-benchmark
View on GitHub
Benchmarking STT service TTFB and semantic WER for real-time AI applications
☆90Updated this week
blazerunner44 / survey
View on GitHub
Easy to use survey system in PHP
☆10Mar 7, 2021Updated 5 years ago
jonnor / brewing-audio-event-detection
View on GitHub
Tracking beer/wine using Audio Event Detection with Machine Learning
☆15Jun 16, 2024Updated 2 years ago
NTIA / alignnet
View on GitHub
Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.
☆18Aug 1, 2025Updated 11 months ago