soniox / soniox-compareLinks
Soniox Compare. Compare real-time voice AI side by side. No glossy charts, just results.
☆18Updated 6 months ago
Alternatives and similar repositories for soniox-compare
Users that are interested in soniox-compare are comparing it to the libraries listed below
Sorting:
- Getting confidences from any end-to-end systems☆11Updated 2 years ago
- This is a repository for a paper accepted at the 2022 IEEE Spoken Language Technology Workshop (SLT 2022)☆16Updated 3 years ago
- ☆11Updated 5 months ago
- Simple Kaldi recipe for forced alignment☆11Updated 2 years ago
- ☆20Updated 3 years ago
- This repository includes training, inference, evaluation, and utility scripts developed for fine-tuning the Whisper medium.en model on Ai…☆24Updated last year
- ☆17Updated 4 years ago
- Whisper finetuning☆15Updated 9 months ago
- Stable timestamps and confidence score for words of OpenAI's Whisper outputs down to word-level.☆24Updated 3 years ago
- CLASP: Contrastive Language-Speech Pretraining for Multilingual Multimodal Information Retrieval☆13Updated 7 months ago
- Arabic Grapheme-to-Phoneme (G2P) Conversion☆13Updated 10 months ago
- steps to perform text-based speaker diarization with kaldi toolkit☆12Updated 7 years ago
- ☆24Updated 5 months ago
- eCMU: An Efficient Phase-aware Framework for Music Source Separation with Conformer (IEEE RIVF23)☆10Updated last year
- Using YouTube to prepare a speech recognition dataset for any language☆10Updated 4 years ago
- radiomixer☆14Updated 3 years ago
- A transcribed speech dataset in Wolof, Pulaar and Sereer, to support agriculture. Funded by Lacuna Fund.☆18Updated last year
- Transfer learning approach to pronunciation scoring☆11Updated 2 years ago
- ☆17Updated last year
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW…☆18Updated 3 years ago
- Voice activity detection and speaker gender segmentation audiovisual corpus☆16Updated last year
- Source code of paper <End-to-End Language Diarization for Bilingual Code-switching Speech>☆19Updated 4 years ago
- Code and Resources for "LLM-Powered Grapheme-to-Phoneme Conversion: Benchmark and Case Study", introducing methods to leverage LLMs for G…☆20Updated 8 months ago
- ☆31Updated last year
- A Weakly Supervised Forced Alignment for disluent speech☆15Updated 2 years ago
- Text-to-Speech Latency Benchmark☆22Updated 3 weeks ago
- Agent toolkit for 100 hours of speech and 10 GiB of text☆14Updated 6 months ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge☆15Updated 3 years ago
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆33Updated 3 months ago
- Evaluate results from ASR/Speech-to-Text quickly☆41Updated 4 years ago