arenjansen/ZRTools

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/arenjansen/ZRTools)

arenjansen / ZRTools

Zero-Resource Speech Discovery, Search, and Evaluation Tools

☆29

Alternatives and similar repositories for ZRTools

Users that are interested in ZRTools are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

fgnt / LatticeWordSegmentation
View on GitHub
Software to apply unsupervised word segmentation on lattices or text sequences using a nested hierarchical Pitman Yor language model
☆17Nov 24, 2016Updated 9 years ago
ga642381 / Taiwanese-Speech-Synthesis
View on GitHub
Taiwanese Speech Synthesis with Tacotron2
☆26Oct 2, 2022Updated 3 years ago
kamperh / speech_correspondence
View on GitHub
Correspondence and autoencoder neural network training for speech using Pylearn2.
☆14Dec 9, 2015Updated 10 years ago
zerospeech / zerospeech2017
View on GitHub
All you need to get started for the Zero Speech Challenge 2017
☆47Apr 23, 2019Updated 7 years ago
newslynx / zuckup
View on GitHub
get facebook data
☆10Sep 14, 2014Updated 11 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
kuan2jiu99 / audio-hallucination
View on GitHub
Understanding and Tackling Hallucinations in Large Audio-Language Models | ICASSP 2025, Interspeech 2024
☆34Mar 14, 2025Updated last year
idiap / phonvoc
View on GitHub
Phonetic and phonological vocoding platform
☆17Nov 23, 2016Updated 9 years ago
Open-Speech-EkStep / crowdsource-dataplatform
View on GitHub
This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…
☆17Mar 6, 2023Updated 3 years ago
galv / galvASR
View on GitHub
ASR library
☆14Dec 3, 2018Updated 7 years ago
nigelgward / midlevel
View on GitHub
Prosodic features for machine-learning applications, in Matlab.
☆15Oct 14, 2025Updated 9 months ago
lucasondel / amdtk
View on GitHub
☆12Feb 26, 2018Updated 8 years ago
JSALT-2022-SSL / superb-prosody
View on GitHub
☆31Jul 13, 2023Updated 3 years ago
athena-team / athena-decoder
View on GitHub
☆76Mar 18, 2022Updated 4 years ago
yuhaozhang / nnjm-global
View on GitHub
A python implementation of the neural network joint language model and an extension of it using global source context.
☆11May 17, 2017Updated 9 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
zhaoyanpeng / xcfg
View on GitHub
X (weighted / probabilistic) Context-Free Grammars
☆25Jan 30, 2024Updated 2 years ago
NextCenturyCorporation / neon
View on GitHub
☆55Jan 10, 2020Updated 6 years ago
bootphon / learnable-strf
View on GitHub
Learnable STRF, from Riad et al. 2021 JASA
☆13Aug 21, 2021Updated 4 years ago
XapaJIaMnu / gLM
View on GitHub
A GPU language model, based on btree backed tries.
☆30Mar 6, 2018Updated 8 years ago
bajibabu / GlottGAN
View on GitHub
This repository contains the files used for our Interspeech 2017 paper.
☆16May 30, 2017Updated 9 years ago
neubig / latticelm
View on GitHub
Software for unsupervised word segmentation and language model learning using lattices
☆45Aug 17, 2016Updated 9 years ago
joaoantoniocn / AM-MobileNet1D
View on GitHub
The Additive Margin MobileNet1D is a new light weight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi…
☆31Oct 3, 2023Updated 2 years ago
jpinedaa / Voice-ML
View on GitHub
MobileNet trained with VoxCeleb dataset and used for voice verification
☆18Oct 26, 2022Updated 3 years ago
i-lijun / UnsupConstParseEval
View on GitHub
An Empirical Comparison of Unsupervised Constituency Parsing Methods
☆14Aug 15, 2021Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
dobby-seo / kosr
View on GitHub
Korean speech recognition based on transformer (트랜스포머 기반 한국어 음성 인식)
☆31Feb 19, 2021Updated 5 years ago
idiap / zff_vad
View on GitHub
Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering
☆23Oct 19, 2023Updated 2 years ago
idnavid / speech_activity_detection
View on GitHub
Unsupervised speech activity detection system.
☆11Jul 2, 2018Updated 8 years ago
Hannes1 / react-native-wenet
View on GitHub
Wenet speech to text for react native
☆10Nov 1, 2022Updated 3 years ago
ejhumphrey / dl4mir-dissertation
View on GitHub
Humphrey, E. J. "An Exploration of Deep Learning in Music Informatics." (2015) New York University.
☆14Feb 23, 2016Updated 10 years ago
danijel3 / SparrowhawkTest
View on GitHub
A simple tutorial on setting up Sparrowhawk - a text-to-speech normalization engine
☆14Oct 16, 2017Updated 8 years ago
abhinavk96 / Transcriptor
View on GitHub
A transcription text editor with respeak module
☆14Jan 24, 2026Updated 5 months ago
andybi7676 / reborn-uasr
View on GitHub
REBORN: Reinforcement-Learned Boundary Segmentation with Iterative Training for Unsupervised ASR
☆15Dec 11, 2024Updated last year
revdotcom / words2num
View on GitHub
Convert words to numbers
☆21Apr 13, 2022Updated 4 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
fgnt / nhpylm
View on GitHub
Python bindings for a c++ based implementation of the Nested Hierarchical Pitman-Yor Language model
☆13Nov 24, 2016Updated 9 years ago
kamperh / speech_dtw
View on GitHub
Dynamic time warping (DTW) functions for specifically speech alignment.
☆30May 6, 2024Updated 2 years ago
nervjack2 / Speech2Unit
View on GitHub
☆13Sep 25, 2024Updated last year
zszheng147 / VoiceCraft-X
View on GitHub
☆40Nov 18, 2025Updated 8 months ago
zxie / nn
View on GitHub
☆19May 16, 2015Updated 11 years ago
talhanai / kaldi-diar-latte
View on GitHub
steps to perform text-based speaker diarization with kaldi toolkit
☆12Nov 2, 2018Updated 7 years ago
Chung-I / youtube-asr-crawler
View on GitHub
☆10Sep 19, 2022Updated 3 years ago