ucbvislab/p2fa-vislab

ucbvislab / p2fa-vislab

A script for audio/transcript alignment. Fork of p2fa.

☆69

Alternatives and similar repositories for p2fa-vislab

Users interested in p2fa-vislab are comparing it to the repositories listed below.

ucbvislab / vdigests
☆15 · Nov 9, 2020 · Updated 5 years ago
ucbvislab / radiotool
A Python library for manipulating audio files
☆45 · Oct 6, 2015 · Updated 10 years ago
ThomasFeher / oms
Octave multi-channel signal processing
☆10 · May 11, 2014 · Updated 12 years ago
jaekookang / p2fa_py3
Penn Phonetics Lab Forced Aligner Toolkit (P2FA) for Python3
☆107 · Feb 27, 2024 · Updated 2 years ago
chenchenzi / P2FA_Mandarin_py3
Modified Python3 P2FA for Mandarin
☆10 · Sep 21, 2020 · Updated 5 years ago
JoFrhwld / FAVE
A repository for maintaining the fave-align and fave-extract toolkits
☆118 · Mar 29, 2024 · Updated 2 years ago
ucbvislab / speecheditor
GUI text-based speech and music editor for creating radio/audio stories
☆80 · Dec 8, 2022 · Updated 3 years ago
luster / tawfDereverb
Fast Audio Dereverberation MATLAB System - Senior project at Cooper Union
☆21 · Apr 30, 2014 · Updated 12 years ago
bregmanstudio / voxid
Singing voice analysis and detection tools
☆21 · Jun 10, 2015 · Updated 11 years ago
shamidreza / dnnmapper
Mapping features using Deep Neural Networks (DNNs) with application to Voice Conversion (VC). The implementations are on top of Theano Py…
☆32 · May 30, 2018 · Updated 8 years ago
mozilla / DSAlign
DeepSpeech based forced alignment tool
☆239 · Dec 12, 2020 · Updated 5 years ago
TWOEARS / auditory-front-end
Two!Ears Auditory Model - Auditory front-end module
☆16 · Jan 24, 2018 · Updated 8 years ago
i3thuan5 / hts_engine_python
Python wrapper for hts engine
☆14 · Jan 30, 2018 · Updated 8 years ago
gewang / ofxChucK
OpenFrameworks + ChucK
☆17 · Mar 18, 2016 · Updated 10 years ago
praaline / Praaline
Praaline is an open-source system to manage, annotate, visualise and analyse spoken language corpora
☆30 · Sep 21, 2022 · Updated 3 years ago
aikuma / aikuma-ng
Speech annotation web app for regular folk
☆23 · Aug 5, 2016 · Updated 9 years ago
cbdb-project / sentence-segmentation-for-chinese-historical-texts
This is a pre-trained LSTM model. This model can help you to segment unpunctuated historical Chinese texts. 這是基於 LSTM 的預訓練模型。此模型可幫助您為漢語古文…
☆32 · Nov 19, 2021 · Updated 4 years ago
gkoumasd / MSAF
Modality fusion approaches for sentiment analysis and emotion recognition tasks.
☆12 · Feb 5, 2021 · Updated 5 years ago
alexnorton / overtyper
Experiment in automatic insertion of timed transcript corrections
☆21 · Oct 31, 2017 · Updated 8 years ago
hbuschme / TextGridTools
Read, write, and manipulate Praat TextGrid files with Python
☆131 · Feb 25, 2026 · Updated 4 months ago
HaohanWang / SelectAdditiveLearning
Implementation of the paper "Select-Additive Learning: Improving Cross-individual Generalization in Multimodal Sentiment Analysis"
☆23 · Nov 8, 2017 · Updated 8 years ago
dansoutner / kaldi2htk
Script for converting Kaldi GMM/HMM models to HTK format
☆11 · Jul 18, 2024 · Updated 2 years ago
nassosoassos / sail_align
SailAlign is an open-source software toolkit for robust long speech-text alignment implementing an adaptive, iterative speech recognition…
☆99 · Apr 5, 2022 · Updated 4 years ago
numediart / mage
MAGE is a C/C++ software toolkit for reactive implementation of HMM-based speech and singing synthesis.
☆62 · Jul 18, 2014 · Updated 12 years ago
luomingshuang / k2-speechbrain
In this repository, I try to combine k2 with speechbrain for accurate and fast decoding.
☆16 · Jun 17, 2022 · Updated 4 years ago
pettarin / forced-alignment-tools
A collection of links and notes on forced alignment tools
☆942 · Updated this week
victorywys / RAVEN
☆68 · Aug 15, 2019 · Updated 6 years ago
idiap / pkwrap
A PyTorch wrapper for LF-MMI training and parallel training in Kaldi
☆74 · Jun 8, 2022 · Updated 4 years ago
lyapple2008 / SpeechEnhancement
Code and notes on speech enhancement algorithms
☆31 · Feb 7, 2017 · Updated 9 years ago
arne-cl / discoursegraphs
Linguistic converter / merging tool for multi-level annotated corpora. Graph-based (using Python and NetworkX).
☆51 · Nov 14, 2025 · Updated 8 months ago
dogancan / expected-edit-distance
Expected edit distance implementation using OpenFst tools
☆11 · May 13, 2015 · Updated 11 years ago
shtoshni / speech_parsing
Code for NAACL 2018 paper "Parsing Speech: A Neural Approach to Integrating Lexical and Acoustic-Prosodic Information"
☆13 · May 6, 2017 · Updated 9 years ago
CrowdCurio / audio-annotator
A JavaScript interface for annotating and labeling audio files.
☆467 · Mar 7, 2020 · Updated 6 years ago
willhope / Noise-reduction
Noise reduction using the Log_MMSE method, implemented in C
☆75 · Aug 25, 2015 · Updated 10 years ago
nbonneel / blindconsistency
Blind Video Temporal Consistency
☆38 · Jun 28, 2016 · Updated 10 years ago
MontrealCorpusTools / speechcorpustools
Easier analysis of large speech corpora
☆24 · Jun 22, 2021 · Updated 5 years ago
shimo-lab / pvclust
An R package for hierarchical clustering with p-values
☆52 · May 18, 2026 · Updated 2 months ago
markusdr / transducersaurus
Automatically exported from code.google.com/p/transducersaurus
☆11 · Apr 1, 2015 · Updated 11 years ago
naxingyu / opensmile
A GitHub repo of the openSMILE feature extraction tool.
☆221 · Nov 10, 2021 · Updated 4 years ago