A script for audio/transcript alignment. Fork of p2fa.
☆69Mar 15, 2018Updated 8 years ago
Alternatives and similar repositories for p2fa-vislab
Users that are interested in p2fa-vislab are comparing it to the libraries listed below.
- ☆15Nov 9, 2020Updated 5 years ago
- octave multi-channel signal processing☆10May 11, 2014Updated 12 years ago
- Penn Phonetics Lab Forced Aligner Toolkit (P2FA) for Python3☆107Feb 27, 2024Updated 2 years ago
- Modified Python3 P2FA for Mandarin☆10Sep 21, 2020Updated 5 years ago
- A repository for maintaining the fave-align and fave-extract toolkits☆118Mar 29, 2024Updated 2 years ago
- Mapping features using Deep Neural Networks (DNNs) with application to Voice Conversion (VC). The implementations are on top of Theano Py…☆32May 30, 2018Updated 7 years ago
- DeepSpeech based forced alignment tool☆239Dec 12, 2020Updated 5 years ago
- Python wrapper for hts engine☆14Jan 30, 2018Updated 8 years ago
- Praaline is an open-source system to manage, annotate, visualise and analyse spoken language corpora☆30Sep 21, 2022Updated 3 years ago
- Two!Ears Auditory Model - Auditory front-end module☆16Jan 24, 2018Updated 8 years ago
- Speech annotation web app for regular folk☆23Aug 5, 2016Updated 9 years ago
- Python interface for forced audio alignment using HTK and SoX☆350Jun 28, 2020Updated 5 years ago
- Fusion Modality Approaches for sentiment analysis and emotion recognition task.☆12Feb 5, 2021Updated 5 years ago
- Experiment in automatic insertion of timed transcript corrections☆21Oct 31, 2017Updated 8 years ago
- This is a pre-trained LSTM model. This model can help you to segment unpunctuated historical Chinese texts.☆31Nov 19, 2021Updated 4 years ago
- Read, write, and manipulate Praat TextGrid files with Python☆132Feb 25, 2026Updated 2 months ago
- implementation for the paper "Select-Additive Learning: Improving Cross-individual Generalization in Multimodal Sentiment Analysis"☆23Nov 8, 2017Updated 8 years ago
- SailAlign is an open-source software toolkit for robust long speech-text alignment implementing an adaptive, iterative speech recognition…☆99Apr 5, 2022Updated 4 years ago
- Script for converting kaldi GMM/HMM models to HTK format☆11Jul 18, 2024Updated last year
- In this repository, I try to combine k2 with speechbrain to decode well and quickly.☆16Jun 17, 2022Updated 3 years ago
- A collection of links and notes on forced alignment tools☆938Apr 18, 2026Updated last month
- MAGE is a C/C++ software toolkit for reactive implementation of HMM-based speech and singing synthesis.☆62Jul 18, 2014Updated 11 years ago
- ☆68Aug 15, 2019Updated 6 years ago
- Time-Domain Pitch and Time Scale Modification of Speech Signal☆34Jul 29, 2008Updated 17 years ago
- A pytorch wrapper for LF-MMI training and parallel training in Kaldi☆73Jun 8, 2022Updated 3 years ago
- Augmentation scripts for the bAbI Dialog Tasks dataset☆13Oct 16, 2018Updated 7 years ago
- A record of code and notes on speech enhancement algorithms☆31Feb 7, 2017Updated 9 years ago
- Expected edit distance implementation using OpenFst tools☆11May 13, 2015Updated 11 years ago
- Companion code for Awe the Audience: How the Narrative Trajectories Affect Audience Perception in Public Speaking☆14Jan 6, 2018Updated 8 years ago
- A JavaScript interface for annotating and labeling audio files.☆465Mar 7, 2020Updated 6 years ago
- Blind Video Temporal Consistency☆38Jun 28, 2016Updated 9 years ago
- Code for NAACL 2018 paper "Parsing Speech: A Neural Approach to Integrating Lexical and Acoustic-Prosodic Information"☆13May 6, 2017Updated 9 years ago
- Automatically exported from code.google.com/p/transducersaurus☆11Apr 1, 2015Updated 11 years ago
- A github repo of the openSMILE feature extraction tool.☆221Nov 10, 2021Updated 4 years ago
- The main aim of this project is to segment and cluster an audio sample based on speaker when the number of speakers is not known beforehand…☆25Jan 25, 2022Updated 4 years ago
- Pre-trained models for Honk☆11Apr 1, 2019Updated 7 years ago
- An R package for hierarchical clustering with p-values☆50May 18, 2026Updated last week
- A Python package that lets TensorFlow read "Kaldi" scp/ark files in an elegant way, so that Kaldi users can happily enter the TensorFlow world.☆40Nov 26, 2018Updated 7 years ago
- Voice-changing processing. Taken from someone else and kept here so I don't forget it.☆14Jul 20, 2016Updated 9 years ago