A script for audio/transcript alignment. Fork of p2fa.
☆69Mar 15, 2018Updated 8 years ago
Alternatives and similar repositories for p2fa-vislab
Users that are interested in p2fa-vislab are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆15Nov 9, 2020Updated 5 years ago
- octave multi-channel signal processing☆10May 11, 2014Updated 11 years ago
- Penn Phonetics Lab Forced Aligner Toolkit (P2FA) for Python3☆107Feb 27, 2024Updated 2 years ago
- Modified Python3 P2FA for Mandarin☆10Sep 21, 2020Updated 5 years ago
- A repository for maintaing the fave-align and fave-extract toolkits☆118Mar 29, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- GUI text-based speech and music editor for creating radio/audio stories☆80Dec 8, 2022Updated 3 years ago
- Fast Audio Dereverberation MATLAB System - Senior project at Cooper Union☆22Apr 30, 2014Updated 11 years ago
- singing voice analysis and detection tools☆21Jun 10, 2015Updated 10 years ago
- Mapping features using Deep Neural Networks (DNNs) with application to Voice Conversion (VC). The implementations are on top of Theano Py…☆32May 30, 2018Updated 7 years ago
- DeepSpeech based forced alignment tool☆239Dec 12, 2020Updated 5 years ago
- python wrap for hts engine☆14Jan 30, 2018Updated 8 years ago
- a python library for manipulating audio files☆45Oct 6, 2015Updated 10 years ago
- Praaline is an open-source system to manage, annotate, visualise and analyse spoken language corpora☆30Sep 21, 2022Updated 3 years ago
- OpenFrameworks + ChucK☆16Mar 18, 2016Updated 10 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Two!Ears Auditory Model - Auditory front-end module☆16Jan 24, 2018Updated 8 years ago
- Speech annotation web app for regular folk☆22Aug 5, 2016Updated 9 years ago
- Python interface for forced audio alignment using HTK and SoX☆350Jun 28, 2020Updated 5 years ago
- Fusion Modality Approaches for sentiment analysis and emotion recognition task.☆12Feb 5, 2021Updated 5 years ago
- Experiment in automatic insertion of timed transcript corrections☆21Oct 31, 2017Updated 8 years ago
- This is a pre-trained LSTM model. This model can help you to segment unpunctuated historical Chinese texts. 這是基於 LSTM 的預訓練模型。此模型可幫助您為漢語古文…☆30Nov 19, 2021Updated 4 years ago
- Read, write, and manipulate Praat TextGrid files with Python☆131Feb 25, 2026Updated last month
- implementation for the paper "Select-Additive Learning: Improving Cross-individual Generalization in Multimodal Sentiment Analysis"☆23Nov 8, 2017Updated 8 years ago
- SailAlign is an open-source software toolkit for robust long speech-text alignment implementing an adaptive, iterative speech recognition…☆99Apr 5, 2022Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Script for converting kaldi GMM/HMM models to HTK format☆11Jul 18, 2024Updated last year
- In this repository, I try to combine k2 with speechbrain to decode well and fastly.☆16Jun 17, 2022Updated 3 years ago
- A collection of links and notes on forced alignment tools☆938Nov 10, 2021Updated 4 years ago
- MAGE is a C/C++ software toolkit for reactive implementation of HMM-based speech and singing synthesis.☆62Jul 18, 2014Updated 11 years ago
- ☆68Aug 15, 2019Updated 6 years ago
- A pytorch wrapper for LF-MMI training and parallel training in Kaldi☆73Jun 8, 2022Updated 3 years ago
- Augmentation scripts for the bAbI Dialog Tasks dataset☆13Oct 16, 2018Updated 7 years ago
- To record some code and note about speech enhancement algorithm☆31Feb 7, 2017Updated 9 years ago
- linguistic converter / merging tool for multi-level annotated corpora. graph-based (using Python and NetworkX).☆51Nov 14, 2025Updated 5 months ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Companion code for Awe the Audience: How the Narrative Trajectories Affect Audience Perception in Public Speaking☆14Jan 6, 2018Updated 8 years ago
- Expected edit distance implementation using OpenFst tools☆11May 13, 2015Updated 10 years ago
- A JavaScript interface for annotating and labeling audio files.☆465Mar 7, 2020Updated 6 years ago
- Noise reduction using Log_MMSE method implement by C language☆75Aug 25, 2015Updated 10 years ago
- Blind Video Temporal Consistency☆38Jun 28, 2016Updated 9 years ago
- Code for NAACL 2018 paper "Parsing Speech: A Neural Approach to Integrating Lexical and Acoustic-Prosodic Information"☆13May 6, 2017Updated 8 years ago
- Automatically exported from code.google.com/p/transducersaurus☆11Apr 1, 2015Updated 11 years ago