Code associated with the paper: Neural Representations for Modeling Variation in Speech.
☆18Mar 10, 2022Updated 4 years ago
Alternatives and similar repositories for neural-acoustic-distance
Users that are interested in neural-acoustic-distance are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆17May 16, 2025Updated last year
- follow NVIDIA, simplify it and support data parallel.☆13Sep 26, 2019Updated 6 years ago
- Supplementary materials for "Evaluating generalised additive mixed modelling strategies for dynamic speech analysis"☆10Jan 25, 2021Updated 5 years ago
- Web scraper for BibleGateway that will retrieve the entire Bible in a translation of the user's choice to be stored in plain text.☆10Jul 21, 2019Updated 6 years ago
- Self-Supervised Speech Pre-training and Representation Learning Toolkit.☆10Feb 29, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A module for normalising text.☆10Nov 6, 2019Updated 6 years ago
- This is the accompanying repository to the paper - Automatic Estimation of Singing Voice Musical Dynamics☆15Oct 28, 2024Updated last year
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Apr 13, 2022Updated 4 years ago
- Final training script from HuggingFace Whisper Fine tuning event - to get best results on finetuned model.☆12Dec 24, 2022Updated 3 years ago
- This is a subset of the DALI set consisting of 240 polyphonic recordings that is used to benchmark lyrics transcription evaluation.☆12Nov 30, 2021Updated 4 years ago
- Syllable Segmentation and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model☆35Aug 27, 2023Updated 2 years ago
- PyTorch based speaker embedding model☆16Apr 13, 2024Updated 2 years ago
- DALI datasets split used to train models presented in the paper Multilingual lyrics-to-audio alignment (ISMIR 2020).☆13May 25, 2021Updated 5 years ago
- Python interface for COLMAP reconstructions☆21Jul 28, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Tools and scripts for working with ELAN☆10Aug 4, 2022Updated 3 years ago
- Code for our CICAI 2022 paper "3D Face Cartoonizer: Generating Personalized 3D Cartoon Faces from 2D Real Photos with a Hybrid Dataset".☆10Aug 9, 2022Updated 3 years ago
- This repository shows various ways of deploying a vision model (TensorFlow) from 🤗 Transformers.☆30Aug 22, 2022Updated 3 years ago
- Leveraging BERT to Improve Spoken Language Identification☆18Nov 22, 2022Updated 3 years ago
- python code for converting among IPA, ARPABET, XSAMPA, Callhome, DISC, TIMIT, plus some lexical tones.☆44Sep 7, 2025Updated 9 months ago
- ☆16Jun 13, 2024Updated 2 years ago
- ☆15Nov 26, 2024Updated last year
- Hearing loss simulation VST plugin☆14Mar 14, 2025Updated last year
- [ACM MM 2023] KeyPosS: Plug-and-Play Facial Landmark Detection through GPS-Inspired True-Range Multilateration☆12Nov 21, 2023Updated 2 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Formant Tracking & Estimation☆83Dec 15, 2024Updated last year
- Transfer learning approach to pronunciation scoring☆12Jan 17, 2024Updated 2 years ago
- ☆16Dec 6, 2023Updated 2 years ago
- ☆28Mar 10, 2017Updated 9 years ago
- Code for "Phoneme Segmentation Using Self-Supervised Speech Models", Strgar & Harwath, Proceedings of the IEEE Spoken Language Technology…☆55Nov 4, 2022Updated 3 years ago
- [ISMIR 2022] Transfer Learning of wav2vec 2.0 for Automatic Lyric Transcription☆51May 7, 2024Updated 2 years ago
- NAISR: A 3D Neural Additive Model for Interpretable Shape Representation☆18Apr 29, 2024Updated 2 years ago
- Command line tool for forced-alignment of Spanish speech data☆13Dec 31, 2025Updated 5 months ago
- Charsiu: A neural phonetic aligner.☆345Sep 19, 2022Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ESLTTS dataset☆16Feb 6, 2025Updated last year
- ☆11Nov 9, 2023Updated 2 years ago
- Tutorial on {Deep} Phonetic Tools given in BigPhon @ LabPhon15☆12Apr 17, 2017Updated 9 years ago
- SWIPE Algorithm implementation in Python☆12Dec 24, 2016Updated 9 years ago
- Implementation of paper "End-to-end lyrics alignment for polyphonic music using an audio-to-character recognition model"☆18Nov 20, 2022Updated 3 years ago
- Semi-Supervised Contrastive Learning for music classification - towards HIL-representation learning.☆17Jul 24, 2024Updated last year
- Praat-based tools for EGG analysis☆20Sep 21, 2023Updated 2 years ago