Bartelds/neural-acoustic-distance

Code associated with the paper: Neural Representations for Modeling Variation in Speech.

☆18

Alternatives and similar repositories for neural-acoustic-distance

Users who are interested in neural-acoustic-distance are comparing it to the libraries listed below.

Bartelds / acoustic-distance-measure
Acoustic distance measure for comparing pronunciations
☆17 · Aug 2, 2022 · Updated 3 years ago
Bartelds / ctc-dro
Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.
☆17 · May 16, 2025 · Updated last year
xcmyz / Tacotron2-Pytorch
Follows NVIDIA, simplifies it, and supports data parallelism.
☆13 · Sep 26, 2019 · Updated 6 years ago
soskuthy / gamm_strategies
Supplementary materials for "Evaluating generalised additive mixed modelling strategies for dynamic speech analysis"
☆10 · Jan 25, 2021 · Updated 5 years ago
danieltmc / BibleScraper
Web scraper for BibleGateway that will retrieve the entire Bible in a translation of the user's choice to be stored in plain text.
☆10 · Jul 21, 2019 · Updated 6 years ago
Nathaniel-Haines / Reliability_2020
☆12 · Mar 24, 2024 · Updated 2 years ago
dayanavivolab / s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit.
☆10 · Feb 29, 2024 · Updated 2 years ago
kwatcharasupat / musdb25
MUSDB25 - A Fully Multitrack Dataset for Music Source Separation
☆13 · Mar 29, 2025 · Updated last year
resemble-ai / normalise
A module for normalising text.
☆10 · Nov 6, 2019 · Updated 6 years ago
MTG / SingWithExpressions
This is the accompanying repository to the paper: Automatic Estimation of Singing Voice Musical Dynamics
☆15 · Oct 28, 2024 · Updated last year
miccio-dk / NISQA
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
☆16 · Apr 13, 2022 · Updated 4 years ago
krylm / whisper-event-tuning
Final training script from the Hugging Face Whisper fine-tuning event, intended to get the best results from the fine-tuned model.
☆12 · Dec 24, 2022 · Updated 3 years ago
emirdemirel / DALI-TestSet4ALT
This is a subset of the DALI set consisting of 240 polyphonic recordings that is used to benchmark lyrics transcription evaluation.
☆12 · Nov 30, 2021 · Updated 4 years ago
CODEJIN / Speaker_Embedding_Torch
PyTorch-based speaker embedding model
☆16 · Apr 13, 2024 · Updated 2 years ago
rmbrualla / pycolmap
Python interface for COLMAP reconstructions
☆21 · Jul 28, 2024 · Updated last year
CoEDL / elan-helpers
Tools and scripts for working with ELAN
☆10 · Aug 4, 2022 · Updated 3 years ago
mingsjtu / 3DCartoonGenerator
Code for our CICAI 2022 paper "3D Face Cartoonizer: Generating Personalized 3D Cartoon Faces from 2D Real Photos with a Hybrid Dataset".
☆10 · Aug 9, 2022 · Updated 3 years ago
sayakpaul / deploy-hf-tf-vision-models
This repository shows various ways of deploying a vision model (TensorFlow) from 🤗 Transformers.
☆30 · Aug 22, 2022 · Updated 3 years ago
ms609 / Quartet
R package to calculate the similarity of two trees based on the number of shared four-taxon subtrees (or splits)
☆17 · Jun 19, 2026 · Updated 2 weeks ago
quantling / pyndl
pyndl implements Naive Discriminative Learning, a learning and classification model based on the Rescorla-Wagner equations in …
☆13 · Dec 8, 2025 · Updated 7 months ago
THUsatlab / BERT-LID
Leveraging BERT to Improve Spoken Language Identification
☆17 · Nov 22, 2022 · Updated 3 years ago
realyinchen / pytorch-deep-learning
☆16 · Jun 13, 2024 · Updated 2 years ago
xiaoxue1117 / speech-mamba-public
☆15 · Nov 26, 2024 · Updated last year
amourgela / hearinglosssimulationplugin
Hearing loss simulation VST plugin
☆14 · Mar 14, 2025 · Updated last year
zhiqic / KeyPosS
[ACM MM 2023] KeyPosS: Plug-and-Play Facial Landmark Detection through GPS-Inspired True-Range Multilateration
☆12 · Nov 21, 2023 · Updated 2 years ago
MLSpeech / DeepFormants
Formant Tracking & Estimation
☆83 · Dec 15, 2024 · Updated last year
JazminVidal / gop-ft
Transfer learning approach to pronunciation scoring
☆12 · Jan 17, 2024 · Updated 2 years ago
GRousselet / bootstrap
☆16 · Dec 6, 2023 · Updated 2 years ago
soskuthy / gamm_intro
☆28 · Mar 10, 2017 · Updated 9 years ago
lstrgar / ss-phoneme-seg
Code for "Phoneme Segmentation Using Self-Supervised Speech Models", Strgar & Harwath, Proceedings of the IEEE Spoken Language Technology…
☆55 · Nov 4, 2022 · Updated 3 years ago
jtimonen / lgpr
R-package for interpretable nonparametric modeling of longitudinal data using additive Gaussian processes. Contains functionality for in…
☆28 · Oct 30, 2025 · Updated 8 months ago
chengl7 / LonGP
Gaussian process regression + automatic model selection for longitudinal -omics data
☆20 · Mar 11, 2021 · Updated 5 years ago
guxm2021 / ALT_SpeechBrain
[ISMIR 2022] Transfer Learning of wav2vec 2.0 for Automatic Lyric Transcription
☆51 · May 7, 2024 · Updated 2 years ago
uncbiag / NAISR
NAISR: A 3D Neural Additive Model for Interpretable Shape Representation
☆18 · Apr 29, 2024 · Updated 2 years ago
EricWilbanks / faseAlign
Command line tool for forced alignment of Spanish speech data
☆13 · Dec 31, 2025 · Updated 6 months ago
lingjzhu / charsiu
Charsiu: A neural phonetic aligner.
☆346 · Sep 19, 2022 · Updated 3 years ago
mushanshanshan / ESLTTS
ESLTTS dataset
☆16 · Feb 6, 2025 · Updated last year
songc42 / Feature-proliferation
☆11 · Nov 9, 2023 · Updated 2 years ago
MLSpeech / DeepPhoneticToolsTutorial
Tutorial on {Deep} Phonetic Tools given in BigPhon @ LabPhon15
☆12 · Apr 17, 2017 · Updated 9 years ago