A large-scale publicly-available visual-thermal-audio dataset designed to encourage research in the general areas of user authentication, facial recognition, speech recognition, and human-computer interaction.
☆88Jul 24, 2025Updated 7 months ago
Alternatives and similar repositories for SpeakingFaces
Users that are interested in SpeakingFaces are comparing it to the libraries listed below
Sorting:
- ☆15Jul 11, 2022Updated 3 years ago
- ☆15May 8, 2021Updated 4 years ago
- Russian phonetical transcription☆11Nov 19, 2025Updated 3 months ago
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR☆11Oct 21, 2022Updated 3 years ago
- Code for paper titled "Using generative modelling to produce varied intonation for speech synthesis" submitted to the Speech Synthesis Wo…☆24Dec 8, 2019Updated 6 years ago
- ☆13Nov 22, 2022Updated 3 years ago
- Unsupervised speech activity detection system.☆11Jul 2, 2018Updated 7 years ago
- Multilingual acoustic word embedding approaches applied and evaluated on GlobalPhone data.☆11Nov 3, 2020Updated 5 years ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech☆51Oct 8, 2021Updated 4 years ago
- Feature extractor for DL speech processing.☆66Apr 13, 2022Updated 3 years ago
- A tutorial diphone synthesizer in Python☆25Nov 26, 2018Updated 7 years ago
- ☆12Oct 2, 2020Updated 5 years ago
- ☆11May 7, 2022Updated 3 years ago
- Official PyTorch implementation of (ICME2025 oral) "AutoStyle-TTS: Retrieval-Augmented Generation based Automatic Style Matching Text-to-…☆16Feb 1, 2026Updated last month
- This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…☆11Feb 4, 2020Updated 6 years ago
- Software for Decoding of High Order Ambisonics to Irregular Layouts☆12Mar 20, 2014Updated 11 years ago
- Code for the paper "FastAdaSP: An Efficient Multitask Inference Framework for Large Speech Language Models". @ EMNLP'24(Oral)☆13Nov 14, 2024Updated last year
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- Neural model for prediction of stress position in Russian words☆13Jun 22, 2025Updated 8 months ago
- Code for NAACL 2018 paper "Parsing Speech: A Neural Approach to Integrating Lexical and Acoustic-Prosodic Information"☆13May 6, 2017Updated 8 years ago
- Syllable Segmentation and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model☆34Aug 27, 2023Updated 2 years ago
- A repository comprising of code for generation of noisy speech data from clean data using deep learning methods☆16Jul 12, 2021Updated 4 years ago
- Models and codes for INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model☆13Mar 30, 2025Updated 11 months ago
- Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments☆111Mar 19, 2024Updated last year
- This is the experimental description of MnTTS2.☆11Apr 11, 2024Updated last year
- A simple command line tool to calculate WER for ASR.☆14Oct 14, 2024Updated last year
- SERAB: a multi-lingual benchmark for speech emotion recognition☆28Dec 16, 2022Updated 3 years ago
- Neural Dereverberation☆36May 22, 2019Updated 6 years ago
- ☆14Jun 12, 2015Updated 10 years ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Jun 2, 2023Updated 2 years ago
- ☆30Jan 22, 2026Updated last month
- A Weakly Supervised Forced Alignment for disluent speech☆15Nov 12, 2023Updated 2 years ago
- C++ version of pyannote audio overlapped speech detection pipeline☆13Feb 14, 2024Updated 2 years ago
- PolEval 2021 Task 1☆15Jun 28, 2022Updated 3 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Apr 13, 2022Updated 3 years ago
- Fast and differentiable hidden Markov model in C++☆19Jan 20, 2023Updated 3 years ago
- microphone array speech generator (MASG) in room acoustic☆39Jan 2, 2020Updated 6 years ago
- ☆12Jun 10, 2021Updated 4 years ago
- ☆17Oct 16, 2018Updated 7 years ago