A hackable speech recognition library.
☆25 · Oct 16, 2024 · Updated last year
Alternatives and similar repositories for thunder-speech
Users who are interested in thunder-speech are comparing it to the repositories listed below.
- PodcastMix: A dataset for separating music and speech in podcasts. ☆44 · Aug 20, 2024 · Updated last year
- This repository provides a small Python wrapper for the Matlab tool SNR Eval provided by Labrosa: https://labrosa.ee.columbia.edu/project… ☆12 · Jun 22, 2022 · Updated 3 years ago
- Repository for reproducing result in journal "Self-supervised learning for Speech Emotion Recognition" ☆10 · Mar 15, 2023 · Updated 2 years ago
- ☆10 · Jun 23, 2023 · Updated 2 years ago
- ☆10 · Sep 19, 2022 · Updated 3 years ago
- PyTorch toolkit for streaming speech recognition, speech translation and simultaneous translation based on fairseq. ☆25 · Oct 3, 2022 · Updated 3 years ago
- Code for "Phoneme Segmentation Using Self-Supervised Speech Models", Strgar & Harwath, Proceedings of the IEEE Spoken Language Technology… ☆55 · Nov 4, 2022 · Updated 3 years ago
- This repo contains the code for "Voice Disorder Analysis: A Transformer-based Approach", accepted at Interspeech 2024 ☆15 · Jun 11, 2024 · Updated last year
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks ☆17 · Aug 18, 2023 · Updated 2 years ago
- Nvidia GPU Fan Controller for linux ☆15 · May 27, 2024 · Updated last year
- Code for the winning solution in the SE&R 2022 Challenge - SER track. ☆16 · Mar 28, 2023 · Updated 2 years ago
- Goodness of Pronunciation algorithm using PyKaldi ☆18 · Jun 12, 2022 · Updated 3 years ago
- Wav2vec resources and models for Brazilian Portuguese ☆36 · Jul 15, 2022 · Updated 3 years ago
- Speech Emotion Recognition using PyTorch sponsored by AIS and VISTEC-DEPA AIResearch Institute Thailand. ☆22 · Nov 6, 2021 · Updated 4 years ago
- Explore different ways to mix speech models (wav2vec2, HuBERT) and NLP models (BART, T5, GPT) together ☆46 · Jul 3, 2025 · Updated 8 months ago
- Mr. Right: Multimodal Retrieval on Representation of ImaGe witH Text ☆24 · Aug 15, 2022 · Updated 3 years ago
- Durham University Oberon-2 Compiler ☆22 · Jan 21, 2015 · Updated 11 years ago
- Workflow for forced alignment between languages ☆23 · Jan 13, 2026 · Updated last month
- 56 language, 1 model Multilingual ASR ☆24 · Jul 25, 2021 · Updated 4 years ago
- Code for the method proposed in the paper:- ccc-wav2vec 2.0: Clustering aided Cross-Contrastive learning of Self-Supervised speech repres… ☆23 · Mar 18, 2024 · Updated last year
- ☆57 · Apr 18, 2023 · Updated 2 years ago
- NAACL 2022: MCSE: Multimodal Contrastive Learning of Sentence Embeddings ☆58 · Jun 10, 2024 · Updated last year
- Baseline kaldi script for UA-SPEECH corpus ☆32 · Oct 16, 2024 · Updated last year
- ☆25 · Jun 14, 2022 · Updated 3 years ago
- Making Espnet easier to use ☆54 · Apr 9, 2021 · Updated 4 years ago
- Compendium for the paper "Transparent pronunciation scoring using articulatorily weighted phoneme edit distance" by Karhila, Smolander, Y… ☆25 · May 6, 2019 · Updated 6 years ago
- Vecna is a Python chatbot which recommends songs and movies depending upon your feelings ☆12 · Jun 28, 2022 · Updated 3 years ago
- ☆32 · Jan 6, 2022 · Updated 4 years ago
- Linguistic processing for Common Voice ☆58 · Jan 18, 2024 · Updated 2 years ago
- Syllable Segmentation and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model ☆34 · Aug 27, 2023 · Updated 2 years ago
- Transcribing Speech with Multinomial Diffusion, training code and models. ☆80 · Sep 27, 2023 · Updated 2 years ago
- A lightweight implementation of shapes drawn across a geo-temporal plane. ☆12 · Jan 27, 2026 · Updated last month
- edaSQL is a python library to bridge the SQL with Exploratory Data Analysis where you can connect to the Database and insert the queries.… ☆10 · Nov 14, 2021 · Updated 4 years ago
- ☆71 · Jul 13, 2023 · Updated 2 years ago
- ☆70 · Sep 13, 2024 · Updated last year
- Torch implementation of NANSY, Neural Analysis and Synthesis, arXiv:2110.14513 ☆64 · Feb 13, 2023 · Updated 3 years ago
- A collection of scripts to preprocess ASR datasets and finetune language-specific Wav2Vec2 XLSR models ☆30 · Apr 21, 2021 · Updated 4 years ago
- ☆41 · May 15, 2023 · Updated 2 years ago
- ☆37 · Jun 30, 2022 · Updated 3 years ago