Semi-supervised spoken language understanding (SLU) via self-supervised speech and language model pretraining
☆12Mar 23, 2021Updated 5 years ago
Alternatives and similar repositories for Semi-Supervsied-Spoken-Language-Understanding-PyTorch
Users that are interested in Semi-Supervsied-Spoken-Language-Understanding-PyTorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Test-time adaptation for speech recognition model by single utterance. The official implementation of "Listen, Adapt, Better WER: Source-…☆21Apr 1, 2022Updated 3 years ago
- unsupervised ASR (mainly phone classifier) using EODM and GAN☆12Oct 22, 2020Updated 5 years ago
- The repo contains our code of ``Semantic Mask for Transformer based End-to-End Speech Recognition"☆39Jun 9, 2020Updated 5 years ago
- Modular and extensible speech recognition library leveraging pytorch-lightning and hydra.☆50May 19, 2021Updated 4 years ago
- PyTorch code for end-to-end spoken language understanding (SLU) with ASR-based transfer learning☆231Mar 23, 2021Updated 5 years ago
- ☆48Jan 8, 2021Updated 5 years ago
- Enable RNNLM lattice rescoring with Pytorch [kaldi]☆12Jun 5, 2020Updated 5 years ago
- ☆37Jun 28, 2021Updated 4 years ago
- Repository for speech paper reading☆33Aug 19, 2021Updated 4 years ago
- E2E-SincNet: Toward fully end-to-end speech recognition☆30Feb 1, 2020Updated 6 years ago
- A Fairseq implementation of Listen, Attend and Spell (LAS), an End-to-End ASR framework.☆11Dec 21, 2020Updated 5 years ago
- Various algorithms for voice activity detection☆22Jan 31, 2017Updated 9 years ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech☆51Oct 8, 2021Updated 4 years ago
- This project attempts to build neural network training and lightweighting cookbook including three kinds of lightweighting solutions, i.e…☆22Jul 20, 2022Updated 3 years ago
- Prosodic Speech Segmentation with Transformers☆26Feb 25, 2024Updated 2 years ago
- Tensor2tensor experiment with SpecAugment☆46May 13, 2019Updated 6 years ago
- The dataset with English, German and Spanish speech samples.☆21Jun 1, 2021Updated 4 years ago
- End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.☆10Jan 21, 2022Updated 4 years ago
- ☆16Jun 13, 2022Updated 3 years ago
- Official Code for SyllableLM: Learning Coarse Semantic Units for Speech Language Models☆61Jul 1, 2025Updated 8 months ago
- Pytorch implementation of "Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions", ICASSP, 2018.☆19Jan 21, 2021Updated 5 years ago
- Segment a given audio into utterances using a trained end-to-end ASR model.☆74Oct 9, 2020Updated 5 years ago
- A bunch of scripts exploiting several tools to perform inverse text normalization (ITN)☆21Sep 27, 2017Updated 8 years ago
- This repository contains the code for our upcoming paper An Investigation of End-to-End Models for Robust Speech Recognition at ICASSP 20…☆49Dec 25, 2024Updated last year
- Making Espnet easier to use☆54Apr 9, 2021Updated 4 years ago
- T5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech …☆28Nov 7, 2025Updated 4 months ago
- This repository contains the code and instructions needed to reproduce the dataset splits for out paper "Speech Translation for Code-Swit…☆28Apr 8, 2022Updated 3 years ago
- A toolkit for Spoken Language Understanding Evaluation (SLUE) benchmark. Refer paper https://arxiv.org/abs/2111.10367 for more details. O…☆66Feb 26, 2024Updated 2 years ago
- Review of papers I read☆14Dec 11, 2020Updated 5 years ago
- Minimal module for computing audio spectrograms☆15Feb 28, 2019Updated 7 years ago
- End-to-End Automatic Speech Recognition on PyTorch☆304Jun 2, 2022Updated 3 years ago
- Korean ASR Corpus generated from TEDx talks☆27Jan 11, 2019Updated 7 years ago
- This setup allows to train end-to-end neural models for spoken language understanding (SLU).☆24Jun 12, 2023Updated 2 years ago
- PyTorch Implementation of Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech Generation☆197Feb 10, 2022Updated 4 years ago
- PyTorch implementation of automatic speech recognition models.☆38Jan 10, 2021Updated 5 years ago
- baseline for MGTV competition 2022 PIR☆11Apr 11, 2022Updated 3 years ago
- Convert words to numbers☆21Apr 13, 2022Updated 3 years ago
- ☆10Dec 21, 2022Updated 3 years ago
- Clojure library for parsing and seamless working with native C structs/structured byte buffers☆14May 26, 2015Updated 10 years ago