facebookresearch / speech_translationLinks

Demo and samples for universal speech translator

☆24

Alternatives and similar repositories for speech_translation

Users that are interested in speech_translation are comparing it to the libraries listed below

Sorting:

MiuLab / SpokenVec
Learning ASR-Robust Contextualized Embeddings for Spoken Language Understanding
☆24Updated 2 years ago
farisalasmary / deepspeech2-online-decoder
Online (real-time) decoder to be used with DeepSpeech2 model
☆25Updated 5 years ago
Deepest-Project / Transformer-TTS
Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech"
☆64Updated 2 years ago
alicank / Translation-Augmented-LibriSpeech-Corpus
Large scale (>200h) and publicly available read audio book corpus. This corpus is an augmentation of LibriSpeech ASR Corpus (1000h) and c…
☆44Updated 3 years ago
sigmorphon / 2020
SIGMORPHON 2020 Shared Task: Grapheme-to-Phoneme, Unsupervised Induction of Morphology, and Typologically Diverse Morphological Inflectio…
☆36Updated 3 months ago
sooftware / lightning-asr
Modular and extensible speech recognition library leveraging pytorch-lightning and hydra.
☆46Updated 4 years ago
jefflai108 / Semi-Supervsied-Spoken-Language-Understanding-PyTorch
Semi-supervised spoken language understanding (SLU) via self-supervised speech and language model pretraining
☆12Updated 4 years ago
lingjzhu / spoken_sent_embedding
Unsupervised spoken sentence embeddings
☆14Updated 2 years ago
TanUkkii007 / papers-i-read
☆23Updated 7 years ago
derejetabzaw / emotivespeech
Emotive Speech generation based on DAVID: An open-source platform for real-time emotional speech transformation using pysox
☆13Updated 7 years ago
amazon-science / proteno
This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to…
☆45Updated 4 years ago
hfutami / distill-bert-for-seq2seq-asr
☆24Updated 5 years ago
tts-tutorial / ijcai2021
☆12Updated 2 years ago
idiap / inv-tn
A bunch of scripts exploiting several tools to perform inverse text normalization (ITN)
☆21Updated 7 years ago
Edresson / SC-GlowTTS
SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model
☆107Updated 3 years ago
wq2012 / CurriculumVitae
Curriculum Vitae of Quan Wang
☆15Updated last month
xcmyz / Transformer-TTS
TTS model based on Transformer.
☆58Updated 6 years ago
Chia-Hsuan-Lee / Spoken-SQuAD
A spoken question answering dataset on SQUAD
☆49Updated 3 months ago
CiscoDevNet / g2p_seq2seq_pytorch
Grapheme to phoneme model for PyTorch
☆41Updated 3 years ago
zerospeech / zerospeech2021_baseline
BERT and LSTM baseline models of the ZeroSpeech Challenge 2021
☆60Updated 2 years ago
NVIDIA / speechsquad
Conversational AI Benchmark.
☆68Updated 2 years ago
asappresearch / sew
☆76Updated 3 years ago
khiajohnson / SpiCE-Corpus
An open-access corpus of conversational bilingual speech in Cantonese and English
☆40Updated 3 years ago
m-wiesner / nnet_pytorch
Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi.
☆26Updated last year
RayeRen / unsuper_tts_asr
Audio samples from ICML2019 "Almost Unsupervised Text to Speech and Automatic Speech Recognition"
☆17Updated 6 years ago
getalp / mass-dataset
MaSS - Multilingual corpus of Sentence-aligned Spoken utterances
☆50Updated 10 months ago
for-github-backup / deprecated.github.io
☆57Updated 3 years ago
sonos / spoken-language-understanding-research-datasets
☆49Updated 3 years ago
asappresearch / wav2seq
Official code for Wav2Seq
☆95Updated 3 years ago
karanmakhija867 / bert_punct
Punctuation restoration in ASR text
☆33Updated 6 years ago