mtreviso / deepbondLinks

Deep neural approach to Boundary and Disfluency Detection - Based on my Master's work

☆19

Alternatives and similar repositories for deepbond

Users that are interested in deepbond are comparing it to the libraries listed below

Sorting:

Prem-kumar27 / Fast-KTSpeechCrawler
Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler
☆24Updated 4 years ago
cyfer0618 / kaldi-pytorch-rnnlm
Enable RNNLM lattice rescoring with Pytorch [kaldi]
☆12Updated 5 years ago
pariajm / e2e-asr-and-disfluency-removal-evaluator
A new metric for evaluating end-to-end speech recognition and disfluency removal systems
☆19Updated 4 years ago
lingjzhu / spoken_sent_embedding
Unsupervised spoken sentence embeddings
☆14Updated 2 years ago
pariajm / joint-disfluency-detector-and-parser
Improving Disfluency Detection by Self-Training a Self-Attentive Model
☆47Updated 4 years ago
MiuLab / Lattice-Transformer-SLU
Source code for ASRU 2019 paper "Adapting Pretrained Transformer to Lattices for Spoken Language Understanding"
☆11Updated 4 years ago
m-wiesner / nnet_pytorch
Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi.
☆26Updated 11 months ago
ondrejklejch / acoustic_punctuation
NMT based punctuation prediction system using lexical and acoustic features .
☆14Updated 5 years ago
mattiadg / FBK-Fairseq-ST
An adaptation of Fairseq to (End-to-end) speech translation.
☆21Updated 3 years ago
frozentoad9 / CMST
Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages
☆13Updated 2 years ago
sonos / spoken-language-understanding-research-datasets
☆49Updated 3 years ago
qiujiali / lattice_rnn
Bi-directional Lattice Recurrent Neural Networks for Confidence Estimation
☆16Updated 4 years ago
Sreyan88 / Disfluency-Detection-with-Span-Classification
This repository contains the implementation of the paper: "Span Classification with Structured Information for Disfluency Detection in Sp…
☆13Updated 2 years ago
wbengine / SPMILM
☆18Updated 8 years ago
amazon-science / proteno
This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to…
☆45Updated 4 years ago
skit-ai / slu-prosody
Code repository for the paper "Improving End-to-End SLU performance with Prosodic Attention and Distillation" accepted at Interspeech 202…
☆26Updated 2 years ago
kamperh / globalphone_awe
Multilingual acoustic word embedding approaches applied and evaluated on GlobalPhone data.
☆11Updated 4 years ago
Nathan-Roll1 / PSST
Prosodic Speech Segmentation with Transformers
☆25Updated last year
roholazandie / ryan-tts
☆18Updated 3 years ago
sigmorphon / 2020
SIGMORPHON 2020 Shared Task: Grapheme-to-Phoneme, Unsupervised Induction of Morphology, and Typologically Diverse Morphological Inflectio…
☆36Updated 2 months ago
Chia-Hsuan-Lee / Spoken-SQuAD
A spoken question answering dataset on SQUAD
☆49Updated last month
MiuLab / SpokenVec
Learning ASR-Robust Contextualized Embeddings for Spoken Language Understanding
☆24Updated 2 years ago
xinjli / asr2k
asr2k
☆50Updated last year
tongjinle123 / speech-transformer-pytorch_lightning
ASR project with pytorch-lightning
☆20Updated 3 months ago
getalp / mass-dataset
MaSS - Multilingual corpus of Sentence-aligned Spoken utterances
☆50Updated 9 months ago
alicank / Translation-Augmented-LibriSpeech-Corpus
Large scale (>200h) and publicly available read audio book corpus. This corpus is an augmentation of LibriSpeech ASR Corpus (1000h) and c…
☆44Updated 2 years ago
xinjli / phonepiece
phone inventory library
☆16Updated 2 years ago
alefiury / SE-R-2022-SER-Track
Code for the winning solution in the SE&R 2022 Challenge - SER track.
☆15Updated 2 years ago
farisalasmary / deepspeech2-online-decoder
Online (real-time) decoder to be used with DeepSpeech2 model
☆25Updated 5 years ago
Observeai-Research / Phoneme-BERT
☆34Updated 4 years ago