may- / joeys2t

Minimalist Speech-to-Text toolkit for educational purposes

☆12

Alternatives and similar repositories for joeys2t

Users who are interested in joeys2t are comparing it to the libraries listed below.

frozentoad9 / CMST
Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages
☆13 Updated 2 years ago
ashi-ta / speechGLUE
SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.
☆13 Updated 2 years ago
ErikEkstedt / conv_ssl
☆14 Updated 2 years ago
ErikEkstedt / datasets_turntaking
Datasets for turn-taking research
☆13 Updated last year
Sreyan88 / Disfluency-Detection-with-Span-Classification
This repository contains the implementation of the paper: "Span Classification with Structured Information for Disfluency Detection in Sp…
☆13 Updated 2 years ago
neulab / newlang-tech
A guide to building language technology in new languages.
☆58 Updated 3 years ago
jasonppy / syllable-discovery
Syllable Segmentation and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model
☆32 Updated last year
voidful / SpeechMix
Explore different ways to mix speech models (wav2vec2, HuBERT) and NLP models (BART, T5, GPT) together
☆47 Updated 2 years ago
patrickvonplaten / Wav2Vec2_ParlanceCTCDecode
☆11 Updated 3 years ago
openaudiolab / LLaST
LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models
☆25 Updated 10 months ago
ErikEkstedt / vap_turn_taking
vad
☆18 Updated 2 years ago
skit-ai / slu-prosody
Code repository for the paper "Improving End-to-End SLU performance with Prosodic Attention and Distillation" accepted at Interspeech 202…
☆26 Updated 2 years ago
vectominist / MiniASR
A mini, simple, and fast end-to-end automatic speech recognition toolkit.
☆54 Updated 2 years ago
kingabzpro / WOLOF-ASR-Wav2Vec2
Audio Preprocessing and finetuning of wav2vec2-large-xlsr model on AI4D Baamtu Datamation - Automatic Speech Recognition in WOLOF Data.
☆17 Updated 3 years ago
xinjli / asr2k
asr2k
☆50 Updated last year
xinjli / phonepiece
phone inventory library
☆16 Updated 2 years ago
hlt-mt / FBK-fairseq
Repository containing the open source code of works published at the FBK MT unit.
☆46 Updated last week
EveryVoiceTTS / EveryVoice
The EveryVoice TTS Toolkit - Text To Speech for your language
☆36 Updated this week
slp-rl / SLM-Discrete-Representations
This repo contains the official PyTorch implementation of "Analyzing Discrete Self Supervised Speech Representation For Spoken Language M…
☆19 Updated 2 years ago
Observeai-Research / Phoneme-BERT
☆34 Updated 4 years ago
pariajm / e2e-asr-and-disfluency-removal-evaluator
A new metric for evaluating end-to-end speech recognition and disfluency removal systems
☆19 Updated 4 years ago
TehreemFarooqi / Preparing-a-speech-recognition-dataset-using-YouTube-videos
Using YouTube to prepare a speech recognition dataset for any language
☆10 Updated 4 years ago
CohenPr-XPF / XPF
☆36 Updated last year
besacier / ASR2022
☆56 Updated 2 years ago
mattroddy / lstm_turn_taking_prediction
☆21 Updated 6 years ago
pariajm / deep-disfluency-detector
Disfluency Detection using Auto-Correlational Neural Networks
☆44 Updated 4 years ago
Sosdatasets / SoS_Dataset
☆11 Updated 11 months ago
zouharvi / pwesuite
Suite for phonetic word embeddings, especially their evaluation and baseline models.
☆29 Updated 3 months ago
wiebket / bt4vt
Bias Tests for Voice Technologies (bt4vt)
☆12 Updated last year
cldf / segments
Unicode Standard tokenization routines and orthography profile segmentation
☆37 Updated 4 months ago