laboroai / TEDxJP-10K
☆18Updated 4 years ago
Alternatives and similar repositories for TEDxJP-10K:
Users that are interested in TEDxJP-10K are comparing it to the libraries listed below
- ☆86Updated 4 years ago
- End-to-end MOdeling of ASR (Automatic Speech Recognition)☆33Updated 2 years ago
- CMU multilingual speech repository☆31Updated 2 years ago
- multilingual speech aligner☆73Updated last year
- Coco-Nut (Corpus of connecting NIHONGO utterance and text) corpus☆21Updated 9 months ago
- context labels and pronunciation data for JSUT corpus☆68Updated 3 years ago
- ☆16Updated last year
- ☆32Updated 2 years ago
- ASR text preprocessing utility☆21Updated 8 months ago
- ☆34Updated 3 years ago
- ☆38Updated 3 years ago
- A repository of Japanese Phoneme-Level BERT☆22Updated last year
- ☆21Updated 7 months ago
- Implementation of vocoders empowered with pytorch lightning☆17Updated last year
- ☆25Updated 8 months ago
- SHAS: Approaching optimal Segmentation for End-to-End Speech Translation☆38Updated 2 years ago
- HTS-style full-context labels for JSUT v1.1☆46Updated 3 years ago
- 22人で童謡を5曲ずつ歌ってつくった歌唱データベースです。☆13Updated 2 years ago
- Data and code related to the ICASSP submission "A comparison of methods for OOV-word recognition"☆17Updated 3 years ago
- Code for T5lephone: Bridging Speech and Text Self-supervised Models for Spoken Language Understanding via Phoneme level T5☆19Updated 2 years ago
- Official implementation of MelHuBERT☆65Updated 5 months ago
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆20Updated last year
- Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc…☆75Updated last year
- ☆16Updated last year
- ☆31Updated last year
- ☆47Updated 3 months ago
- Implementation of the contextual biasing for ASR decoding on GPUs without lattice generation. The code supports submission to Interspeech…☆19Updated last year
- Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSI…☆21Updated 7 months ago
- ☆13Updated 2 years ago
- A fast parallel PyTorch implementation of the "CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition" https://arxiv.org/ab…☆33Updated last year