ErikEkstedt / conv_ssl
☆14Feb 9, 2023Updated 3 years ago
Alternatives and similar repositories for conv_ssl
Users that are interested in conv_ssl are comparing it to the libraries listed below
- Datasets for turn-taking research☆17Dec 21, 2023Updated 2 years ago
- vad☆25Apr 3, 2023Updated 2 years ago
- Voice Activity Projection Models: Self-supervised learning of Turn-taking Events☆93May 29, 2024Updated last year
- Using YouTube to prepare a speech recognition dataset for any language☆10Mar 30, 2021Updated 4 years ago
- Code for "Error-driven Fixed-Budget ASR Personalization for Accented Speakers" in ICASSP 2021☆11Jun 13, 2021Updated 4 years ago
- ☆11Nov 5, 2021Updated 4 years ago
- steps to perform text-based speaker diarization with kaldi toolkit☆12Nov 2, 2018Updated 7 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- StyleTTS2 + Vocos as a Decoder☆13Mar 24, 2025Updated 10 months ago
- ☆17Apr 28, 2021Updated 4 years ago
- Deepspeech ASR Model for the Catalan Language☆17Feb 15, 2021Updated 5 years ago
- Source code and speech samples for the DSU-AVO paper accepted to INTERSPEECH 2023☆12May 13, 2024Updated last year
- This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core☆15Jun 19, 2023Updated 2 years ago
- Text-to-Speech conversor for Basque and Spanish. It includes linguistic processing and built voices for the languages aforementioned. Its…☆17Jan 15, 2026Updated last month
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…☆17Mar 6, 2023Updated 2 years ago
- ☆18Mar 13, 2024Updated last year
- Pre-training Cross-modal Transformer for Audio-and-Language Representations☆38Apr 20, 2021Updated 4 years ago
- Many ASRs under one roof. With Benchmarking... answering the question. What is the best ASR for my dataset?☆19Oct 5, 2022Updated 3 years ago
- ☆17Mar 1, 2024Updated last year
- Deploy Kaldi models using grpc for bidirectional streaming.☆17Sep 30, 2024Updated last year
- ASR text preprocessing utility☆21Aug 5, 2024Updated last year
- TMT: Tri-Modal Translation between Speech, Image, and Text by Processing Different Modalities as Different Languages☆18May 23, 2024Updated last year
- Source code of paper <End-to-End Language Diarization for Bilingual Code-switching Speech>☆19Jan 23, 2022Updated 4 years ago
- A handy dataset of noises for ASR☆22May 29, 2019Updated 6 years ago
- ☆16Oct 7, 2022Updated 3 years ago
- BurrMill core☆22Nov 2, 2021Updated 4 years ago
- Code repository for the paper "Improving End-to-End SLU performance with Prosodic Attention and Distillation" accepted at Interspeech 202…☆27May 17, 2023Updated 2 years ago
- ☆49Nov 24, 2022Updated 3 years ago
- ☆23Jan 6, 2023Updated 3 years ago
- Official source for Catalan Language Models and resources made within Aina project.☆26Jul 28, 2023Updated 2 years ago
- Voice100 includes neural TTS/ASR models. Inference of Voice100 is low cost as its models are tiny and only depend on CNN without autoregr…☆28Nov 23, 2023Updated 2 years ago
- A free & open tool for transcribing audio interviews with offline ASR support☆25Dec 21, 2023Updated 2 years ago
- Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recogni…☆23Aug 16, 2021Updated 4 years ago
- Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-gramma…☆21Jan 24, 2022Updated 4 years ago
- Code for paper titled "Using generative modelling to produce varied intonation for speech synthesis" submitted to the Speech Synthesis Wo…☆24Dec 8, 2019Updated 6 years ago
- TurnGPT: a Transformer-based Language Model for Predicting Turn-taking in Spoken Dialog☆64May 18, 2024Updated last year
- ☆32Dec 23, 2025Updated last month
- This repository is the official implementation of unimodal aggregation (UMA) for automatic speech recognition (ASR).☆36Dec 17, 2024Updated last year
- An echo cancellation library for browsers using DTLN-aec☆26Oct 18, 2023Updated 2 years ago