klintan/swedish-asr-dataset

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/klintan/swedish-asr-dataset)

klintan / swedish-asr-dataset

Jupyter Notebooks for creating Speech datasets

☆46

Alternatives and similar repositories for swedish-asr-dataset

Users that are interested in swedish-asr-dataset are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ainy / shershe
View on GitHub
Speech recognition dataset based on russian audiobook, sentance-level split
☆18Oct 6, 2018Updated 7 years ago
MycroftAI / pylisten
View on GitHub
A simple pyaudio microphone interface
☆11Jul 27, 2018Updated 7 years ago
ozdefir / finetuneas
View on GitHub
An HTML interface for finetuning the sync map output from aeneas
☆53Jul 5, 2022Updated 4 years ago
senior-sigan / denoise-autoencoder
View on GitHub
Denoise audio with convolutional autoencoder
☆16Nov 19, 2017Updated 8 years ago
pengzhendong / ngram-punctuator
View on GitHub
An N-gram punctuator for Chinese and English.
☆18Oct 14, 2025Updated 9 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
shinhyeokoh / rwen
View on GitHub
☆14Jun 16, 2023Updated 3 years ago
sajadalipour7 / Persian-Grapheme-To-Phoneme-With-Transformer
View on GitHub
Persian Grapheme To Phoneme with Transformer in Pytorch
☆11Sep 21, 2023Updated 2 years ago
tabahi / contexless-phonemes-CUPE
View on GitHub
pytorch model for contexless-phoneme prediction from speech audio
☆32Oct 30, 2025Updated 8 months ago
OlaWod / PitchVC
View on GitHub
PitchVC: Pitch Conditioned Any-to-Many Voice Conversion
☆35Jun 6, 2024Updated 2 years ago
ZehuaKcrissLi / GTR-Voice
View on GitHub
☆16Nov 11, 2024Updated last year
lars76 / fastspeech2-clean
View on GitHub
Clean and modernized implementation of FastSpeech2/LightSpeech using IPA
☆18Aug 16, 2024Updated last year
Idlak / Living-Audio-Dataset
View on GitHub
A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone …
☆43Aug 3, 2022Updated 3 years ago
gheyret / UQSpeechDataset
View on GitHub
Uyghur Single Speaker Speech Dataset. ウイグル語音声データセット
☆35Apr 3, 2022Updated 4 years ago
kylerbrown / textgrid
View on GitHub
simple textgrid to csv converter
☆27Jul 29, 2021Updated 4 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
akhil2495 / multi-modal-emotion-recognition
View on GitHub
A repository for emotion recognition from speech, text and mocap data from IEMOCAP dataset
☆13Dec 12, 2018Updated 7 years ago
ErikEkstedt / conv_ssl
View on GitHub
☆14Feb 9, 2023Updated 3 years ago
PINTO0309 / onnx-aec
View on GitHub
A playground for experimenting with acoustic echo cancellation using a microphone, speaker, and ONNX.
☆13Oct 22, 2024Updated last year
WhissleAI / PromptingNemo
View on GitHub
All-in-one Speech Transcription
☆11Jun 5, 2026Updated last month
liuhuang31 / g2pw_once
View on GitHub
G2pw's inference speed is accelerated by about 8-10 times. Change loop generated predictive data to only once and model loop prediction b…
☆14Dec 30, 2023Updated 2 years ago
backspacetg / distilXLSR
View on GitHub
Models and codes for INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model
☆13Mar 30, 2025Updated last year
pkufool / simple-wer
View on GitHub
A simple command line tool to calculate WER for ASR.
☆14Oct 14, 2024Updated last year
ggeop / DataDialogueLLM
View on GitHub
Data Dialogue enables natural language querying of databases by integrating LLMs with SQL databases.
☆15May 3, 2025Updated last year
kaistmm / AdaptVC
View on GitHub
☆17Jun 2, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
ZQuang2202 / Zipformer_Lightning
View on GitHub
An upgrade framework for train and validate compare with icefall using Lightning.
☆16Mar 26, 2025Updated last year
bookbot-hive / k2-indonesian-asr
View on GitHub
Indonesian speech/phoneme recognizer powered by Kaldi 2.0 (lhotse, icefall, sherpa).
☆16Jun 30, 2023Updated 3 years ago
hutomadotAI / Hutoma-Conversational-AI-Platform
View on GitHub
Hu:toma AI is an open source stack designed to help you create compelling conversational interfaces with little effort and above industry…
☆38Sep 11, 2019Updated 6 years ago
kamilakesbi / DiarizersLM
View on GitHub
☆15Jul 16, 2024Updated 2 years ago
MichaelMoroz / ShaderToy2CPP
View on GitHub
a close enough approximation of the shadertoy framework
☆12Jul 2, 2020Updated 6 years ago
piperandrew / textMiningR
View on GitHub
This is a library of R scripts for the large-scale analysis of texts.
☆14Jun 20, 2026Updated last month
mush42 / istft-onnx
View on GitHub
Export an ONNX graph that performs ISTFT. Designed for TTS models.
☆28Apr 23, 2024Updated 2 years ago
zhu-han / SpeechLLM
View on GitHub
LLM-based ASR recipe with Zipformer encoder and Qwen LLM
☆35Sep 25, 2025Updated 10 months ago
L-A-Sandhu / Physics-Informed-Vectors-For-Wind-Speed-Prediction
View on GitHub
Official Implementation of Integrating Physics-Informed Vectors for Improved Wind Speed Forecasting with Neural Networks
☆13Jul 2, 2026Updated 3 weeks ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
af-ai-center / bert
View on GitHub
Code and Swedish pre-trained models for BERT
☆12Feb 5, 2020Updated 6 years ago
georgepar / kaldi-docker
View on GitHub
Build kaldi inside docker containers with option for CUDA support
☆12Feb 6, 2017Updated 9 years ago
mipuc / hts-engine-world
View on GitHub
☆17Nov 17, 2020Updated 5 years ago
audio-captioning / caption-evaluation-tools
View on GitHub
Tools for the evaluation of audio captioning.
☆19May 23, 2020Updated 6 years ago
kastnerkyle / raw_voice_cleanup
View on GitHub
Examples of cleaning up raw voices
☆18Mar 2, 2022Updated 4 years ago
kingabzpro / WOLOF-ASR-Wav2Vec2
View on GitHub
Audio Preprocessing and finetuning of wav2vec2-large-xlsr model on AI4D Baamtu Datamation - Automatic Speech Recognition in WOLOF Data.
☆18Nov 13, 2021Updated 4 years ago
pengzhendong / streaming-ChatTTS
View on GitHub
☆23Oct 30, 2024Updated last year