CoEDL/vad-sli-asr

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/CoEDL/vad-sli-asr)

CoEDL / vad-sli-asr

A pipeline to isolate and transcribe one language in mixed-language speech

☆20

Alternatives and similar repositories for vad-sli-asr

Users that are interested in vad-sli-asr are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

miccio-dk / NISQA
View on GitHub
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
☆16Apr 13, 2022Updated 4 years ago
krylm / whisper-event-tuning
View on GitHub
Final training script from HuggingFace Whisper Fine tuning event - to get best results on finetuned model.
☆12Dec 24, 2022Updated 3 years ago
jasonppy / syllable-discovery
View on GitHub
Syllable Segmentation and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model
☆35Aug 27, 2023Updated 2 years ago
CoEDL / elan-helpers
View on GitHub
Tools and scripts for working with ELAN
☆10Aug 4, 2022Updated 3 years ago
Bartelds / neural-acoustic-distance
View on GitHub
Code associated with the paper: Neural Representations for Modeling Variation in Speech.
☆18Mar 10, 2022Updated 4 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
Speech-Lab-IITM / Hindi-ASR-Challenge
View on GitHub
🎯 Speech Recognition Challenge by Speech Lab - IIT Madras
☆10Nov 5, 2020Updated 5 years ago
sarahjuan / iban
View on GitHub
☆14Jun 12, 2015Updated 11 years ago
neulab / AfricanVoices
View on GitHub
Hosts text-to-speech corpus and speech synthesizers for African languages.
☆19May 31, 2023Updated 3 years ago
ReadAlongs / SoundSwallower
View on GitHub
An even smaller speech recognizer / force aligner
☆36May 5, 2026Updated 2 months ago
KathyReid / cvaccents
View on GitHub
A set of tools for working with accent data in Mozilla's Common Voice dataset
☆14Nov 3, 2023Updated 2 years ago
PyThaiNLP / thai-g2p-wiktionary-corpus
View on GitHub
Thai Grapheme to Phoneme (G2P) Wiktionary Corpus
☆13Jul 25, 2022Updated 4 years ago
resemble-ai / normalise
View on GitHub
A module for normalising text.
☆10Nov 6, 2019Updated 6 years ago
IndoNLP / nusa-catalogue
View on GitHub
Dataset Catalogue Homepage for Indonesian Languages
☆12Feb 19, 2024Updated 2 years ago
Bartelds / acoustic-distance-measure
View on GitHub
Acoustic distance measure for comparing pronunciations
☆17Aug 2, 2022Updated 3 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
EveryVoiceTTS / EveryVoice
View on GitHub
The EveryVoice TTS Toolkit - Text To Speech for your language
☆43Updated this week
AI4Bharat / IndicWav2Vec
View on GitHub
Pretraining, fine-tuning and evaluation scripts for Indic-Wav2Vec2
☆117Aug 28, 2025Updated 11 months ago
RichardLitt / stranded-by-trump
View on GitHub
Helping travelers stranded by Trump
☆11Oct 5, 2022Updated 3 years ago
City-of-Helsinki / django-munigeo
View on GitHub
Reusable Django application for storing and accessing municipality-related geospatial data
☆14Jun 5, 2026Updated last month
coqui-ai / data-checker
View on GitHub
🫠 check your data, before you wreck your model
☆16Aug 11, 2022Updated 3 years ago
alphacep / unimrcp-vosk-plugin
View on GitHub
Open source cross-platform implementation of MRCP protocol
☆20Mar 3, 2022Updated 4 years ago
roedoejet / convertextract
View on GitHub
Extract and find/replace text based on arbitrary correspondences while preserving original file formatting. This library is a fork from t…
☆11Sep 8, 2023Updated 2 years ago
barneyhill / minBERT
View on GitHub
A minimal PyTorch implementation of BERT (Bidirectional Encoder Representations from Transformers)
☆12Mar 20, 2023Updated 3 years ago
adlnlp / form_nlu
View on GitHub
☆19Nov 1, 2024Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
dvianna / LegalQA-bloomz-560m
View on GitHub
Finetuning a small BLOOMZ model (bloomz-560m) on a small dataset and with limited resources.
☆18May 10, 2023Updated 3 years ago
marisademeglio / media-overlays-js
View on GitHub
EPUB Media Overlays javascript implementation
☆14Aug 19, 2016Updated 9 years ago
perfall / Edyson
View on GitHub
Flask-based web framework for visualisation and explorative listening of audio.
☆55May 1, 2023Updated 3 years ago
techiaith / docker-huggingface-stt-cy
View on GitHub
Adnabod lleferydd Cymraeg i'r Gymraeg gyda HuggingFace // Speech Recognition for Welsh with HuggingFace
☆13Nov 29, 2022Updated 3 years ago
NRC-ILT / g2p
View on GitHub
Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p!
☆203Updated this week
dhdaines / paves
View on GitHub
Bajo los adoquines, la PLAYA 🏖️
☆17Jul 3, 2026Updated 3 weeks ago
mainlp / germanic-lrl-corpora
View on GitHub
Overview of corpora/datasets for Germanic low-resource languages and dialects. Accompanies "A Survey of Corpora for Germanic Low-Resource…
☆28Feb 16, 2026Updated 5 months ago
watsonbox / sphinxtrain-ruby
View on GitHub
Toolkit for training/adapting CMU Sphinx acoustic models
☆17May 25, 2018Updated 8 years ago
EricWilbanks / faseAlign
View on GitHub
Command line tool for forced-alignment of Spanish speech data
☆13Dec 31, 2025Updated 6 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
georgian-io / Knowledge-Distillation-Toolkit
View on GitHub
[DEPRECATED] A knowledge distillation toolkit based on PyTorch and PyTorch Lightning.
☆138Feb 20, 2024Updated 2 years ago
RiTA-nlp / ITALIC
View on GitHub
ITALIC: An ITALian Intent Classification Dataset
☆14Nov 24, 2023Updated 2 years ago
RegNLP / ObliQADataset
View on GitHub
☆18Dec 8, 2024Updated last year
ReadAlongs / Studio-Web
View on GitHub
Suite of web packages for creating interactive ReadAlongs
☆17Updated this week
ice-lab / site-v2
View on GitHub
ice.js 2 官网&文档
☆10Nov 17, 2022Updated 3 years ago
Dustyposa / rasa-demo
View on GitHub
Sara - the Rasa Demo Bot: An example of a contextual AI assistant built with the open source Rasa Stack
☆11Jan 14, 2021Updated 5 years ago
vinary-tree / liblevenshtein-coffeescript
View on GitHub
Various utilities regarding Levenshtein transducers. (CoffeeScript / JavaScript / Node.js)
☆13Jun 20, 2016Updated 10 years ago