nethermanpro/ComSL

☆11

Alternatives and similar repositories for ComSL

Users who are interested in ComSL are comparing it to the libraries listed below.

ictnlp / BT4ST
Code for ACL 2023 main conference paper "Back Translation for Speech-to-text Translation Without Transcripts".
☆11 · Oct 25, 2023 · Updated 2 years ago
ictnlp / ComSpeech
Code for ACL 2024 main conference paper "Can We Achieve High-quality Direct Speech-to-Speech Translation Without Parallel Speech Data?".
☆27 · Jul 2, 2024 · Updated 2 years ago
ictnlp / CMOT
Code for ACL 2023 main conference paper "CMOT: Cross-modal Mixup via Optimal Transport for Speech Translation"
☆17 · Oct 29, 2024 · Updated last year
ReneeYe / XSTNet
This is an implementation of the paper "End-to-end Speech Translation via Cross-modal Progressive Training" (Interspeech 2021)
☆18 · May 1, 2022 · Updated 4 years ago
csukuangfj / kaldi_native_io
Python wrapper for Kaldi's native I/O
☆27 · Jan 9, 2025 · Updated last year
openaudiolab / LLaST
LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models
☆26 · Aug 11, 2024 · Updated last year
dqqcasia / mosst
☆27 · Aug 31, 2022 · Updated 3 years ago
arnabdas8901 / StarGAN-VC_PlusPlus
☆11 · Aug 11, 2023 · Updated 2 years ago
juhayna-zh / BSRNN-speech-preprocess
A solution for denoising and separating two-speaker mixed noisy speech, using a BSRNN-inspired network.
☆15 · Aug 22, 2023 · Updated 2 years ago
choijeongsoo / utut
[TASLP 2024] Textless Unit-to-Unit training for Many-to-Many Multilingual Speech-to-Speech Translation
☆31 · Sep 6, 2024 · Updated last year
yazone / g2pE_mobile
G2P for English TTS
☆19 · Nov 10, 2022 · Updated 3 years ago
AGENDD / RWKV-ASR
This repo is an exploratory experiment to enable frozen pretrained RWKV language models to accept speech modality input. We followed the …
☆54 · Dec 23, 2024 · Updated last year
agrija9 / Avalinguo-Audio-Set
Avalinguo Audio Dataset: Dataset for Speaker Fluency Level Classification
☆13 · Aug 13, 2018 · Updated 7 years ago
WangHelin1997 / LibriLightMix-WHAMR
Python scripts to create noisy and reverberant 2-speaker mixture audio with Libri-Light and WHAM
☆17 · Nov 7, 2024 · Updated last year
Glaciohound / Chimera-ST
A PyTorch implementation of the paper "Learning Shared Semantic Space for Speech-to-Text Translation", ACL (Findings) 2021
☆47 · Feb 21, 2022 · Updated 4 years ago
The-Data-Dilemma / MediBeng-Whisper-Tiny
MediBeng Whisper Tiny improves doctor-patient transcription by training the Whisper Tiny model to translate mixed Bengali-English speech…
☆29 · Jul 24, 2025 · Updated 11 months ago
tuanio / nextformer
PyTorch implementation of "Nextformer: A ConvNeXt Augmented Conformer For End-To-End Speech Recognition"
☆10 · Dec 15, 2022 · Updated 3 years ago
nervjack2 / Speech2Unit
☆13 · Sep 25, 2024 · Updated last year
xuchennlp / S2T
The project for speech translation
☆12 · Sep 28, 2023 · Updated 2 years ago
TeaPoly / warp-ctc-crf
An extension of thu-spmi/CAT which contains a full-fledged implementation of CTC-CRF for TensorFlow.
☆12 · Jul 5, 2021 · Updated 5 years ago
ZQuang2202 / Zipformer_Lightning
An upgraded framework for training and validation, compared with icefall, using Lightning.
☆16 · Mar 26, 2025 · Updated last year
csalt-research / accented-codebooks-asr
☆19 · Sep 10, 2024 · Updated last year
rasbt / b3-basic-batchsize-benchmark
Experiments for the blog post "No, We Don't Have to Choose Batch Sizes As Powers Of 2"
☆20 · Jul 5, 2022 · Updated 4 years ago
csukuangfj / kaldi-hmm-gmm
☆28 · Apr 24, 2026 · Updated 2 months ago
GuillaumeBriffoteaux / pySBO
Python platform for parallel Surrogate-Based Optimization
☆12 · Nov 27, 2024 · Updated last year
jules-leguy / BBOMol
Surrogate-based black-box optimization method for molecular properties
☆13 · Oct 22, 2022 · Updated 3 years ago
Takaaki-Saeki / ssl_speech_restoration_v2
☆17 · Dec 18, 2023 · Updated 2 years ago
yxduir / LLM-SRT
☆28 · Mar 11, 2026 · Updated 4 months ago
uthree / ddsp-vocoder
☆12 · Nov 7, 2024 · Updated last year
JusperLee / Look2hear
A toolkit for researchers in multimodal sound separation.
☆16 · Oct 20, 2023 · Updated 2 years ago
cyhuang-tw / robust-vc
☆11 · May 7, 2022 · Updated 4 years ago
cpii-cai / PunCantonese
A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts
☆15 · Dec 3, 2024 · Updated last year
shengcanxu / canoSpeech
text to speech
☆10 · Mar 19, 2024 · Updated 2 years ago
wawabinger / Awesome-Audio-Watermarking
🔥UP-TO-DATE audio watermarking techniques🔥
☆15 · Feb 8, 2026 · Updated 5 months ago
titu1994 / warprnnt_numba
WarpRNNT loss ported in Numba CPU/CUDA for PyTorch
☆17 · Mar 11, 2022 · Updated 4 years ago
Beilong-Tang / TSELM
Official Implementation of TSELM: Target speaker extraction using discrete tokens and language models
☆60 · Apr 14, 2025 · Updated last year
ImPavloh / WhiTTsper-The-Lora
Demo combining Whisper for speech recognition and Google TTS for speech synthesis to interact with Alpaca-LoRA.
☆20 · Apr 30, 2024 · Updated 2 years ago
MegEngine / End-to-end-ASR-Transformer
An end-to-end ASR Transformer model training repo
☆13 · Dec 8, 2021 · Updated 4 years ago
tzyll / ChineseHP
Dataset for Pinyin Regularization in Error Correction for Chinese Speech Recognition with Large Language Models in Interspeech 2024.
☆16 · Jul 4, 2024 · Updated 2 years ago