andybi7676/reborn-uasr

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/andybi7676/reborn-uasr)

andybi7676 / reborn-uasr

REBORN: Reinforcement-Learned Boundary Segmentation with Iterative Training for Unsupervised ASR

☆15

Alternatives and similar repositories for reborn-uasr

Users that are interested in reborn-uasr are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

voidful / MMLM
View on GitHub
Toward Multi Modality Language Model - implementation of GPT-4o/Project Astra
☆16Dec 10, 2024Updated last year
chorowski-lab / hCPC
View on GitHub
Implementation of multi-level Contrastive Predictive Coding (CPC) methods
☆20Jan 12, 2023Updated 3 years ago
voidful / nlp2
View on GitHub
⚙️Tool for NLP - handle file and text
☆15Feb 16, 2025Updated last year
MuyangDu / T5Voice
View on GitHub
T5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech …
☆28Nov 7, 2025Updated 8 months ago
kgnlp / allophant
View on GitHub
A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories.
☆30Mar 14, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
yoongi43 / VRVQ
View on GitHub
Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"
☆11Apr 10, 2025Updated last year
SonyResearch / VRVQ
View on GitHub
Variable Bitrate Residual Vector Quantization for Audio Coding
☆53May 1, 2025Updated last year
SesameAILabs / silentcipher
View on GitHub
☆21Mar 17, 2025Updated last year
erogol / RSOM
View on GitHub
Rectifying Self Organizing Map
☆29Oct 7, 2024Updated last year
najeebkhan / text-to-speech-synthesis
View on GitHub
Hidden Markov model based text to speech synthesis system similar to HTS implemented in C#
☆11Dec 16, 2016Updated 9 years ago
IDEA-XL / SubgDiff
View on GitHub
The official implementation of NeurIPS2024 paper "SubgDiff: A Subgraph Diffusion Model to Improve Molecular Representation Learning."
☆11May 28, 2025Updated last year
AlanBaade / SyllableLM
View on GitHub
Official Code for SyllableLM: Learning Coarse Semantic Units for Speech Language Models
☆63Jul 1, 2025Updated last year
Berkeley-Speech-Group / sylber
View on GitHub
Sylber: Syllabic Embedding Representation of Speech from Raw Audio
☆80Mar 17, 2025Updated last year
AMAAI-Lab / DART
View on GitHub
Demo for DART, Audio Imagination workshop submission in NeurIPS 2024
☆15Apr 22, 2026Updated 2 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
Lexsi-Labs / aligntune
View on GitHub
Aligntune : A Modular Toolkit for Post Training Alignment of LLMs
☆37Jul 8, 2026Updated last week
facebookresearch / stepdiff
View on GitHub
Data release for Step Differences in Instructional Video (CVPR24)
☆15Jun 19, 2024Updated 2 years ago
gyt1145028706 / XY-Tokenizer
View on GitHub
This is the code for paper: XY-Tokenizer: Mitigating the Semantic-Acoustic Conflict in Low-Bitrate Speech Codecs
☆94Sep 19, 2025Updated 9 months ago
HLTCHKUST / ASCEND
View on GitHub
ASCEND Chinese-English code-switching dataset
☆33Jul 12, 2022Updated 4 years ago
lucidrains / hippoformer
View on GitHub
Unofficial implementation of Hippoformer, Integrating Hippocampus-inspired Spatial Memory with Transformers
☆53Apr 28, 2026Updated 2 months ago
maxrmorrison / promonet
View on GitHub
Prosody and Pronunciation Modification Network
☆64May 5, 2025Updated last year
iamanigeeit / present
View on GitHub
☆14Aug 19, 2024Updated last year
youngsheen / GPST
View on GitHub
[ACL 2024] Generative Pre-Trained Speech Language Model with Efficient Hierarchical Transformer
☆70Nov 1, 2024Updated last year
dipika-singhania / ICC-Semi-Supervised-TAS
View on GitHub
Iterative Contrast-Classify For Semi-supervised Temporal Action Segmentation
☆11Jul 24, 2023Updated 2 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
maxrmorrison / torbi
View on GitHub
Viterbi decoding in PyTorch
☆42May 5, 2026Updated 2 months ago
haozhiwen-fighting / Contrast-enhanced-Ultrasound-for-Thyroid-Nodules-Diagnosis
View on GitHub
☆10Jun 6, 2024Updated 2 years ago
Audio-Foundation-Models / ConversationTTS
View on GitHub
☆101Jan 19, 2026Updated 5 months ago
YangXusheng-yxs / CodecFormer_5Hz
View on GitHub
☆35Oct 23, 2025Updated 8 months ago
zelaki / DisfluentFA
View on GitHub
A Weakly Supervised Forced Alignment for disluent speech
☆15Nov 12, 2023Updated 2 years ago
ZhangXinWhut / SimWhisper-Codec
View on GitHub
Official code for paper:"Speaking Clearly: A Simplified Whisper-Based Codec for Low-Bitrate Speech Coding"
☆37Jan 28, 2026Updated 5 months ago
sarulab-speech / Sidon
View on GitHub
Training code and dataset cleasing with Sidon
☆146Apr 24, 2026Updated 2 months ago
luisbvcc1 / NTUcool_VideoDownload
View on GitHub
☆36Dec 13, 2023Updated 2 years ago
slp-rl / PAST
View on GitHub
☆48Jul 7, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
diamondStar35 / top_speed
View on GitHub
An audio racing game
☆58Updated this week
dpfried / action-segmentation
View on GitHub
Weakly-supervised action segmentation in video
☆16Feb 13, 2022Updated 4 years ago
avishaiElmakies / unsupervised_speech_segmentation_using_slm
View on GitHub
☆20Jan 8, 2025Updated last year
huangjin520 / EMGANet
View on GitHub
[JBHI'2025] Edge-Aware Multi-Scale Group-Mix Attention Network for Breast Cancer Ultrasound Image Segmentation
☆29Updated this week
mltony / nvda-indent-nav
View on GitHub
☆18Apr 13, 2026Updated 3 months ago
tabahi / contexless-phonemes-CUPE
View on GitHub
pytorch model for contexless-phoneme prediction from speech audio
☆32Oct 30, 2025Updated 8 months ago
kaistmm / fregrad
View on GitHub
[ICASSP 2024] Official code for FreGrad
☆35May 13, 2024Updated 2 years ago