s920128/NAR-BERT-ASR

☆10

Alternatives and similar repositories for NAR-BERT-ASR

Users who are interested in NAR-BERT-ASR are comparing it to the repositories listed below.

YosukeHiguchi / espnet
End-to-End Speech Processing Toolkit
☆16 • Jan 20, 2025 • Updated last year
kjw11 / CSEnet-ASR
Cross-Speaker Encoding Network for Multi-talker Speech Recognition
☆12 • Mar 14, 2025 • Updated last year
tango4j / llm_speaker_tagging
SLT 2024 Challenge: Post-ASR-Speaker-Tagging
☆16 • Jun 16, 2024 • Updated 2 years ago
DianboWork / M3T-CNERTA
☆11 • Aug 10, 2022 • Updated 3 years ago
kehanlu / server-monitor
A light webserver for monitoring RAM and GPU usage on multiple servers.
☆21 • Mar 31, 2021 • Updated 5 years ago
sinovation / ZEN2
The enhanced version of ZEN, larger and more powerful.
☆31 • Jul 22, 2022 • Updated 4 years ago
MiuLab / SpokenCSE
Contrastive Learning for Improving ASR Robustness in Spoken Language Understanding
☆11 • May 19, 2023 • Updated 3 years ago
Xianchao-Wu / wenet-deep-sparse-conformer
☆15 • Aug 25, 2022 • Updated 3 years ago
narihira2000 / GAS-NTUST-bulletin
A bot that sends NTUST bulletin board information via Google Apps Script
☆23 • Sep 22, 2022 • Updated 3 years ago
Mashiro009 / slidespeech_dl
☆24 • Sep 20, 2024 • Updated last year
netleibi / fastchunking
Fast text chunking algorithms for Python
☆12 • Oct 7, 2020 • Updated 5 years ago
TeaPoly / CE-OptimizedLoss
Optimized loss based on cross-entropy (CE), like MWER (minimum WER) Loss with beam search and negative sampling strategy, Smoothed Max Po…
☆25 • Oct 11, 2024 • Updated last year
facebookresearch / fbai-speech
Repo for the FB AI Speech team.
☆27 • Aug 24, 2021 • Updated 4 years ago
WThirteen / asr_AISHELL-3
Chinese speech recognition (a speech recognition model trained on the AISHELL-3 dataset)
☆11 • Oct 17, 2024 • Updated last year
rithiksachdev / PostASR-Correction-SLT2024
☆18 • Jul 22, 2024 • Updated 2 years ago
Alibaba-NLP / AISHELL-NER
[ICASSP 2022] AISHELL-NER: Named Entity Recognition from Chinese Speech
☆26 • Apr 20, 2022 • Updated 4 years ago
NaoyukiKanda / LibriSpeechMix
☆38 • Mar 30, 2021 • Updated 5 years ago
MingLunHan / CIF-PyTorch
[ICASSP 2020] CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition (A PyTorch implementation of Continuous Integrate-and-…
☆78 • Jul 14, 2026 • Updated last week
ductuantruong / speaker_age_estimation_ssl_study
[APSIPA'22] Exploring Speaker Age Estimation on Different Self-Supervised Learning Models
☆14 • Oct 19, 2022 • Updated 3 years ago
raotnameh / End-to-end-E2E-Named-Entity-Recognition-from-English-Speech
☆32 • Dec 2, 2020 • Updated 5 years ago
yhsong06 / LAU-Net
☆16 • May 23, 2025 • Updated last year
malradhi / PACodec
[ICASSP 2026] Official code for "Prosody-Guided Harmonic Attention for Phase-Coherent Neural Vocoding in the Complex Spectrum"
☆27 • Jan 22, 2026 • Updated 6 months ago
shoubhikraj / intel-cpu-patch
Provides a python script that can patch executables compiled with Intel compiler or Intel MKL, for better performance on AMD processors
☆11 • Jun 5, 2022 • Updated 4 years ago
albertwy / SWRM
Code for Findings of ACL 2022 Paper "Sentiment Word Aware Multimodal Refinement for Multimodal Sentiment Analysis with ASR Errors"
☆26 • Jun 15, 2022 • Updated 4 years ago
punyhumangames / ArcInventoryExample
Example project for Arc Inventory
☆14 • Mar 10, 2025 • Updated last year
JacobLinCool / MPSENet
Python package of MP-SENet from Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement.
☆22 • Nov 1, 2024 • Updated last year
chimechallenge / C8DASR-Baseline-NeMo
NeMo: a toolkit for conversational AI
☆13 • May 4, 2024 • Updated 2 years ago
xuchennlp / S2T
The project for speech translation
☆12 • Sep 28, 2023 • Updated 2 years ago
zqs01 / data2vecnoisy
☆11 • Oct 20, 2022 • Updated 3 years ago
bean661 / WoZaiXiaoYuanPuncher
Automatic health check-in program for WoZaiXiaoYuan (我在校园)
☆14 • Aug 1, 2022 • Updated 3 years ago
MingLunHan / CIF-HieraDist
[INTERSPEECH 2023] Knowledge Transfer from Pre-trained Language Models to Cif-based Recognizers via Hierarchical Distillation
☆41 • Jul 14, 2026 • Updated last week
yuwchen / InQSS
☆15 • Oct 6, 2023 • Updated 2 years ago
amazon-science / iwslt-autodub-task
☆21 • Mar 4, 2024 • Updated 2 years ago
fengpeng-yue / ASRTTS
ASR & TTS joint training, asr, tts, machine speech chain
☆16 • Oct 16, 2021 • Updated 4 years ago
fwinkelbauer / chunkyard
A backup tool
☆17 • May 21, 2026 • Updated 2 months ago
h-munakata / Lighthouse-Wrapper-for-Audio-Moment-Retrieval
☆13 • Mar 23, 2026 • Updated 4 months ago
wutong8023 / SpeechRE
☆11 • Nov 11, 2022 • Updated 3 years ago
Tylrin / Ability
The Ability plugin is a library for useful GAS base classes.
☆17 • Dec 1, 2021 • Updated 4 years ago
kjw11 / Speaker-Aware-CTC
Speaker-aware CTC (SACTC) for multi-talker overlapped speech recognition.
☆22 • May 26, 2025 • Updated last year