nvidia-riva / nemo2riva

NeMo -> Riva Conversion Tool

☆12

Alternatives and similar repositories for nemo2riva:

Users that are interested in nemo2riva are comparing it to the libraries listed below

idiap / contextual-biasing-on-gpus
Implementation of the contextual biasing for ASR decoding on GPUs without lattice generation. The code supports submission to Interspeech…
☆19Updated last year
Open-Speech-EkStep / ULCA-asr-dataset-corpus
☆41Updated 2 years ago
NVIDIA / NeMo-speech-data-processor
A toolkit for processing speech data and creating speech datasets
☆104Updated this week
nvidia-riva / riva-asrlib-decoder
Standalone implementation of the CUDA-accelerated WFST Decoder available in Riva
☆83Updated last month
desh2608 / diarizer
Clustering-based methods for overlapping diarization
☆74Updated last year
revdotcom / speech-datasets
Various speech datasets made available to the public
☆110Updated last month
hainan-xv / PASM
Pronunciation-assisted Subword Modeling
☆29Updated 5 years ago
pzelasko / kaldialign
Python wrappers for Kaldi Levenshtein's distance and alignment code.
☆62Updated 10 months ago
alumae / online_speaker_change_detector
Online streaming speaker change detection model in Pytorch
☆37Updated last year
desh2608 / spyder
Simple Python package for fast DER computation
☆32Updated last year
BUTSpeechFIT / DiaPer
☆57Updated 11 months ago
diego-fustes / asr-rescoring
Rescoring methods for end-to-end Automatic Speech Recognition
☆27Updated 4 years ago
chimechallenge / chime-utils
Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.
☆21Updated last month
csukuangfj / kaldi_native_io
python wrapper for kaldi's native I/O
☆27Updated 3 weeks ago
backspacetg / simul_whisper
Code for our INTERSPEECH paper Simul-Whisper: Attention-Guided Streaming Whisper with Truncation Detection
☆56Updated 2 weeks ago
naver / multilingual-distilwhisper
This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.
☆20Updated 10 months ago
mkunes / w2v2_audioFrameClassification
wav2vec2 audio classification for prosodic boundary detection and other tasks
☆36Updated last year
RuABraun / texterrors
☆34Updated 4 months ago
qiujiali / lattice-rescore
☆16Updated 2 years ago
asappresearch / slue-toolkit
A toolkit for Spoken Language Understanding Evaluation (SLUE) benchmark. Refer paper https://arxiv.org/abs/2111.10367 for more details. O…
☆64Updated 11 months ago
egruttadauria98 / SSpaVAlDo
☆31Updated 9 months ago
Mu-Y / DiariST
☆19Updated last year
ccoreilly / wav2vec2-service
☆38Updated 3 years ago
csukuangfj / kaldilm
Python wrapper for kaldi's arpa2fst
☆38Updated last month
aispeech-lab / w2v-cif-bert
☆37Updated 3 years ago
fengredrum / finetune-whisper-lora
Fine-Tune Whisper with Transformers and PEFT
☆48Updated last year
k2-fsa / text_search
Some fast-ish algorithms for batch text search in moderate-sized collections, intended for data cleanup
☆63Updated 5 months ago
FrenchKrab / IS2023-powerset-diarization
Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.
☆78Updated last year
speech-paper-reading / speech-paper-reading
Repository for speech paper reading
☆32Updated 3 years ago
cornerfarmer / ctc_segmentation
Segment a given audio into utterances using a trained end-to-end ASR model.
☆72Updated 4 years ago