rinnakk / nue-asrLinks

Nue-ASR inference code by rinna Co., Ltd.

☆35

Alternatives and similar repositories for nue-asr

Users that are interested in nue-asr are comparing it to the libraries listed below

Sorting:

kotoba-tech / kotoba-speech-release
☆48Updated last year
nu-dialogue / moshi-finetune
Fine-tuning Moshi/J-Moshi on your own spoken dialogue data
☆75Updated 3 months ago
tonnetonne814 / PITS-44100-Ja
44100Hz日本語音源に対応した PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor です。
☆20Updated 2 years ago
MaAI-Kyoto / MaAI
A real-time and light-weight software for generation of non-linguistic behaviors (turn-taking, backchannel, and head-nodding) in conversa…
☆65Updated last week
tonnetonne814 / PL-Bert-VITS2
VITS2 using Phoneme-Level Japanese BERT
☆14Updated last year
inokoj / VAP-Realtime
A real-time implementation of Voice Activity Projection (VAP) is aimed at controlling behaviors of spoken dialogue systems, such as turn-…
☆92Updated 3 months ago
tonnetonne814 / QuickVC-44100-Ja_HuBERT
44100Hz日本語HuBERTに対応した QuickVC: Any-to-many Voice Conversion Using Inverse Short-time Fourier Transform for Faster Conversion です。
☆15Updated 2 years ago
sarulab-speech / jsut-label
context labels and pronunciation data for JSUT corpus
☆74Updated 4 years ago
sarulab-speech / xvector_jtubespeech
xvector model on jtubespeech
☆45Updated 2 years ago
sarulab-speech / audio-foundation-model-dataset
☆58Updated 10 months ago
yukara-ikemiya / wavefit-pytorch
PyTorch implementation of WaveFit [2022, Google] which is one of SOTA lightweight/fast speech vocoders.
☆60Updated 2 months ago
projectlucas / efficient_whisper
Robust Speech Recognition via Large-Scale Weak Supervision
☆19Updated 2 years ago
unilight / jatts
JATTS: A modern, research-oriented Japanese Text-to-speech Open-sourced Toolkit
☆43Updated 5 months ago
NeuralVox / OpenPhonemizer
An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…
☆102Updated last year
tonnetonne814 / SiFi-VITS2-44100-Ja
DDPM-based Pitch Generation and Pitch Controllable Voice Synthesis.
☆53Updated 2 years ago
6gsn / marine
☆35Updated 3 years ago
DwangoMediaVillage / pydomino
日本語音声に対して音素ラベルをアラインメントするためのツールです
☆34Updated 2 months ago
Respaired / Tsukasa-Speech
a Frontier Japanese Speech Generation net
☆57Updated 6 months ago
sarulab-speech / Coco-Nut
Coco-Nut (Corpus of connecting NIHONGO utterance and text) corpus
☆21Updated last year
litagin02 / anime_speaker_embedding
Speaker embedding for anime speech domain based on ECAPA_TDNN
☆15Updated 4 months ago
laboroai / TEDxJP-10K
☆23Updated 4 years ago
Respaired / RiFornet_Vocoder
a Neural Vocoder supporting Ring Attention, Conformer and NSF.
☆21Updated 3 months ago
tsukumijima / pyopenjtalk-plus
pyopenjtalk-plus: A Python wrapper for OpenJTalk with additional improvements
☆51Updated last week
reppy4620 / vocoders
My vocoder experiments
☆31Updated 3 months ago
sarulab-speech / ml-audiocaps
Multi-lingual AudioCaps
☆11Updated last year
efeslab / LiteASR
[EMNLP Main '25] LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation
☆133Updated 5 months ago
lourson1091 / audiobertscore
☆15Updated this week
sarulab-speech / tdmelodic_openjtalk
tdmelodic for open-jtalk
☆24Updated 4 years ago
kaiidams / Kokoro-Speech-Dataset
A public domain single speaker Japanese speech dataset
☆61Updated 2 years ago
xincanfeng / vitsGPT
☆57Updated last year