KakaruHayate / R3MOE
[RecurrentNN × Regression × Regularized]-based Mouth Opening Estimation via SSL (Semi-supervised Learning).
☆21Dec 6, 2025Updated 2 months ago
Alternatives and similar repositories for R3MOE
Users who are interested in R3MOE are comparing it to the libraries listed below.
- Hubert-based Forced Aligner☆32Updated this week
- ☆15Mar 31, 2025Updated 10 months ago
- ONNX deployment of the CREPE pitch tracker☆26Oct 27, 2022Updated 3 years ago
- Project page for our paper "DurIAN-SC: Duration Informed Attention Network based Singing Voice Conversion System".☆10Oct 12, 2020Updated 5 years ago
- speaker-disentangled speech linguistic content quantizer☆24Mar 19, 2025Updated 10 months ago
- An API wrapper for VOCALOID6.☆19Oct 24, 2022Updated 3 years ago
- Inference code for Interspeech 2025 paper, "LSCodec: Low-Bitrate and Speaker-Decoupled Discrete Speech Codec"☆35Oct 23, 2025Updated 3 months ago
- ☆21Dec 18, 2025Updated last month
- SOFA_AI: Singing-Oriented Forced Aligner for Automatic Inference☆25May 28, 2024Updated last year
- ☆24Apr 10, 2023Updated 2 years ago
- Official PyTorch implementation of the paper "Robust Training for Speaker Verification against Noisy Labels" in INTERSPEECH 2023.☆11Oct 23, 2023Updated 2 years ago
- 胖宝宝☆36Mar 15, 2025Updated 11 months ago
- DiffSinger Editor developed by OpenVPI☆36Oct 21, 2025Updated 3 months ago
- A Program to Generate Koikatsu Character Data with Deep Learning Models / コイカツのキャラクターデータを深層学習モデルで生成するプログラム☆11Apr 22, 2022Updated 3 years ago
- Code repository for ‘Adaptive Differential Denoising for Respiratory Sounds Classification’☆20Dec 19, 2025Updated last month
- It's Corn (PogChamps #3) Kaggle Competition 1st Place Winning Solution☆10Oct 4, 2022Updated 3 years ago
- ☆12Sep 18, 2022Updated 3 years ago
- ☆11Aug 11, 2023Updated 2 years ago
- This branch of Asteroid contains code for the vocal harmony and chamber ensemble separation related papers.☆12Nov 7, 2024Updated last year
- ☆16Jun 12, 2025Updated 8 months ago
- Resources for "Simple Speech Representation Learning from Perceptual Data".☆11Sep 18, 2023Updated 2 years ago
- ☆188Oct 14, 2025Updated 4 months ago
- Official repository of the work "Low-complexity Unsupervised Audio Anomaly Detection exploiting Separable Convolutions and Angular Loss" …☆10Nov 6, 2024Updated last year
- zako.work subdomains☆11Jan 2, 2026Updated last month
- [ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder☆12Mar 11, 2025Updated 11 months ago
- ☆11Nov 7, 2024Updated last year
- RIFE with IFUNet, FusionNet and RefineNet☆12Jun 30, 2022Updated 3 years ago
- SimplifiedTransformer simplifies transformer block without affecting training. Skip connections, projection parameters, sequential sub-bl…☆15Feb 6, 2026Updated last week
- This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. …☆13Dec 4, 2024Updated last year
- Official implementation of DGP-based multi-speaker speech synthesis with PyTorch☆24Mar 23, 2021Updated 4 years ago
- Lightweight multi-scale distillation attention network for image super-resolution☆11Sep 5, 2025Updated 5 months ago
- This is the accompanying repository to the paper - Automatic Estimation of Singing Voice Musical Dynamics☆15Oct 28, 2024Updated last year
- Detects shot boundaries in news footage with K-Means, using the Bhattacharyya coefficient as the distance measure.☆10Jun 1, 2017Updated 8 years ago
- ☆11Jan 12, 2023Updated 3 years ago
- ☆11Sep 26, 2024Updated last year
- [ICASSP 2025] Official implementation of "ViolinDiff: Enhancing Expressive Violin Synthesis with Pitch Bend Conditioning".☆14Feb 2, 2025Updated last year
- text to speech☆10Mar 19, 2024Updated last year
- Phonemes and durations labeling based on whisper small☆11Jul 7, 2024Updated last year
- Public female English corpus used for Project AI❤dol☆14Dec 28, 2025Updated last month