☆13Jul 10, 2021Updated 4 years ago
Alternatives and similar repositories for paper_summary
Users that are interested in paper_summary are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆23Jul 30, 2025Updated 7 months ago
- farmer is an automated machine learning library.👨🌾☆12Oct 15, 2021Updated 4 years ago
- 日本語音声に対して音素ラベルをアラインメントするためのツールです☆38Aug 19, 2025Updated 7 months ago
- ☆12Mar 11, 2025Updated last year
- JATTS: A modern, research-oriented Japanese Text-to-speech Open-sourced Toolkit☆44Mar 13, 2026Updated last week
- 2024 Latest laughter detection & segmentaion model. Paper: "Robust Laughter Segmentation with Automatic Diverse Data Synthesis", Interspe…☆62Sep 1, 2024Updated last year
- Diffusion-based Speech Enhancement: Demonstration of Performance and Generalization☆14Dec 21, 2024Updated last year
- PyTorch Implementation of ViT-TTS (EMNLP'23)☆11Oct 20, 2023Updated 2 years ago
- Coco-Nut (Corpus of connecting NIHONGO utterance and text) corpus☆21Jun 12, 2024Updated last year
- A TTS Trained on Universal Audio.☆41Jun 6, 2025Updated 9 months ago
- ☆16Dec 18, 2023Updated 2 years ago
- PyTorch implementation of Miipher-2 [2025] which is a speech restoration model by Google DeepMind☆64Sep 22, 2025Updated 6 months ago
- [INTERSPEECH 2024] The official implementation of EmoSphere-TTS: Emotional Style and Intensity Modeling via Spherical Emotion Vector for …☆172May 20, 2025Updated 10 months ago
- Official repository of the IEEE SLT 2024 paper "Self-Supervised Syllable Discovery Based on Speaker-Disentangled HuBERT"☆45Mar 17, 2026Updated last week
- [INTERSPEECH 2025 Oral]Official code for "Accelerating Diffusion-based Text-to-Speech Model Training with Dual Modality Alignment"☆64Jun 16, 2025Updated 9 months ago
- Make your own Miku song from anywhere on your Mac. Ported from https://aidn.jp/mikutap/☆11Jan 6, 2023Updated 3 years ago
- Reference-aware automatic speech evaluation toolkit☆180Dec 5, 2024Updated last year
- [ICLR2026] FlexiCodec: A Dynamic Neural Audio Codec for Low Frame Rates☆42Mar 10, 2026Updated 2 weeks ago
- ☆15Nov 10, 2025Updated 4 months ago
- ☆12Aug 12, 2021Updated 4 years ago
- [INTERSPEECH 2025] The official implementation of DiEmo-TTS: Disentangled Emotion Representations via Self-Supervised Distillation for…☆16Sep 7, 2025Updated 6 months ago
- Official Repository of UltraVoice☆59Oct 28, 2025Updated 4 months ago
- Pytorch implementation of MoLA☆21Jun 9, 2025Updated 9 months ago
- ☆15Apr 2, 2025Updated 11 months ago
- speaker-disentangled speech linguistic content quantizer☆24Mar 19, 2025Updated last year
- Aivis Voice Model File (.aivm/.aivmx) Generator / Editor☆15Feb 5, 2026Updated last month
- ☆17May 28, 2018Updated 7 years ago
- used to evaluate wavenet vocoder by rmse f0, MCD, rmse ap...☆15Jan 20, 2020Updated 6 years ago
- xvector model on jtubespeech☆47Nov 5, 2023Updated 2 years ago
- Keras implementation of the Structured Self-Attentive Sentence Embedding model☆19Aug 13, 2018Updated 7 years ago
- ☆28Aug 22, 2025Updated 7 months ago
- pytorch model for contexless-phoneme prediction from speech audio☆32Oct 30, 2025Updated 4 months ago
- A real-time and light-weight software for generation of non-linguistic behaviors (turn-taking, backchannel, and head-nodding) in conversa…☆83Feb 20, 2026Updated last month
- besigo is full spectrum lens simulator with KelemenMLT renderer.☆12Sep 4, 2015Updated 10 years ago
- Simple Python script to compute equal error rate (EER) for machine learning model evaluation.☆41Mar 12, 2020Updated 6 years ago
- Interface for graphics module providing various outputs options to render☆16Jan 30, 2026Updated last month
- Distributed & asynchronous DQN implementation using gRPC and PyTorch.☆10Feb 15, 2021Updated 5 years ago
- GPU-accelerated path tracer☆13Sep 30, 2015Updated 10 years ago
- Official Implementation for the paper: A Variational Framework for Improving Naturalness in Generative Spoken Language Models☆22Jun 18, 2025Updated 9 months ago