ryota-komatsu/slp2025

Survey of audio language models

☆65

Alternatives and similar repositories for slp2025

Users who are interested in slp2025 are comparing it to the repositories listed below.

unilight / jatts
JATTS: A modern, research-oriented Japanese Text-to-speech Open-sourced Toolkit
☆43 · Mar 13, 2026 · Updated 4 months ago
sarulab-speech / audio-foundation-model-dataset
☆65 · Jan 8, 2025 · Updated last year
onolab-tmu / asp-tutorial-2022
Ono laboratory audio signal processing exercise for beginners.
☆19 · May 10, 2023 · Updated 3 years ago
neodyland / entropix
Unofficial entropix implementation for Gemma2, Llama, Qwen2, and Mistral
☆17 · Jan 12, 2025 · Updated last year
takamichi-lab / paperwriting_checklist
Paper-writing checklist
☆21 · Jul 3, 2026 · Updated 3 weeks ago
Parakeet-Inc / J-HARD-TTS-Eval
☆21 · Jan 28, 2026 · Updated 6 months ago
DwangoMediaVillage / pydomino
A tool for aligning phoneme labels to Japanese speech
☆40 · Aug 19, 2025 · Updated 11 months ago
swallow-llm / swallow-evaluation
Evaluation scripts for the Swallow project's large language models
☆25 · Sep 17, 2025 · Updated 10 months ago
tam17aki / speech_process_exercise
Toward an n-problem drill set ("n knocks") for speech information processing
☆133 · Jun 13, 2024 · Updated 2 years ago
llm-jp / llama-mimi
Llama-Mimi is a speech language model that uses a unified tokenizer (Mimi) and a single Transformer decoder (Llama) to jointly model sequ…
☆31 · Sep 20, 2025 · Updated 10 months ago
pfnet-research / plamo-examples
☆25 · May 29, 2025 · Updated last year
ayutaz / uni-llm-voice-chat
This project uses llama.cpp as an LLM server to perform inference and generates speech using a synthetic voice library
☆22 · Mar 5, 2024 · Updated 2 years ago
hirokisince1998 / jasj-bibtex
BibTeX style file for the Journal of the Acoustical Society of Japan
☆11 · Jan 24, 2022 · Updated 4 years ago
nu-dialogue / moshi-finetune
Fine-tuning Moshi/J-Moshi on your own spoken dialogue data
☆101 · Jan 5, 2026 · Updated 6 months ago
yukara-ikemiya / Open-Miipher-2
PyTorch implementation of Miipher-2 (2025), a speech restoration model by Google DeepMind
☆70 · Sep 22, 2025 · Updated 10 months ago
pfnet-research / pfgen-bench
Preferred Generation Benchmark
☆103 · Mar 6, 2026 · Updated 4 months ago
muramasa2 / paper_summary
☆13 · Jul 10, 2021 · Updated 5 years ago
tennmoku71 / advent_calendar_cyberagent_llm_dialogue_system
☆11 · Jan 10, 2024 · Updated 2 years ago
remdis / remdis
The Remdis toolkit: Building advanced real-time multimodal dialogue systems with incremental processing and large language models
☆102 · Jun 20, 2026 · Updated last month
speed1313 / jax-llm
JAX implementation of Large Language Models. You can train a GPT-2-like model with 青空文庫 (aozora bunko-clean dataset) or any other text dat…
☆13 · Aug 5, 2024 · Updated last year
b-sigpro / neural-fcasa
This is a repository of neural full-rank spatial covariance analysis with speaker activity (neural FCASA).
☆40 · Mar 12, 2025 · Updated last year
nttcslab / eval-audio-repr
EVAR ~ Evaluation package for Audio Representations
☆81 · Feb 19, 2026 · Updated 5 months ago
MaAI-Kyoto / MaAI
Real-time software for turn-taking, backchannel, and head-nodding prediction
☆107 · Jul 21, 2026 · Updated last week
yamato0811 / streamlit-langgraph-HITL-copy-generator
A human-in-the-loop ad copy generation application built with Streamlit and LangGraph
☆11 · Feb 15, 2025 · Updated last year
KoheiYatabe / DGTtool
A simple and user-friendly tool for computing STFT/DGT
☆19 · Jun 22, 2021 · Updated 5 years ago
nadare881 / voice-changer-vector-search
This is a repository for comparing voice changer results and searching datasets and trained models.
☆30 · May 21, 2023 · Updated 3 years ago
abap34 / almo
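ALMO is an extended Markdown parser and static site generator. It uses WebAssembly to provide an execution environment that runs entirely in the browser, making it possible to build pages that offer sample-code execution environments and judge systems without a server.
☆16 · Apr 14, 2026 · Updated 3 months ago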
ryota-komatsu / speech_resynth
Speech Resynthesis and Language Modeling
☆27 · Jun 11, 2025 · Updated last year
kyama0321 / gammachirpy
A Python package of the dynamic compressive gammachirp filterbank (dcGC-FB)
☆32 · May 14, 2024 · Updated 2 years ago
yamathcy / music-deeplearning-japanese
Deep learning × music information processing study group at the University of Tsukuba, Human and Sound Informatics Laboratory
☆19 · Jul 9, 2023 · Updated 3 years ago
HidekiKawahara / YANGstraight_source
Analytic signal-based source information analysis for YANGstraight and real-time interactive tools
☆34 · Aug 20, 2019 · Updated 6 years ago
SakanaAI / kame_finetune
☆30 · Jul 16, 2026 · Updated last week
nu-dialogue / j-moshi
J-Moshi: A Japanese Full-duplex Spoken Dialogue System
☆316 · Jun 4, 2025 · Updated last year
nobutaka-ito / pulse
Official PyTorch implementation of PULSE: Positive–Unlabelled Learning for audio Signal Enhancement (Best Paper Award at ICASSP 2023)
☆43 · Jul 24, 2023 · Updated 3 years ago
mtshiba / mitou-docs
Documents related to the Shibayama project of the FY2023 MITOU IT Human Resources Discovery and Development Program
☆15 · Apr 19, 2024 · Updated 2 years ago
takamichi-lab / nlp-lecture-keio
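"Natural Language Processing" lecture, Department of Information and Computer Science, Faculty of Science and Technology, Keio University
☆19 · Jul 15, 2026 · Updated 2 weeks ago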
Hiroshiba / hihobot-synthesis
Speech synthesis with your own voice
☆17 · Mar 4, 2019 · Updated 7 years ago
SmartSoundKAIST / 6DRIR-DL
6 DoF Directional Room Impulse Response (RIR) with Dense Loudspeaker Grid
☆17 · Aug 31, 2023 · Updated 2 years ago
oshizo / JapaneseEmbeddingEval
☆183 · Oct 9, 2024 · Updated last year