zszheng147/VoiceCraft-X

☆42

Alternatives and similar repositories for VoiceCraft-X

Users who are interested in VoiceCraft-X are comparing it to the repositories listed below.

paraynaud / MTH8408-Hiv24
☆15 · Mar 25, 2024 · Updated 2 years ago
ZhikangNiu / A-DMA
[INTERSPEECH 2025 Oral] Official code for "Accelerating Diffusion-based Text-to-Speech Model Training with Dual Modality Alignment"
☆67 · Jun 16, 2025 · Updated last year
the-bird-F / Expressive-Vectors
[ICASSP 2026] Task Vector in TTS: Toward Emotionally Expressive Dialectal Speech Synthesis
☆40 · Dec 24, 2025 · Updated 7 months ago
akhileshthite / zipify-tunes
Convert any playlist CSVs into MP3 files with metadata, bring back your MP3 player!
☆21 · Jul 4, 2026 · Updated 3 weeks ago
tensake / litehook
Lightweight social media monitoring tool built with Rust
☆17 · Jun 10, 2026 · Updated last month
fluxions-ai / stftvae
Inference for the STFT-VAE continuous audio codec (24kHz, 3.125Hz latent)
☆43 · Jul 12, 2026 · Updated 2 weeks ago
JackXing875 / NeneBot
Ayachi Nene is the cutest in the world!
☆16 · Jul 14, 2026 · Updated 2 weeks ago
zkingston / unknot
Unknot Motion Planning
☆15 · Mar 30, 2025 · Updated last year
SYuan03 / VisaAppointmentWatcher
☆16 · Jul 15, 2025 · Updated last year
Belyenochi / openclaw-edd
Evaluation-Driven Development for OpenClaw agents — mine golden cases from real sessions, catch regressions before they ship.
☆18 · Mar 17, 2026 · Updated 4 months ago
ryota-komatsu / speech_resynth
Speech Resynthesis and Language Modeling
☆27 · Jun 11, 2025 · Updated last year
vishnu97770 / VELOTYPE
Adaptive AI-powered typing practice system that analyzes repeated user mistakes and generates personalized corrective tasks using FastAPI…
☆15 · Jun 6, 2026 · Updated last month
lkarlslund / tokenrouter
One OpenAI-compatible endpoint for all your AI providers
☆17 · Apr 23, 2026 · Updated 3 months ago
jaredrummler / consoul
A beautiful terminal-based AI chat interface built with Textual and LangChain
☆15 · Jan 7, 2026 · Updated 6 months ago
ErikZ719 / CoTA
[ICLR 26] Context Tokens are Anchors: Understanding the Repeat Curse in dMLLMs from an Information Flow Perspective
☆16 · Mar 6, 2026 · Updated 4 months ago
Soul-AILab / SAC
[ACL 2026 Main] Training, inference, and testing of the SAC speech codec model.
☆108 · Nov 1, 2025 · Updated 8 months ago
nmandic78 / AI-VoiceAssistant
A Python-based voice assistant integrating speech-to-text (STT), text-to-speech (TTS), and powerful AI capabilities using either a local …
☆19 · Jul 22, 2026 · Updated last week
Shy-98 / MELLE
Unofficial PyTorch implementation of "Autoregressive Speech Synthesis without Vector Quantization (MELLE)"
☆41 · Jun 28, 2025 · Updated last year
Jivoronix / blockchain-data-validator
☆15 · Jan 31, 2025 · Updated last year
sumerc / zee
Voice transcription that stays out of your way. Runs fully on-device — local Parakeet and Whisper on Metal, no account, no API key, no ne…
☆15 · Updated this week
jingzhunxue / FlowMirror_HydraVox
FlowMirror-HydraVox — A natively accelerated multi-head autoregressive TTS system derived from CosyVoice 3.0. It predicts multiple tokens…
☆49 · Feb 17, 2026 · Updated 5 months ago
Yonoi / AGL-STAN
The code of paper "Spatial-Temporal Attention Network for Crime Prediction with Adaptive Graph Learning"
☆15 · Mar 18, 2023 · Updated 3 years ago
SonyResearch / VRVQ
Variable Bitrate Residual Vector Quantization for Audio Coding
☆54 · May 1, 2025 · Updated last year
dannysun85 / ClawX
Built on ClawX, completely refactored with brand-new features added!
☆16 · Feb 23, 2026 · Updated 5 months ago
timjuenemann / wikipedia-mcp
Wikipedia MCP Server written in TypeScript
☆15 · Apr 18, 2025 · Updated last year
samsad35 / code-ancogen
[ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder
☆14 · Mar 11, 2025 · Updated last year
mattkang0 / weatpy
A Python implementation of wego
☆15 · Oct 15, 2016 · Updated 9 years ago
omar-A-hassan / medsci-agent
Biomedical research agent with 28 MCP tools powered by MedGemma, TxGemma and OpenCode
☆18 · Mar 14, 2026 · Updated 4 months ago
hhguo / SoCodec
Ultra-low-bitrate Speech Codec for Speech Language Modeling Applications
☆92 · Dec 20, 2024 · Updated last year
green-anger / MemoryPool
Fast Efficient Fixed-Size Memory Pool
☆15 · Dec 12, 2018 · Updated 7 years ago
junminhong / awesome-agent-skills
A curated list of agent skills, resources, and tools for building customizable AI workflows (Claude Code, Codex, Kiro-CLI)
☆15 · Updated this week
tbrumue / heapo
HEAPO – An Open Dataset for Heat Pump Optimization with Smart Electricity Meter Data and On-Site Inspection Protocols
☆16 · Mar 26, 2025 · Updated last year
lsm1103 / session-dashboard
Browse and monitor the historical session records of AI programming tools (Claude Code, Codex CLI, Cursor, Aider)
☆16 · Mar 18, 2026 · Updated 4 months ago
ycdfwzy / PL-MSCKF
☆16 · Jan 6, 2023 · Updated 3 years ago
acgessler / rust-persistent-kv
Persistent fault-tolerant key-value store in Rust
☆16 · Mar 17, 2025 · Updated last year
pretty66 / fastcar
PHP long connection proxy, eliminates short links and reduces request latency
☆20 · Oct 4, 2023 · Updated 2 years ago
gorodnitskiy / jax-cuda-docker
This repo contains Dockerfile that can be used to easily run JAX with CUDA support in Docker without JAX and CUDA/cuDNN versions mismatch…
☆15 · Feb 1, 2023 · Updated 3 years ago
Lukeli0425 / Coord-SoS-PACT
[ICCV 2025] Coordinate-based Speed of Sound Recovery for Aberration-Corrected Photoacoustic Computed Tomography
☆16 · Apr 2, 2026 · Updated 3 months ago
ajd12342 / paraspeechcaps
Codebase for 'Scaling Rich Style-Prompted Text-to-Speech Datasets'
☆165 · Mar 26, 2026 · Updated 4 months ago