☆52Feb 19, 2026Updated 2 months ago
Alternatives and similar repositories for kani-tts-2-pretrain
Users that are interested in kani-tts-2-pretrain are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Multi-agent orchestration framework for AI applications - build, deploy, and manage AI agents across the full lifecycle with Forge, Conve…☆30Mar 28, 2026Updated last month
- Speaker embedding for anime speech domain based on ECAPA_TDNN☆19Jun 22, 2025Updated 10 months ago
- A MCP stdio toolpack for local LLMs☆31Apr 6, 2026Updated last month
- Protocol for Augmented Memory of Project Artifacts (MCP compatible) - extended☆24Jan 24, 2026Updated 3 months ago
- A curated collection of persona-based mcp server & tool groupings.☆36Sep 11, 2025Updated 8 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆46Oct 28, 2025Updated 6 months ago
- An educational Rust project for exporting and running inference on Qwen3 LLM family☆43Aug 3, 2025Updated 9 months ago
- A TTS Trained on Universal Audio.☆41Jun 6, 2025Updated 11 months ago
- A miniaturized version of the Kimi-K2 model optimized for deployment on single H100 GPUs.☆35Jul 16, 2025Updated 10 months ago
- Arabic Grapheme-to-Phoneme (G2P) Conversion☆14Mar 15, 2025Updated last year
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆33Apr 22, 2026Updated 3 weeks ago
- 4-bit Shampoo for Memory-Efficient Network Training (NeurIPS 2024)☆13Feb 13, 2025Updated last year
- Neural Homomorphic Vocoder optimized for singing voice synthesis☆33May 2, 2026Updated 2 weeks ago
- ☆43Aug 2, 2025Updated 9 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆28Nov 3, 2025Updated 6 months ago
- A bytebot variant that uses Holo 1.5 7b to control the desktop☆25Nov 4, 2025Updated 6 months ago
- Drax: Speech Recognition with Discrete Flow Matching☆75Oct 15, 2025Updated 7 months ago
- ☆36Oct 23, 2025Updated 6 months ago
- PyTorch Implementation of Stepwise Monotonic Multihead Attention similar to Enhancing Monotonicity for Robust Autoregressive Transformer …☆39May 16, 2021Updated 5 years ago
- ComfyUI Colab Notebook for Image and Video Generation.☆14Nov 27, 2023Updated 2 years ago
- Agentic BYOK Browser-Based Website Builder☆44Updated this week
- Whisper Speaker Identification (WSI), a cutting-edge model for multilingual speaker identification.☆26Mar 17, 2025Updated last year
- Neural Reflectance Field from Shading and Shadow under a Fixed Viewpoint☆16Aug 8, 2022Updated 3 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- A ComfyUI plugin that integrates [Pascal Editor](https://github.com/pascalorg/editor) — a full-featured 3D architectural editor — directl…☆94Apr 24, 2026Updated 3 weeks ago
- The AI-Native Cinematic Studio. A professional Non-Linear Editor (NLE) for filmmakers.☆75Mar 16, 2026Updated 2 months ago
- Unofficial PyTorch Implementation of "Were RNNs All We Needed?"☆17Mar 20, 2025Updated last year
- ☆12Dec 14, 2024Updated last year
- One Diffusion model implementation base on LibTorch☆13Mar 22, 2023Updated 3 years ago
- Basel morphable face model mesh and texture generator using GPU.☆14Sep 14, 2020Updated 5 years ago
- ArtSpeech: Adaptive Text-to-Speech Synthesis with Articulatory Representations☆21Sep 21, 2025Updated 7 months ago
- InfNeRF: Towards Infinite Scale NeRF Rendering with O(log n) Space Complexity☆12Jan 3, 2026Updated 4 months ago
- Code for the blog "Neural audio codecs: how to get audio into LLMs"☆165Oct 20, 2025Updated 6 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A collection of all our phonemeizers for dataset construction and inference☆30Feb 21, 2025Updated last year
- HiFTNet wav/audio super-resolution 16/24 kHz to 48 kHz☆24Jan 2, 2024Updated 2 years ago
- An AI assistant for PCs powered by Meta's LLaMA3 using Hugging Face, runs on voice recognition, text-to-speech. Send messages, voice/vide…☆19Jun 6, 2024Updated last year
- SpeechPlus: Small LLM-Based Text-to-Speech Library 🚀☆21May 20, 2025Updated 11 months ago
- MichiAI: A Low Latency, Full Duplex Speech LLM with zero coherence loss☆101Apr 24, 2026Updated 3 weeks ago
- Local banking voice assistant focused on banking☆64Apr 10, 2026Updated last month
- Topics of conferences☆12Jul 12, 2016Updated 9 years ago