X-Voice
☆142 · May 22, 2026 · Updated last week
Alternatives and similar repositories for X-Voice
Users that are interested in X-Voice are comparing it to the libraries listed below.
- Towards Comprehensive Evaluation for End-to-End Spoken Dialogue Models ☆55 · Sep 2, 2025 · Updated 8 months ago
- Professional Audio Recording, Playback & Voice Activity Detection for Expo ☆31 · Mar 2, 2026 · Updated 2 months ago
- Deep Research Tool ☆25 · Nov 8, 2025 · Updated 6 months ago
- ☆26 · Mar 13, 2024 · Updated 2 years ago
- Spatial Voice Conversion: Voice Conversion Preserving Spatial Information and Non-target Signals ☆18 · Aug 8, 2024 · Updated last year
- Interface Design for Self-Supervised Speech Models, Accepted to Interspeech2024 ☆16 · Nov 19, 2024 · Updated last year
- Quantum Computing Resources & Syllabus – Mind Map Format, 1. Resources – curated list of books, courses, videos. 2. Syllabus – visual cou… ☆27 · Sep 13, 2025 · Updated 8 months ago
- Unofficial implementation of DreamTalk in ComfyUI ☆12 · Aug 15, 2024 · Updated last year
- ☆31 · Feb 7, 2026 · Updated 3 months ago
- Goodness of Pronunciation algorithm using PyKaldi ☆18 · Jun 12, 2022 · Updated 3 years ago
- Comfort Noise Generator Module Port From WebRTC ☆22 · Mar 4, 2019 · Updated 7 years ago
- ☆28 · Nov 27, 2025 · Updated 6 months ago
- [INTERSPEECH 2023 Best Paper Shortlist] Official implementation for MT4SSL: Boosting Self-Supervised Speech Representation Learning by In… ☆45 · Mar 25, 2024 · Updated 2 years ago
- Cohere Transcribe in Rust ☆94 · May 19, 2026 · Updated last week
- All the content of my youtube channel : https://youtube.com/@florenzerstling?si=7t10PBr6MDha74PO ☆14 · May 28, 2025 · Updated last year
- ViSAudio: End-to-End Video-Driven Binaural Spatial Audio Generation ☆117 · Dec 11, 2025 · Updated 5 months ago
- Multi Model Personal Assistant Wrapper in Go: Interact with ChatGPT, Claude or Ollama Cross Platform (Speech & Image generation supported… ☆16 · May 11, 2026 · Updated 2 weeks ago
- ☆18 · Dec 2, 2025 · Updated 5 months ago
- DoyenTalker uses deep learning techniques to generate personalized avatar videos that speak user-provided text in a specified voice. The … ☆14 · Sep 20, 2024 · Updated last year
- Detecting segments belonging to which song in database, and return Nil if does not exist in a database. ☆22 · May 13, 2021 · Updated 5 years ago
- Face Animation from Text ☆18 · Aug 1, 2024 · Updated last year
- A Python project aimed at making an automatic MIDI music generator ☆22 · Mar 29, 2026 · Updated 2 months ago
- Just another FastSpeech 2 but cleaner code :) ☆29 · Jun 28, 2024 · Updated last year
- [IJCAI 2024] EAT: Self-Supervised Pre-Training with Efficient Audio Transformer ☆229 · Nov 30, 2025 · Updated 5 months ago
- A SIP/WebRTC voice agent ☆160 · May 21, 2026 · Updated last week
- A highly scalable, offline-first foundation with the best developer experience and a focus on performance and best practices. ☆21 · Feb 16, 2020 · Updated 6 years ago
- ChatBot using Meta AI Llama v2 LLM model on your local PC. ☆12 · Aug 6, 2025 · Updated 9 months ago
- ComfyUI Yolo World EfficientSAM custom node ☆15 · Jul 16, 2024 · Updated last year
- ☆17 · Apr 25, 2026 · Updated last month
- Accept payments on your React Native-based apps with cards, wallets, and key local payment methods ☆21 · Sep 7, 2021 · Updated 4 years ago
- Misc. tools/scripts that I made to use for tortoise ☆21 · Aug 19, 2024 · Updated last year
- strapi & socket.io ☆18 · Feb 18, 2018 · Updated 8 years ago
- A curated list of Text-to-Video Generation papers and BibTeX entries ☆21 · Feb 21, 2024 · Updated 2 years ago
- ☆27 · Aug 2, 2024 · Updated last year
- Incremental Disentanglement for Environment-Aware Zero-Shot Text-to-Speech Synthesis ☆27 · Mar 21, 2025 · Updated last year
- ☆28 · Nov 15, 2023 · Updated 2 years ago
- A PyTorch implementation: "LASAFT-Net-v2: Listen, Attend and Separate by Attentively aggregating Frequency Transformation" ☆33 · Apr 11, 2022 · Updated 4 years ago
- ☆22 · Oct 2, 2020 · Updated 5 years ago
- Taming Stable Diffusion for Lip Sync! ☆16 · Mar 18, 2025 · Updated last year