This is a fork of the original fairseq repository (version 0.12.2) with added classes for training mHuBERT-147.
☆21Nov 19, 2024Updated last year
Alternatives and similar repositories for fairseq
Users that are interested in fairseq are comparing it to the libraries listed below.
- Collection of scripts from mHuBERT-147.☆35Nov 19, 2024Updated last year
- ☆22Jul 22, 2022Updated 3 years ago
- My version of the RVC V2 Disconnected Colab notebook, which allows you to use RVC without using WebUI/Gradio☆15Jun 11, 2024Updated 2 years ago
- [ICASSP 2023] Tempo vs. Pitch: understanding self-supervised tempo estimation☆13Aug 2, 2023Updated 2 years ago
- Official code for Dense-TSNet☆12Sep 17, 2024Updated last year
- ☆27Nov 3, 2025Updated 7 months ago
- ESLTTS dataset☆16Feb 6, 2025Updated last year
- Implementation of BEST-RQ - a model for self-supervised learning of speech signals using a random projection quantizer, in Pytorch.☆134Sep 25, 2023Updated 2 years ago
- This is the code of the ICASSP 2020 paper "Joint phoneme alignment and text-informed speech separation on highly corrupted speech"☆16Apr 8, 2024Updated 2 years ago
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax☆16Jun 16, 2024Updated 2 years ago
- Run commands on remote hosts, inspecting key indicators to manage infrastructure☆15Jan 29, 2026Updated 5 months ago
- Reimplementation of Bandit for "Remastering Divide and Remaster: A Cinematic Audio Source Separation Dataset with Multilingual Support"☆61Jul 29, 2025Updated 11 months ago
- radiomixer☆14Feb 16, 2022Updated 4 years ago
- a Frontier Japanese Speech Generation net☆65May 15, 2025Updated last year
- Implementation of 'Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis', in MLX☆24Oct 30, 2024Updated last year
- VoiceBank-2023 is the speech corpus specially designed for constructing personalized Mandarin text-to-speech (TTS) systems.☆40Jan 4, 2026Updated 5 months ago
- [ICASSP 2025] FreeSVC: Towards Zero-shot Multilingual Singing Voice Conversion☆95Jul 23, 2025Updated 11 months ago
- ☆17Nov 27, 2024Updated last year
- ☆18Sep 22, 2022Updated 3 years ago
- ☆11Sep 4, 2023Updated 2 years ago
- A higher quality RVC pretrained model to accelerate your training process.☆22Nov 11, 2025Updated 7 months ago
- Blender addon for importing and exporting Hedgehog Engine 3D related file formats☆13Mar 19, 2026Updated 3 months ago
- A simple extension that allows an LLM to speak in any voice, literally, based on Silero TTS, which is available in oobabooga's textgen-webui …☆12Aug 26, 2023Updated 2 years ago
- ☆19Dec 23, 2025Updated 6 months ago
- An AR+AR TTS attempt.☆18Jan 13, 2025Updated last year
- Code for vec2wav 2.0, a speech token vocoder for VC. Paper: https://arxiv.org/abs/2409.01995☆79Dec 3, 2024Updated last year
- ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations☆183Mar 6, 2024Updated 2 years ago
- ☆11May 4, 2020Updated 6 years ago
- ☆18Jul 22, 2024Updated last year
- ☆53Aug 27, 2021Updated 4 years ago
- Manifest Dumper is a GUI tool that creates game files for SteamTools.☆13May 8, 2025Updated last year
- Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSI…☆25Oct 8, 2025Updated 8 months ago
- ☆18Apr 24, 2025Updated last year
- Retrieval-based-Voice-Conversion (RVC) modified and enhanced by codename;0☆13Jul 8, 2024Updated last year
- A repository comprising code for generating noisy speech data from clean data using deep learning methods☆16Jul 12, 2021Updated 4 years ago
- [EMNLP 2024] Official code for "Beyond Embeddings: The Promise of Visual Table in Multi-Modal Models"☆20Oct 17, 2024Updated last year
- Multilingual-Speech-Synthesis-Voice-Conversion Using Bark + RVC☆14Apr 19, 2025Updated last year
- [NeurIPS 2022] "Losses Can Be Blessings: Routing Self-Supervised Speech Representations Towards Efficient Multilingual and Multitask Spee…☆17Sep 19, 2023Updated 2 years ago
- ☆25Mar 6, 2024Updated 2 years ago