This is a fork of the original fairseq repository (version 0.12.2) with added classes for training mHuBERT-147.
☆21Nov 19, 2024Updated last year
Alternatives and similar repositories for fairseq
Users that are interested in fairseq are comparing it to the libraries listed below.
- Collection of scripts from mHuBERT-147.☆34Nov 19, 2024Updated last year
- ☆20Jul 22, 2022Updated 3 years ago
- My version of the RVC V2 Disconnected Colab notebook, which allows you to use RVC without using WebUI/Gradio☆15Jun 11, 2024Updated last year
- [ICASSP 2023] Tempo vs. Pitch: understanding self-supervised tempo estimation☆13Aug 2, 2023Updated 2 years ago
- Official code for Dense-TSNet☆12Sep 17, 2024Updated last year
- ☆27Nov 3, 2025Updated 5 months ago
- ESLTTS dataset☆16Feb 6, 2025Updated last year
- ☆10May 11, 2024Updated last year
- Semantically Search Emojis From the Command Line!☆13Nov 26, 2023Updated 2 years ago
- LLM-only topic extraction and classification☆11Sep 20, 2024Updated last year
- Implementation of BEST-RQ - a model for self-supervised learning of speech signals using a random projection quantizer, in Pytorch.☆134Sep 25, 2023Updated 2 years ago
- This is the code of the ICASSP 2020 paper "Joint phoneme alignment and text-informed speech separation on highly corrupted speech"☆16Apr 8, 2024Updated 2 years ago
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax☆16Jun 16, 2024Updated last year
- Simple tool for generating tokens with open source transformers and/or calculate per-token surprisal.☆14Apr 15, 2026Updated 2 weeks ago
- Run commands on remote hosts, inspecting key indicators to manage infrastructure☆15Jan 29, 2026Updated 3 months ago
- radiomixer☆14Feb 16, 2022Updated 4 years ago
- Implementation of 'Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis', in MLX☆24Oct 30, 2024Updated last year
- a Frontier Japanese Speech Generation net☆64May 15, 2025Updated 11 months ago
- ☆14Jun 25, 2024Updated last year
- VoiceBank-2023 is the speech corpus specially designed for constructing personalized Mandarin text-to-speech (TTS) systems.☆41Jan 4, 2026Updated 3 months ago
- Showcase how mxbai-embed-large-v1 can be used to produce binary embedding. Binary embeddings enabled 32x storage savings and 40x faster r…☆19Mar 23, 2024Updated 2 years ago
- [ICASSP 2025] FreeSVC: Towards Zero-shot Multilingual Singing Voice Conversion☆91Jul 23, 2025Updated 9 months ago
- ☆11Sep 4, 2023Updated 2 years ago
- A higher quality RVC pretrained model to accelerate your training process.☆22Nov 11, 2025Updated 5 months ago
- Blender addon for importing and exporting Hedgehog Engine 3D related file formats☆12Mar 19, 2026Updated last month
- ☆18Dec 23, 2025Updated 4 months ago
- A simple extension that allows an LLM to speak in any voice, literally, based on Silero TTS which is available in oobabooga's textgen-webui …☆12Aug 26, 2023Updated 2 years ago
- Code for vec2wav 2.0, a speech token vocoder for VC. Paper: https://arxiv.org/abs/2409.01995☆79Dec 3, 2024Updated last year
- ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations☆183Mar 6, 2024Updated 2 years ago
- ☆11May 4, 2020Updated 5 years ago
- ☆18Jul 22, 2024Updated last year
- ☆53Aug 27, 2021Updated 4 years ago
- Manifest Dumper is a GUI tool that creates game files for SteamTools.☆13May 8, 2025Updated 11 months ago
- Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSI…☆24Oct 8, 2025Updated 6 months ago
- Leveraging LLMs for Post-OCR Correction of Historical Newspapers☆17Jun 20, 2024Updated last year
- ☆16Apr 24, 2025Updated last year
- Modified version of the PyTorch implementation of "Audio Super Resolution in the Spectral Domain" (ICASSP 2023)☆26Apr 10, 2026Updated 3 weeks ago
- A repository comprising code for generating noisy speech data from clean data using deep learning methods☆16Jul 12, 2021Updated 4 years ago
- [EMNLP 2024] Official code for "Beyond Embeddings: The Promise of Visual Table in Multi-Modal Models"☆20Oct 17, 2024Updated last year