Official implementation of the paper: "NeoBabel: A Multilingual Open Tower for Visual Generation"
☆23Aug 4, 2025Updated 9 months ago
Alternatives and similar repositories for NeoBabel
Users that are interested in NeoBabel are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder☆14Mar 11, 2025Updated last year
- Official code for SongEcho☆64Mar 3, 2026Updated 3 months ago
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated last year
- A TTS Trained on Universal Audio.☆41Jun 6, 2025Updated 11 months ago
- ☆11Feb 20, 2025Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆14Feb 3, 2026Updated 3 months ago
- My vocoder experiments☆31Jul 26, 2025Updated 10 months ago
- Tidy Tunes is an easy-to-use pipeline for mining high-quality audio data for speech generation models. To do so, it chains multiple open …☆23May 19, 2026Updated 2 weeks ago
- Open Source code for our paper, Steering Autoregressive Music Generation with Recursive Feature Machines (Zhao et al., 2025). aka MusicRF…☆40Oct 26, 2025Updated 7 months ago
- [ICLR 2025] Official code repository for "TULIP: Token-length Upgraded CLIP"☆32Jan 26, 2026Updated 4 months ago
- World-Gymnast: Training Robots with Reinforcement Learning in a World Model☆36Feb 11, 2026Updated 3 months ago
- A practice to handle multi-modal datasets in a unified way.☆10Apr 15, 2024Updated 2 years ago
- ☆17Dec 12, 2023Updated 2 years ago
- LIGHTVOC AN UPSAMPLING-FREE GAN VOCODER BASED ON CONFORMER AND INVERSE SHORT-TIME FOURIER TRANSFORM☆18May 17, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆15Apr 16, 2026Updated last month
- [ICCV 2023] Bayesian Prompt Learning for Image-Language Model Generalization☆42Oct 6, 2023Updated 2 years ago
- ☆19Sep 9, 2024Updated last year
- [ICLR2026] AliTok: Towards Sequence Modeling Alignment between Tokenizer and Autoregressive Model☆56Oct 12, 2025Updated 7 months ago
- Variable Bitrate Residual Vector Quantization for Audio Coding☆52May 1, 2025Updated last year
- ☆22Apr 4, 2023Updated 3 years ago
- Voice conversion with just linear regression.☆37Sep 25, 2025Updated 8 months ago
- Unofficial implementation of wavenext vocoder☆59Aug 28, 2024Updated last year
- ☆47Aug 31, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Speech Resynthesis and Language Modeling☆27Jun 11, 2025Updated 11 months ago
- Official Implementation for the paper: A Variational Framework for Improving Naturalness in Generative Spoken Language Models☆24Jun 18, 2025Updated 11 months ago
- Official implementation of "Unsupervised Pre-training for Data-Efficient Text-to-Speech on Low Resource Languages", ICASSP 2023☆27Apr 27, 2023Updated 3 years ago
- 基于FreeVC的歌声转换☆21Dec 16, 2022Updated 3 years ago
- ☆25Jan 24, 2023Updated 3 years ago
- Official implementation of paper: Shallow Flow Matching for Coarse-to-Fine Text-to-Speech Synthesis☆53Sep 20, 2025Updated 8 months ago
- A Singing Style Conversion Framework Based On Audio Infilling☆34Apr 28, 2025Updated last year
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆63Aug 30, 2024Updated last year
- Implementation of RIFT-SVC, a singing voice conversion model based on Rectified Flow Transformer.☆62Nov 10, 2025Updated 6 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [ASRU 2023] Code of paper SALT: Distinguishable Speaker Anonymization Through Latent Space Transformation☆22Aug 13, 2024Updated last year
- Zero-shot voice cloning text-to-speech (TTS) with explicit emotion class conditioning built on F5-TTS☆37Mar 3, 2026Updated 3 months ago
- ☆89Dec 31, 2025Updated 5 months ago
- A neural speech codec based on discrete WavLM representations☆26Aug 28, 2024Updated last year
- Base Code of "LifeLonger: A Benchmark for Continual Disease Classification, MICCAI, 2022"☆22Dec 29, 2022Updated 3 years ago
- Mixture of A Million Experts☆55Jul 30, 2024Updated last year
- This branch of Asteroid contains code for the vocal harmony and chamber ensemble separation related papers.☆12Nov 7, 2024Updated last year