kyegomez/MELLE



An open source community implementation of the model MELLE from the paper: "Autoregressive Speech Synthesis without Vector Quantization"

☆16

Alternatives and similar repositories for MELLE

Users who are interested in MELLE are comparing it to the libraries listed below.


The-Swarm-Corporation / SwarmOS
Traditional operating systems are reactive - they wait for user input or system events before taking action. SwarmOS breaks this paradigm…
☆15 · Dec 6, 2024 · Updated last year
The-Swarm-Corporation / HTX-Swarm
A sophisticated multi-agent system designed for real-time market analysis of HTX (formerly Huobi) exchange data. This swarm combines spec…
☆11 · Mar 18, 2025 · Updated last year
Ereboas / TacoLM
☆19 · May 2, 2024 · Updated 2 years ago
thuhcsi / dpss-exp3-VC-BNF
Voice Conversion Experiments for THUHCSI Course: <Digital Processing of Speech Signals>
☆18 · Nov 27, 2024 · Updated last year
The-Swarm-Corporation / Research-Paper-Writer-Swarm
Automate the creation of high-quality research papers in LaTeX. Powered by Swarms 🤖
☆11 · Dec 1, 2025 · Updated 7 months ago
kyegomez / SoundStream
Implementation of SoundStream from the paper: "SoundStream: An End-to-End Neural Audio Codec"
☆13 · Jan 27, 2025 · Updated last year
Agora-Lab-AI / Atom
A suite of fine-tuned LLMs for atomically precise function calling 🧪
☆16 · Updated this week
lavendery / AudioComposer
☆27 · Sep 10, 2025 · Updated 10 months ago
Agora-Lab-AI / OmegaViT
OmegaViT (ΩViT) is a cutting-edge vision transformer architecture that combines multi-query attention, rotary embeddings, state space mod…
☆15 · Updated this week
darkshapes / singularity
Shadowbox: A modern no-code AI instrument. UI thin client component.
☆18 · Jan 8, 2026 · Updated 6 months ago
bfs18 / e2_tts
☆69 · Sep 3, 2024 · Updated last year
kyegomez / MLXTransformer
Simple Implementation of a Transformer in the new framework MLX by Apple
☆19 · Nov 18, 2024 · Updated last year
SXU-YaxinGuo / CRMU
Commonsense Reasoning and Moral Understanding Evaluation in Children's Stories (CRMU)
☆18 · Oct 22, 2024 · Updated last year
ncclab-sustech / omni-eegbench
☆16 · Jun 9, 2026 · Updated last month
Agora-Lab-AI / The-Distiller
Generate high-quality textual or multi-modal datasets with agents
☆18 · Jun 7, 2023 · Updated 3 years ago
jin-woo-lee / nfs-binaural
☆13 · Aug 13, 2023 · Updated 2 years ago
kjw11 / CSEnet-ASR
Cross-Speaker Encoding Network for Multi-talker Speech Recognition
☆12 · Mar 14, 2025 · Updated last year
exporl / vlaai
Decoding of the speech envelope from EEG using the VLAAI deep neural network
☆14 · Sep 28, 2022 · Updated 3 years ago
cognitive-systems-lab / closed-loop-seeg-speech-synthesis
Corresponding source code for the study "Real-time Synthesis of Imagined Speech Processes from Minimally Invasive Recordings of Neural Ac…
☆11 · Jul 30, 2021 · Updated 4 years ago
NeuSpeech / NeuGPT
First neural GPT aligned with text and speech. Join us to build a better foundation model for the neural modality.
☆14 · Oct 30, 2024 · Updated last year
The-Swarm-Corporation / swarms-core
Multi-threading, concurrency, asynchrony, and various execution methods implemented in a Rust backend for bleeding-edge performance.
☆20 · Nov 11, 2024 · Updated last year
kaihuhuang / Language-Group
☆11 · Dec 24, 2024 · Updated last year
gengxuelong / wenet_LLM_from_ASLP
wenet_LLM_from_ASLP
☆15 · Nov 26, 2024 · Updated last year
CCTN-BCI / Neural2Speech
Code and speech demo for speech reconstruction from ECoG recordings
☆12 · May 21, 2025 · Updated last year
kyegomez / USM
Implementation of Google's USM speech model in PyTorch
☆36 · Updated this week
zqs01 / eeg2vec
☆11 · May 20, 2023 · Updated 3 years ago
Tencent / Freeze-Omni
The official implementation of Freeze-Omni.
☆16 · Jul 10, 2025 · Updated last year
kyegomez / OpenStrawberry
An open source replication of the strawberry method that leverages Monte Carlo Search with PPO and/or DPO
☆30 · Updated this week
NeuSpeech / MAD-MEG2text
☆22 · Nov 16, 2024 · Updated last year
The-Swarm-Corporation / AgentOS
AgentOS is a lightweight, single-file implementation that provides a robust foundation for building autonomous AI agents. It implements t…
☆25 · Jul 11, 2025 · Updated last year
xtpub / api-doc
API documentation for www.xt.com, www.xt.pub, etc.
☆10 · Jun 17, 2022 · Updated 4 years ago
JinchaoLove / CUHK-PhD-Thesis-Template
LaTeX template for CUHK PhD Thesis
☆14 · Jun 29, 2025 · Updated last year
exporl / auditory-eeg-challenge-2023-code
☆15 · Sep 1, 2023 · Updated 2 years ago
light1726 / Speech-Tokenization-Papers
This repository follows papers and reports on discrete speech representation learning and speech tokenization methods for speech language…
☆15 · Dec 1, 2023 · Updated 2 years ago
tbenst / silent_speech
Official repository for "A Cross-Modal Approach to Silent Speech with LLM-Enhanced Recognition"
☆18 · Mar 14, 2024 · Updated 2 years ago
pepebecker / pinyin-split
Split up any kind of Pinyin into an array of syllables.
☆11 · Aug 14, 2024 · Updated last year
frothywater / kanade-tokenizer
Kanade is a single-layer disentangled speech tokenizer that extracts compact tokens suitable for both generative and discriminative model…
☆108 · Jul 18, 2026 · Updated last week
Tobecoder / OpenWechat
WeChat Open Platform
☆10 · Nov 15, 2016 · Updated 9 years ago
liutaocode / AwesomeDiarizationDataset
Both audio-only and audio-visual speaker diarization datasets are listed here.
☆16 · Feb 22, 2023 · Updated 3 years ago