KoelLabs / MLLinks
Koel Labs innovates open-source speech research, inclusive speech technologies, and real-time pronunciation feedback for language learners! This repo contains the ML training, evaluation, and data processing code
☆17Updated 2 months ago
Alternatives and similar repositories for ML
Users that are interested in ML are comparing it to the libraries listed below
Sorting:
- Visualize multi-model embedding spaces. The first goal is to quickly get a lay of the land of any embedding space. Then be able to scroll…☆27Updated last year
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆70Updated 3 months ago
- Multimodal Open Source Framework for Conversational Agent Research and Development.☆22Updated 11 months ago
- Open TTS models, built for streaming on the edge☆45Updated 10 months ago
- A python library to find differences between audio and transcriptions☆19Updated 2 years ago
- Fast audio super resolution from 16khz to 48khz.☆192Updated last month
- a simple system for 2-way interruptible voice interactions between human and LLM☆30Updated last year
- Implementation of SoundtStream from the paper: "SoundStream: An End-to-End Neural Audio Codec"☆13Updated last year
- This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. …☆13Updated last year
- ☆245Updated last month
- Speaker diarization service☆26Updated last week
- A package for NeuCodec: a 50hz, 0.8kbps, 24kHz audio codec.☆150Updated last week
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆22Updated last year
- Audio tokenization, in the fastest way possible!☆53Updated last year
- A simple uv workspace☆19Updated 10 months ago
- Soprano-Factory: Train your own 2000x realtime text-to-speech model☆203Updated 3 weeks ago
- ☆15Updated last year
- Speech to Speech conversation using the OpenAI RealTime API in Python 🐍☆26Updated last year
- Self-supervised neural network for music recommendations.☆18Updated 2 years ago
- An implementation of the Anthropic's paper and essay on "A statistical approach to model evaluations"☆17Updated 4 months ago
- A library for making PyTorch models streamable☆57Updated 2 weeks ago
- msglm makes it a little easier to create messages for language models like Claude and OpenAI GPTs.☆14Updated last week
- VoXtream is a Full-Stream Zero-shot TTS model with Extremely Low Latency☆185Updated 3 months ago
- Liquid Audio - Speech-to-Speech audio models by Liquid AI☆388Updated 2 weeks ago
- DACVAE☆191Updated last month
- A lightweight Python library for running TTS models with a unified API.☆21Updated 11 months ago
- A collection of all our phonemeizers for dataset construction and inference☆27Updated 11 months ago
- Kanade is a single-layer disentangled speech tokenizer that extracts compact tokens suitable for both generative and discriminative model…☆68Updated this week
- Brainwave is a state-of-the-art neural decoder that transforms electroencephalogram (EEG) and brain signals into multimodal outputs inclu…☆14Updated 4 months ago
- Tokenizer for Text to Speech (TTS) models☆13Updated last year