KoelLabs / ML
Koel Labs innovates open-source speech research, inclusive speech technologies, and real-time pronunciation feedback for language learners! This repo contains the ML training, evaluation, and data processing code.
☆17Updated 2 months ago
Alternatives and similar repositories for ML
Users that are interested in ML are comparing it to the libraries listed below
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆70Updated 3 months ago
- Visualize multi-model embedding spaces. The first goal is to quickly get a lay of the land of any embedding space. Then be able to scroll…☆27Updated last year
- msglm makes it a little easier to create messages for language models like Claude and OpenAI GPTs.☆14Updated last week
- A simple uv workspace☆19Updated 10 months ago
- Multi-Modal Language Modeling with Image, Audio and Text Integration, including multiple images and multiple audio clips in a single multi-turn conversation.☆18Updated last year
- a simple system for 2-way interruptible voice interactions between human and LLM☆30Updated last year
- This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. …☆13Updated last year
- An implementation of Anthropic's paper and essay on "A statistical approach to model evaluations"☆17Updated 4 months ago
- This is a repository that collects common audio noise reduction models, using Gradio to demonstrate the use of each model, which is very …☆50Updated 2 weeks ago
- ☆245Updated last month
- A python library to find differences between audio and transcriptions☆19Updated 2 years ago
- Evaluation framework for document processing models and services.☆63Updated last week
- ☆50Updated 3 months ago
- A package for NeuCodec: a 50 Hz, 0.8 kbps, 24 kHz audio codec.☆149Updated last week
- VoXtream is a Full-Stream Zero-shot TTS model with Extremely Low Latency☆185Updated 3 months ago
- Implementation of SoundStream from the paper: "SoundStream: An End-to-End Neural Audio Codec"☆13Updated last year
- ☆60Updated last month
- Fine-tune FLUX 1.dev for personal AI photos☆22Updated last year
- Open TTS models, built for streaming on the edge☆45Updated 10 months ago
- ☆20Updated 11 months ago
- Fast audio super resolution from 16 kHz to 48 kHz.☆192Updated last month
- Speaker diarization service☆26Updated last week
- Speech to Speech conversation using the OpenAI RealTime API in Python 🐍☆26Updated last year
- Load any clip model with a standardized interface☆22Updated 3 months ago
- Practice Notebook for AI Course☆12Updated 11 months ago
- Audio tokenization, in the fastest way possible!☆53Updated last year
- Dataset Viber is your chill repo for data collection, annotation and vibe checks.☆47Updated last year
- This is a repository for the course "From Beginner to LLM Developer" by Towards AI.☆12Updated last year
- code for training and using chess embeddings models☆13Updated last year
- Encountering 14 different Naive RAG fails and using KG to solve them☆20Updated 2 months ago