KoelLabs / MLLinks
Koel Labs innovates open-source speech research, inclusive speech technologies, and real-time pronunciation feedback for language learners! This repo contains the ML training, evaluation, and data processing code
☆16Updated last week
Alternatives and similar repositories for ML
Users that are interested in ML are comparing it to the libraries listed below
Sorting:
- A simple uv workspace☆17Updated 7 months ago
- A python library to find differences between audio and transcriptions☆19Updated 2 years ago
- Visualize multi-model embedding spaces. The first goal is to quickly get a lay of the land of any embedding space. Then be able to scroll…☆27Updated last year
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆68Updated last month
- An implementation of the Anthropic's paper and essay on "A statistical approach to model evaluations"☆16Updated last month
- Speaker diarization service☆24Updated 4 months ago
- a simple system for 2-way interruptible voice interactions between human and LLM☆30Updated last year
- Easily turn large sets of audio urls to an audio dataset.☆21Updated 2 years ago
- This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. …☆12Updated 11 months ago
- Open TTS models, built for streaming on the edge☆44Updated 8 months ago
- ☆202Updated last month
- Audio tokenization, in the fastest way possible!☆53Updated last year
- 🧪 Data Science | ⚒️ MLOps | ⚙️ DataOps : Talks about 🦄☆19Updated 3 months ago
- A package for NeuCodec: a 50hz, 0.8kbps, 24kHz audio codec.☆119Updated last month
- Evaluation framework for document processing models and services.☆55Updated last week
- Fine-tune FLUX 1.dev for personal AI photos☆22Updated last year
- Self-supervised neural network for music recommendations.☆18Updated 2 years ago
- An open source NLP as a service project focused on providing state of the art systems with ease. Training and inference by simple docker …☆20Updated last year
- Multi-Modal Language Modeling with Image, Audio and Text Integration, included multi-images and multi-audio in a single multiturn.☆18Updated last year
- code for training and using chess embeddings models☆12Updated last year
- ☆12Updated 6 months ago
- A dashboard for exploring timm learning rate schedulers☆19Updated last year
- TriNet: stabilizing self-supervised learning from complete or slow collapse on ASR.☆26Updated 2 years ago
- Misc. tools/scripts that I made to use for tortoise☆21Updated last year
- Cog wrapper for collabora/WhisperSpeech☆24Updated last year
- 🎹 pyannote + 🗒 notebook = pyannotebook☆26Updated 2 years ago
- Speech to Speech conversation using the OpenAI RealTime API in Python 🐍☆26Updated last year
- [NCMMSC'2024] Emotion-Aware Prosodic Phrasing for Expressive Text-to-Speech☆22Updated last year
- machine learning model performance metrics & charts with confidence intervals, optimized with numba to be fast☆16Updated 3 years ago
- Advanced Coding AI Assistant that uses a Gradio interface to stream coding related responses. ChatRAG supports local and API inference an…☆22Updated 6 months ago