tincans-ai / gazelle-inference
Proof-of-concept conversation orchestrator with a speech-language model
☆13 · Updated 3 weeks ago
Related projects
Alternatives and complementary repositories for gazelle-inference
- Joint speech-language model - respond directly to audio! ☆30 · Updated 5 months ago
- Supervoice diffusion enhance ☆25 · Updated 3 months ago
- ☆9 · Updated last month
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using… ☆28 · Updated last year
- Collection of scripts from mHuBERT-147. ☆22 · Updated 4 months ago
- Audio tokenization, in the fastest way possible! ☆45 · Updated 2 months ago
- [Early Alpha] A unified framework for text-to-speech, voice conversion, automatic speech recognition, audio classification, voice activit… ☆21 · Updated 5 months ago
- Trying to build an all-in-one speech-text language model - a bit like GPT-4o ☆22 · Updated 5 months ago
- Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and text to speech synthesis. For phonemization an ONNX runtime port… ☆12 · Updated 2 months ago
- This is a fork of the original fairseq repository (version 0.12.2) with added classes for training mHuBERT-147. ☆12 · Updated 5 months ago
- Speaker Diarization with Transformers ☆59 · Updated 5 months ago
- Unofficial implementation of the WaveNeXt vocoder ☆31 · Updated 2 months ago
- Provides Gradio custom components to make the diarization-based audio labeling process easier and faster. ☆45 · Updated this week
- ☆17 · Updated 3 months ago
- ☆23 · Updated last year
- Text-To-Speech for NotebookLM ☆16 · Updated last week
- A synthetic story narration dataset to study small audio LMs. ☆29 · Updated 9 months ago
- ☆10 · Updated 2 months ago
- A simple system for two-way interruptible voice interactions between a human and an LLM ☆17 · Updated 8 months ago
- The TTSDS benchmark evaluates synthetic speech quality by considering prosody, speaker identity, and intelligibility, comparing these fac… ☆18 · Updated this week
- Production-ready vocoder using BigVSAN ☆11 · Updated 8 months ago
- This repository contains all the code necessary for running the multilingual DistilWhisper from the Ferraz et al. 2024 IEEE ICASSP paper. ☆18 · Updated 7 months ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling. ☆27 · Updated 8 months ago
- Unofficial implementation of ConvNeXt-TTS powered by Lightning ☆13 · Updated 3 weeks ago
- ☆61 · Updated 3 months ago
- VoiceBox neural network implementation ☆96 · Updated 3 months ago
- GPT for FACodec ☆13 · Updated 7 months ago
- Simplified recipes for preparing commonly used speech datasets, and a PyTorch-compatible Python data loader that can perform standard fea… ☆15 · Updated last year
- ☆16 · Updated 6 months ago
- ☆12 · Updated 3 months ago