dreji18 / Fine-tune-Speech-RecognitionLinks
Tutorial on how to train a custom voice recognition model using Hugging face models.
☆11Updated 2 years ago
Alternatives and similar repositories for Fine-tune-Speech-Recognition
Users that are interested in Fine-tune-Speech-Recognition are comparing it to the libraries listed below
Sorting:
- Real-Time Whisper Voice Recognition with vosk model feedback.☆119Updated 2 years ago
- Llama.cui is a small llama.cpp-based chat application for Node.js☆19Updated 4 months ago
- Adds a web API to RVC to infer via json requests☆29Updated last year
- Real-time Voice Activity Detection (VAD) with some example use case like simple voice bot and live transcription (realtime transcription)☆103Updated 3 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆97Updated last year
- streaming speech to text server using Whisper☆98Updated 2 years ago
- This project includes a Python script for fine-tuning a text-to-speech (TTS) model. The script utilizes custom datasets and use CUDA for …☆13Updated last year
- A lightweight, efficient variation of the StyleTTS 2 text‐to‐speech model.☆49Updated 6 months ago
- A sleek, customizable interface for managing LLMs with responsive design and easy agent personalization.☆17Updated last year
- Efficient approach to speaker diarization using voice characteristics extraction☆105Updated 5 months ago
- ☆36Updated 2 years ago
- ☆19Updated last year
- Mirror of hf.co/pyannote/speaker-diarization-3.1☆28Updated last year
- Train and finutune text-to-speech models for Bengali and many other languages!☆15Updated 7 months ago
- A collection of notebooks for the Hugging Face blog series (https://huggingface.co/blog).☆46Updated last year
- Imagine translating your speech or anybody's speech to any language you want within minutes. check this out...☆50Updated last year
- Notebooks using the Neural Magic libraries 📓☆39Updated last year
- Seamless Voice Interactions with LLMs☆12Updated 2 years ago
- kokoro text to speech using javascript☆63Updated 10 months ago
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with…☆245Updated 3 months ago
- Listen to any audio stream on your machine and print out the transcribed or translated audio.☆119Updated 2 years ago
- An open-source project that uses cutting-edge NLP models and real-time web search to provide dynamic voice query responses. Features incl…☆18Updated last year
- On-device speaker recognition engine powered by deep learning☆38Updated this week
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆52Updated 11 months ago
- The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained mode…☆12Updated last year
- Demo FastAPI WebSocket Audio☆41Updated 5 years ago
- TTS with The Massively Multilingual Speech (MMS) project☆231Updated last year
- Interactive chat application leveraging OpenAI's GPT-4 for real-time conversation simulations. Built with Flask, this project showcases s…☆25Updated last year
- An LLM-based app to easily track calories and exercise by taking a photo of your meal or describing your physical activity☆16Updated last month
- Text to Music Generation App built using Meta's Audiocraft library. It is a Streamlit application utilises Music Gen small model.☆26Updated 2 years ago