dreji18 / Fine-tune-Speech-RecognitionLinks
Tutorial on how to train a custom voice recognition model using Hugging face models.
☆11Updated 2 years ago
Alternatives and similar repositories for Fine-tune-Speech-Recognition
Users that are interested in Fine-tune-Speech-Recognition are comparing it to the libraries listed below
Sorting:
- AI-powered YouTube Notes Generator: Create detailed notes from YouTube videos. Streamlit UI for easy use.☆48Updated last year
- Your customized AI assistant - Personal assistants on any hardware! With llama.cpp, whisper.cpp, ggml, LLaMA-v2.☆116Updated last year
- This project includes a Python script for fine-tuning a text-to-speech (TTS) model. The script utilizes custom datasets and use CUDA for …☆13Updated 10 months ago
- Real-time Voice Activity Detection (VAD) with some example use case like simple voice bot and live transcription (realtime transcription)☆90Updated last year
- kokoro text to speech using javascript☆59Updated 6 months ago
- Interactive chat application leveraging OpenAI's GPT-4 for real-time conversation simulations. Built with Flask, this project showcases s…☆25Updated last year
- GGUF Quantization of any LLM.☆40Updated last year
- A sleek, customizable interface for managing LLMs with responsive design and easy agent personalization.☆16Updated 11 months ago
- ☆16Updated last year
- 🧠 Mem4AI: A LLM Friendly memory management library.☆29Updated 9 months ago
- ☆18Updated 11 months ago
- Code for OpenAI Whisper Web App Demo☆93Updated 2 years ago
- Real-Time Whisper Voice Recognition with vosk model feedback.☆118Updated 2 years ago
- ☆16Updated last year
- Auto-Video maker handling many AI's☆11Updated last year
- streaming speech to text server using Whisper☆94Updated 2 years ago
- An JS web client for connecting to Pipecat bots with voice and vision☆45Updated 7 months ago
- Seamless Voice Interactions with LLMs☆12Updated last year
- Listen to any audio stream on your machine and print out the transcribed or translated audio.☆119Updated 2 years ago
- Text to Music Generation App built using Meta's Audiocraft library. It is a Streamlit application utilises Music Gen small model.☆27Updated 2 years ago
- Open Server is an OpenAI API Compatible Server for generating text, images, embeddings, and storing them in vector databases. It also inc…☆17Updated last year
- Add real-time Speech-to-Text to your LiveKit application with AssemblyAI☆17Updated 2 months ago
- This repository contains a web application for multi-lingual transcription using OpenAI's Whisper Automatic Speech Recognition (ASR) mode…☆26Updated last year
- HuggingChat like UI in Gradio☆71Updated 2 years ago
- Training Small Language Model☆26Updated last year
- ☆36Updated 2 years ago
- Demo FastAPI WebSocket Audio☆40Updated 5 years ago
- Multimodal AI App using Llava 7B and Gradio.☆40Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆96Updated last year
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with…☆224Updated this week