dreji18 / Fine-tune-Speech-RecognitionLinks
Tutorial on how to train a custom voice recognition model using Hugging face models.
☆11Updated 2 years ago
Alternatives and similar repositories for Fine-tune-Speech-Recognition
Users that are interested in Fine-tune-Speech-Recognition are comparing it to the libraries listed below
Sorting:
- Real-time Voice Activity Detection (VAD) with some example use case like simple voice bot and live transcription (realtime transcription)☆95Updated 2 weeks ago
- ☆52Updated 5 months ago
- Open Server is an OpenAI API Compatible Server for generating text, images, embeddings, and storing them in vector databases. It also inc…☆17Updated last year
- Your customized AI assistant - Personal assistants on any hardware! With llama.cpp, whisper.cpp, ggml, LLaMA-v2.☆116Updated last year
- A lightweight, efficient variation of the StyleTTS 2 text‐to‐speech model.☆38Updated 3 months ago
- AI-powered YouTube Notes Generator: Create detailed notes from YouTube videos. Streamlit UI for easy use.☆48Updated last year
- Real-Time Whisper Voice Recognition with vosk model feedback.☆119Updated 2 years ago
- Mirror of hf.co/pyannote/speaker-diarization-3.1☆26Updated last year
- An open source chat bot architecture for voice/vision (and multimodal) assistants, local(CPU/GPU bound) and remote(I/O bound) to run.☆68Updated this week
- Desklib's AI Text Detector☆27Updated 6 months ago
- ☆18Updated 11 months ago
- A mono-repo to house the various supported Transport options to be used with Pipecat's client-js package☆29Updated this week
- Listen to any audio stream on your machine and print out the transcribed or translated audio.☆119Updated 2 years ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆97Updated last year
- Multimodal AI App using Llava 7B and Gradio.☆40Updated last year
- Seamless Voice Interactions with LLMs☆12Updated last year
- Bringing large-language models and chat to web browsers. Everything runs inside the browser with no server support.☆14Updated last year
- Demo FastAPI WebSocket Audio☆40Updated 5 years ago
- ☆36Updated 2 years ago
- An open-source project that uses cutting-edge NLP models and real-time web search to provide dynamic voice query responses. Features incl…☆18Updated last year
- ☆52Updated 3 years ago
- Add real-time Speech-to-Text to your LiveKit application with AssemblyAI☆17Updated 3 months ago
- AI Talking Head: create video from plain text or audio file in minutes, support up to 100+ languages and 350+ voice models.☆36Updated 2 years ago
- Indic TTS for Indian Languages: This is a project on developing text-to-speech (TTS) synthesis systems for Indian languages, improving qu…☆41Updated this week
- streaming speech to text server using Whisper☆94Updated 2 years ago
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with…☆233Updated 3 weeks ago
- Using this LLM-powered tool you can seamlessly create high quality (tiktok type) videos☆11Updated 11 months ago
- Train and finutune text-to-speech models for Bengali and many other languages!☆13Updated 5 months ago
- Real-time speech to text with specific language translation.☆48Updated 4 years ago
- Create Animated Subtitles From .SRT files in Remotion☆68Updated last year