dreji18 / Fine-tune-Speech-Recognition
Tutorial on how to train a custom voice recognition model using Hugging Face models.
☆11Updated 2 years ago
Alternatives and similar repositories for Fine-tune-Speech-Recognition
Users that are interested in Fine-tune-Speech-Recognition are comparing it to the libraries listed below
- Real-time Whisper voice recognition with Vosk model feedback.☆119Updated 2 years ago
- Your customized AI assistant - Personal assistants on any hardware! With llama.cpp, whisper.cpp, ggml, LLaMA-v2.☆118Updated last year
- A JS web client for connecting to Pipecat bots with voice and vision☆45Updated 10 months ago
- Train and fine-tune text-to-speech models for Bengali and many other languages!☆15Updated 6 months ago
- Kokoro text-to-speech using JavaScript☆62Updated 9 months ago
- Built using Open Source Stack (Llama 3.2 Model, BGE Embeddings, and Qdrant running locally within a Docker Container)☆149Updated 3 months ago
- Multimodal AI App using Llava 7B and Gradio.☆40Updated last year
- Imagine translating your speech or anybody's speech to any language you want within minutes. Check this out...☆48Updated last year
- Using this LLM-powered tool, you can seamlessly create high-quality (TikTok-style) videos☆11Updated last year
- The UnisonAI Multi-Agent Framework (A2A) provides a flexible and extensible environment for creating and coordinating multiple autonomous…☆22Updated this week
- Add real-time Speech-to-Text to your LiveKit application with AssemblyAI☆17Updated 4 months ago
- Training Small Language Model☆27Updated last year
- ☆11Updated last month
- AI-powered YouTube Notes Generator: Create detailed notes from YouTube videos. Streamlit UI for easy use.☆48Updated last year
- Fork of "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"☆16Updated 11 months ago
- Listen to any audio stream on your machine and print out the transcribed or translated audio.☆119Updated 2 years ago
- Modern Desktop Application offering a suite of tools for audio/video text recognition and a variety of other useful utilities.☆58Updated last year
- 🧠 Mem4AI: An LLM-friendly memory management library.☆33Updated 11 months ago
- A character chat with integrated medium and long-term memory☆21Updated 2 months ago
- HuggingChat-like UI in Gradio☆70Updated 2 years ago
- ✒️ LanguageTool integration for Quill.js editors☆16Updated last year
- A JS web client for connecting to Pipecat bots with voice and vision☆17Updated 9 months ago
- Open Server is an OpenAI API Compatible Server for generating text, images, embeddings, and storing them in vector databases. It also inc…☆17Updated last year
- This project includes a Python script for fine-tuning a text-to-speech (TTS) model. The script uses custom datasets and CUDA for …☆12Updated last year
- Performing a RAG (Retrieval Augmented Generation) assessment using voice-to-voice query resolution. Provide the file containing the queri…☆45Updated last year
- Simple, fast, parallel Hugging Face GGML model downloader written in Python☆24Updated 2 years ago
- WebSage is an AI Engine that extracts content from any URL, generates summaries, and enables interaction using AI models. Choose between …☆16Updated 8 months ago
- An open-source project that uses cutting-edge NLP models and real-time web search to provide dynamic voice query responses. Features incl…☆18Updated last year
- A general purpose AI voice assistant built using GPT-4.☆34Updated 2 years ago
- The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained mode…☆12Updated last year