dreji18 / Fine-tune-Speech-Recognition
Tutorial on how to train a custom voice recognition model using Hugging Face models.
☆10 Updated last year
Alternatives and similar repositories for Fine-tune-Speech-Recognition:
Users who are interested in Fine-tune-Speech-Recognition are comparing it to the repositories listed below:
- A JS web client for connecting to Pipecat bots with voice and vision ☆44 Updated 3 months ago
- A mono-repo to house the various supported Transport options to be used with Pipecat's client-js package ☆17 Updated this week
- This project generates a blog post using natural language processing, Hugging Face Transformers, and the GPT-2 model. ☆17 Updated 3 years ago
- Text-to-music generation app built using Meta's Audiocraft library. It is a Streamlit application that uses the MusicGen small model. ☆25 Updated last year
- ☆36 Updated last year
- Multimodal AI App using Llava 7B and Gradio. ☆38 Updated 11 months ago
- ☆16 Updated 10 months ago
- This project includes a Python script for fine-tuning a text-to-speech (TTS) model. The script uses custom datasets and CUDA for … ☆14 Updated 5 months ago
- Training Small Language Model ☆24 Updated last year
- Performing a RAG (Retrieval Augmented Generation) assessment using voice-to-voice query resolution. Provide the file containing the queri… ☆35 Updated last year
- AI Voice Assistant: Talk to an AI agent that helps you with event scheduling, contact management, accessing your knowledge base, and web … ☆32 Updated 3 months ago
- 🧠 Mem4AI: An LLM-friendly memory management library. ☆20 Updated 5 months ago
- Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning … ☆26 Updated last year
- A neural network-based AI chatbot that uses LSTM as its training model for both encoding and decoding. The chatbot work… ☆22 Updated 3 years ago
- Modern desktop application offering a suite of tools for audio/video text recognition and a variety of other useful utilities. ☆51 Updated 7 months ago
- An autonomous mall assistant that can answer user queries using tools. Powered by LLMs. ☆14 Updated last year
- Notebooks using the Neural Magic libraries 📓 ☆42 Updated 8 months ago
- WhisperAnywhere: Effortless speech-to-text everywhere on your Mac. Use a hotkey to dictate in any app, powered by Whisper AI and Groq API… ☆27 Updated 6 months ago
- ☆20 Updated last year
- This package is the Python implementation of Deepgram's WebVTT and SRT formatting. Given a transcription, this package can return a valid… ☆19 Updated 5 months ago
- ☆13 Updated last year
- ☆20 Updated last year
- Automatically generate a lip-synced avatar based on a transcript and audio ☆14 Updated 2 years ago
- ☆12 Updated last year
- Streamlit app for scheduling habits and interacting with your schedule using ChatGPT and LangChain ☆19 Updated last year
- ☆48 Updated 3 years ago
- Powered by OpenAI Whisper & Gradio ☆30 Updated 2 years ago
- ☆23 Updated 2 years ago
- Web tool to count LLM tokens (GPT, Claude, Llama, ...) ☆28 Updated this week
- VSCode Copilot for Groq fans! ☆41 Updated 8 months ago