dreji18 / Fine-tune-Speech-Recognition
Tutorial on how to train a custom voice recognition model using Hugging Face models.
☆11Updated last year
Alternatives and similar repositories for Fine-tune-Speech-Recognition
Users who are interested in Fine-tune-Speech-Recognition are comparing it to the libraries listed below:
- Web App Capable of Predicting Next Word Using BERT☆14Updated 2 years ago
- A neural network-based AI chatbot has been designed that uses LSTM as its training model for both encoding and decoding. The chatbot work…☆21Updated 3 years ago
- ☆17Updated 8 months ago
- Demo FastAPI WebSocket Audio☆40Updated 4 years ago
- A streaming whisper server for on-prem transcription☆20Updated 9 months ago
- Multimodal AI App using Llava 7B and Gradio.☆38Updated last year
- ☆15Updated 2 months ago
- ☆18Updated 4 months ago
- The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained mode…☆11Updated 9 months ago
- An AI tool that helps analyze any YouTube video, gives the sentiment of the video, and suggests a description and topics related to the cont…☆11Updated last year
- Automatically generate a lip-synced avatar based off of a transcript and audio☆13Updated 2 years ago
- ☆50Updated 3 years ago
- A Streamlit app to extract keywords using KeyBERT☆36Updated 4 years ago
- Mirror of hf.co/pyannote/speaker-diarization-3.1☆20Updated last year
- Real-time face detection Streamlit-based web application for server deployment.☆27Updated 3 years ago
- Streamlit app to translate text to or between 50 languages with mBART-50 from Hugging Face and Facebook☆25Updated 3 years ago
- ☆32Updated 2 years ago
- A sleek, customizable interface for managing LLMs with responsive design and easy agent personalization.☆15Updated 8 months ago
- AI Agent capable of automating various tasks using MCP☆37Updated last month
- Performing a RAG (Retrieval Augmented Generation) assessment using voice-to-voice query resolution. Provide the file containing the queri…☆38Updated last year
- AI-powered YouTube Notes Generator: Create detailed notes from YouTube videos. Streamlit UI for easy use.☆42Updated 9 months ago
- Notebooks using the Neural Magic libraries 📓☆41Updated 9 months ago
- Speech Emotion Detection using SVM, Decision Tree, Random Forest, MLP, CNN with different architectures☆35Updated last year
- Code for OpenAI Whisper Web App Demo☆93Updated 2 years ago
- Chainlit app for advanced RAG. Uses LlamaParse, LangChain, Qdrant, and models from Groq.☆45Updated 11 months ago
- Clean up cached models.☆11Updated last year
- Modern Desktop Application offering a suite of tools for audio/video text recognition and a variety of other useful utilities.☆54Updated 9 months ago
- Translate any text using GPT.☆16Updated 2 years ago
- Kokoro text-to-speech using JavaScript☆57Updated 3 months ago
- ☆64Updated 2 years ago