dreji18 / Fine-tune-Speech-RecognitionLinks
Tutorial on how to train a custom voice recognition model using Hugging face models.
☆11Updated 2 years ago
Alternatives and similar repositories for Fine-tune-Speech-Recognition
Users that are interested in Fine-tune-Speech-Recognition are comparing it to the libraries listed below
Sorting:
- ☆36Updated 2 years ago
- Training Small Language Model☆26Updated last year
- A neural network-based AI chatbot has been designed that uses LSTM as its training model for both encoding and decoding. The chatbot work…☆23Updated 4 years ago
- Video Translation with LipSync with OpenAi's whisper for ASR, YourTTS for TTS, and Wav2lip for lip sync.☆19Updated 2 years ago
- Demo FastAPI WebSocket Audio☆40Updated 5 years ago
- Your customized AI assistant - Personal assistants on any hardware! With llama.cpp, whisper.cpp, ggml, LLaMA-v2.☆118Updated last year
- Real-time Voice Activity Detection (VAD) with some example use case like simple voice bot and live transcription (realtime transcription)☆98Updated last month
- A mono-repo to house the various supported Transport options to be used with Pipecat's client-js package☆29Updated this week
- Code for OpenAI Whisper Web App Demo☆93Updated 3 years ago
- Real-time speech to text with specific language translation.☆47Updated 4 years ago
- ☆23Updated 3 years ago
- ☆64Updated 2 years ago
- kokoro text to speech using javascript☆62Updated 8 months ago
- Make Kanye sing any song ya want 🎤🔥☆25Updated 2 years ago
- Multimodal AI App using Llava 7B and Gradio.☆40Updated last year
- A collection of notebooks for the Hugging Face blog series (https://huggingface.co/blog).☆46Updated last year
- GGUF Quantization of any LLM.☆40Updated last year
- 🧠 Mem4AI: A LLM Friendly memory management library.☆30Updated 11 months ago
- Modern Desktop Application offering a suite of tools for audio/video text recognition and a variety of other useful utilities.☆57Updated last year
- InsightSolver: Colab notebooks for exploring and solving operational issues using deep learning, machine learning, and related models.☆101Updated last year
- Extract handwritten information like name, student ID and then recognize them with CRNN-CTC-Attention. Using lexicon search on class list…☆30Updated 6 months ago
- Add real-time Speech-to-Text to your LiveKit application with AssemblyAI☆18Updated 4 months ago
- IPA Phonemizer/Dephonemizer for 139 human languages☆40Updated 2 weeks ago
- Open Server is an OpenAI API Compatible Server for generating text, images, embeddings, and storing them in vector databases. It also inc…☆17Updated last year
- Interactive chat application leveraging OpenAI's GPT-4 for real-time conversation simulations. Built with Flask, this project showcases s…☆25Updated last year
- Real time web based Speech-to-Text app with Streamlit☆251Updated 2 years ago
- An open-source project that uses cutting-edge NLP models and real-time web search to provide dynamic voice query responses. Features incl…☆18Updated last year
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆54Updated 9 months ago
- ☆11Updated 2 years ago
- A sleek, customizable interface for managing LLMs with responsive design and easy agent personalization.☆16Updated last year