mohan696matlab / whisper-finetuning-youtube-seriseLinks
☆11Updated 2 months ago
Alternatives and similar repositories for whisper-finetuning-youtube-serise
Users that are interested in whisper-finetuning-youtube-serise are comparing it to the libraries listed below
Sorting:
- ☆64Updated 5 months ago
- Build a real-time AI voice assistant using Python that can handle incoming calls, transcribe speech, generate intelligent responses, and …☆46Updated 11 months ago
- mvc architecture with flutter x getx☆1Updated last year
- Python package for Real-time, Local Speech-to-Text and Speaker Diarization. FastAPI Server & Web Interface☆381Updated last week
- A simple voice agent using FastRTC and Groq☆47Updated 2 months ago
- ☆50Updated 3 months ago
- ☆157Updated 7 months ago
- Real time audio to audio translation over sockets. With virtual microphones, you can use this in any video conferencing software you'd li…☆42Updated 11 months ago
- Fine-tune Bangla ASR model which was trained Bangla Mozilla Common Voice Dataset☆11Updated last year
- Modern startup website made with React.js & Tailwind CSS☆18Updated last year
- ☆100Updated 9 months ago
- ☆340Updated 11 months ago
- A real-time voice chat application powered by local AI models☆46Updated 3 months ago
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection☆769Updated last month
- Text-to-Speech for languages of India☆256Updated 8 months ago
- AI Agent for Telephony voice bot - based on vocode, twilio, deepgram, and elevenlabs. Just add your own keys and prompt.☆25Updated 10 months ago
- ☆41Updated 2 years ago
- Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.☆319Updated 2 years ago
- ☆64Updated 2 months ago
- ☆282Updated 6 months ago
- An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine☆436Updated 10 months ago
- Generative AI phone call toolkit using Twilio Media Streams.☆457Updated 7 months ago
- OpenAI compatible TTS for Sesame CSM:1b & dia:1.6b - Voice Cloning from File/YT☆368Updated 2 months ago
- Conversational voice AI agents☆336Updated this week
- Real-time Speech To Text using Faster Whisper.☆57Updated 11 months ago
- ☆777Updated last month
- A talking LLM that runs on your own computer without needing the internet.☆512Updated last month
- ☆40Updated 6 months ago
- Kyutai's Speech-To-Text and Text-To-Speech models based on the Delayed Streams Modeling framework.☆1,884Updated last week
- The repo contains an audio emotion detection model, facial emotion detection model, and a model that combines both these models to predic…☆75Updated last year