dreji18 / Fine-tune-Speech-Recognition
Tutorial on how to train a custom voice recognition model using Hugging Face models.
☆10Updated last year
Related projects:
- ☆33Updated last year
- Generates a blog post using natural language processing, Hugging Face Transformers, and the GPT-2 model.☆17Updated 3 years ago
- ☆41Updated 2 years ago
- Real-time speech to text with specific language translation.☆43Updated 3 years ago
- ☆32Updated last year
- On-device speaker recognition engine powered by deep learning☆24Updated last week
- Whisper2Summarize is an application that uses Whisper for audio processing and GPT for summarization. It generates summaries of audio tra…☆47Updated last year
- Code for OpenAI Whisper Web App Demo☆95Updated 2 years ago
- A neural network-based AI chatbot has been designed that uses LSTM as its training model for both encoding and decoding. The chatbot work…☆22Updated 3 years ago
- Imagine translating your speech or anybody's speech to any language you want within minutes. Check this out...☆34Updated 3 weeks ago
- Building a Deep learning model that predicts the gender of a speaker using TensorFlow 2☆105Updated last year
- Automatically generate a lip-synced avatar based on a transcript and audio☆14Updated last year
- A Streamlit app to extract keywords using KeyBert☆35Updated 3 years ago
- A biometric system for person identification by voice recognition☆8Updated 7 years ago
- Transcription and diarization (speaker identification)☆26Updated last year
- Caption, translate, and optionally record in real time "what you hear" from speakers and microphone. Never miss part of the conversation …☆13Updated 6 months ago
- Face recognition system☆10Updated last year
- ChatBot using the Meta AI Llama v2 model on your local PC.☆12Updated 7 months ago
- A voice assistant with WhisperAI speech recognition☆28Updated 2 weeks ago
- Audio Preprocessing and finetuning of wav2vec2-large-xlsr model on AI4D Baamtu Datamation - Automatic Speech Recognition in WOLOF Data.☆17Updated 2 years ago
- Voice cloning AI (deepfake for voice). Clones a voice from only 5-10 seconds of the target speaker's audio.☆48Updated 2 years ago
- Identify the emotion of multiple speakers in an Audio Segment☆157Updated last year
- Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning …☆22Updated last year
- ☆10Updated 5 years ago
- ☆14Updated 4 months ago
- OpenAI API chatbot implementation☆26Updated 6 months ago
- This repository accompanies the paper, "Irgun: Improved residue based gradual up-scaling network for single image super resolutio…☆12Updated 4 years ago
- This is a Text Analysis App which can be used to find a detailed analysis of a particular text. This includes 5 main types of Analysis - …☆24Updated 2 years ago
- Text to Music Generation App built using Meta's Audiocraft library. It is a Streamlit application that uses the MusicGen small model.☆22Updated last year
- Deep Learning - one shot learning for speaker recognition using Filter Banks☆152Updated 2 months ago