vb000 / LookOnceToHear
A novel human-interaction method for real-time speech extraction on headphones.
☆597Updated last year
Alternatives and similar repositories for LookOnceToHear
Users who are interested in LookOnceToHear are comparing it to the repositories listed below.
- Omni SenseVoice: High-Speed Speech Recognition with word timestamps 🗣️🎯☆886Updated last month
- Real-time audio to chords, lyrics, beat, and melody.☆715Updated last year
- PyTorch-based speech enhancement toolkit.☆336Updated last year
- Official implementation of "WhisperNER: Unified Open Named Entity and Speech Recognition"☆200Updated 11 months ago
- Whisper with Medusa heads☆865Updated 6 months ago
- Joint speech-language model - respond directly to audio!☆372Updated last year
- first base model for full-duplex conversational audio☆1,774Updated last year
- VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration☆195Updated 9 months ago
- A transformer-based network model for pitch detection☆166Updated 6 months ago
- Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS Toolkit☆783Updated last year
- OpenCV+YOLO+LLAVA powered video surveillance system☆783Updated 3 months ago
- Localized watermarking for AI-generated speech audio, with SOTA robustness and a very fast detector☆682Updated last month
- ☆65Updated last year
- A simple "Be My Eyes" web app with a llama.cpp/llava backend☆492Updated 2 years ago
- ☆1,249Updated last week
- ☆497Updated last year
- WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide seamless conversations with an AI.☆1,642Updated last year
- ☆182Updated 3 months ago
- Mistral7B playing DOOM☆139Updated last year
- Open Audio Watermarking Tool☆465Updated last month
- Raspberry Pi Voice Assistant☆812Updated last year
- A real-time silent speech recognition tool.☆694Updated 3 months ago
- A vocal pitch correction web application (like Autotune)☆325Updated 3 years ago
- Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event …☆413Updated last year
- ☆259Updated last year
- ☆275Updated last year
- Collection of Open Source Speech Data☆164Updated 4 months ago
- Efficient approach to speaker diarization using voice characteristics extraction☆106Updated 7 months ago
- This is the PyTorch implementation of the Universal Source Separation with Weakly labelled Data.☆367Updated 2 years ago
- Minimal extension of OpenAI's Whisper adding speaker diarization with special tokens☆535Updated 2 years ago