vb000 / LookOnceToHear

A novel human-interaction method for real-time speech extraction on headphones.

☆597

Alternatives and similar repositories for LookOnceToHear

Users who are interested in LookOnceToHear are comparing it to the repositories listed below.

lifeiteng / OmniSenseVoice
Omni SenseVoice: High-Speed Speech Recognition with word timestamps 🗣️🎯
☆886 Updated last month
DoMusic / Hybrid-Net
Real-time audio to chords, lyrics, beat, and melody.
☆715 Updated last year
shahules786 / mayavoz
PyTorch-based speech enhancement toolkit.
☆336 Updated last year
aiola-lab / whisper-ner
Official implementation of "WhisperNER: Unified Open Named Entity and Speech Recognition"
☆200 Updated 11 months ago
aiola-lab / whisper-medusa
Whisper with Medusa heads
☆865 Updated 6 months ago
tincans-ai / gazelle
Joint speech-language model - respond directly to audio!
☆372 Updated last year
Standard-Intelligence / hertz-dev
first base model for full-duplex conversational audio
☆1,774 Updated last year
skirdey / voicerestore
VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration
☆195 Updated 9 months ago
TuneNN / TuneNN
A transformer-based network model for pitch detection
☆166 Updated 6 months ago
mezbaul-h / june
Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS Toolkit
☆783 Updated last year
PsyChip / machina
OpenCV+YOLO+LLAVA powered video surveillance system
☆783 Updated 3 months ago
facebookresearch / audioseal
Localized watermarking for AI-generated speech audios, with SOTA on robustness and very fast detector
☆682 Updated last month
skrbnv / javad
☆65 Updated last year
lxe / llavavision
A simple "Be My Eyes" web app with a llama.cpp/llava backend
☆492 Updated 2 years ago
pipecat-ai / smart-turn
☆1,249 Updated last week
LAION-AI / natural_voice_assistant
☆497 Updated last year
collabora / WhisperFusion
WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide seamless conversations with an AI.
☆1,642 Updated last year
google / zimtohrli
☆182 Updated 3 months ago
umuthopeyildirim / DOOM-Mistral
Mistral7B playing DOOM
☆139 Updated last year
resemble-ai / Perth
Open Audio Watermarking Tool
☆465 Updated last month
nkasmanoff / pi-card
Raspberry Pi Voice Assistant
☆812 Updated last year
amanvirparhar / chaplin
A real-time silent speech recognition tool.
☆694 Updated 3 months ago
alexcrist / autotone
A vocal pitch correction web application (like Autotune)
☆325 Updated 3 years ago
YuanGongND / whisper-at
Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event …
☆413 Updated last year
PolyAI-LDN / pheme
☆259 Updated last year
dubverse-ai / MahaTTS
☆275 Updated last year
hlt-mt / mosel
Collection of Open Source Speech Data
☆164 Updated 4 months ago
KoljaB / WhoSpeaks
Efficient approach to speaker diarization using voice characteristics extraction
☆106 Updated 7 months ago
bytedance / uss
PyTorch implementation of Universal Source Separation with Weakly Labelled Data.
☆367 Updated 2 years ago
akashmjn / tinydiarize
Minimal extension of OpenAI's Whisper adding speaker diarization with special tokens
☆535 Updated 2 years ago