yakovmon/Real-Time-Audio-Visual-Speech-Enhancement

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/yakovmon/Real-Time-Audio-Visual-Speech-Enhancement)

yakovmon / Real-Time-Audio-Visual-Speech-Enhancement

☆13

Alternatives and similar repositories for Real-Time-Audio-Visual-Speech-Enhancement

Users that are interested in Real-Time-Audio-Visual-Speech-Enhancement are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ULAFF / LAFF-On-HPC
View on GitHub
Repository for LAFF-On Programming for High Performance
☆22Nov 20, 2018Updated 7 years ago
Kajiyu / LLLNet
View on GitHub
Keras Implementation of "Look, Listen and Learn" Model
☆21Nov 14, 2017Updated 8 years ago
WangYihang / LinuxShellScript
View on GitHub
LinuxShell编程笔记
☆15Aug 29, 2017Updated 8 years ago
naomital / computer-vision-course
View on GitHub
☆15Apr 22, 2020Updated 6 years ago
aaronzguan / Autonomous-Bin-Picking
View on GitHub
RLBench simulation project for autonomous bin picking using Pandas robot arm
☆11Mar 1, 2021Updated 5 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
KhanhNguyen4999 / Speech-Enhancement-CLSKD
View on GitHub
Cross-Layer Similarity Knowledge Distillation for Speech Enhancement
☆11Jun 22, 2023Updated 3 years ago
freysie / kaomoji-palette
View on GitHub
A kaomoji input method for macOS. ( ＾ ▽ ＾ )
☆10Jun 4, 2026Updated last month
krunal1313 / 2d-Convolution-CUDA
View on GitHub
This is a simple 2d convolution written in cuda c which uses shared memory for better performance
☆20Apr 12, 2018Updated 8 years ago
stu4355226 / Speech_Enhancement
View on GitHub
Speech Signal Processing project with different types of filters.
☆10Aug 7, 2017Updated 8 years ago
nickswalker / gpsr-command-understanding
View on GitHub
Tools for understanding natural language robot commands
☆12Feb 21, 2021Updated 5 years ago
dr-pato / audio_visual_speech_enhancement
View on GitHub
Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments
☆112Mar 19, 2024Updated 2 years ago
hiddenmaze / InteractivePickup
View on GitHub
Interactive Text2Pickup Network for Natural Language based Human-Robot Collaboration
☆11Sep 28, 2018Updated 7 years ago
kagaminccino / LAVSE
View on GitHub
Python codes for Lite Audio-Visual Speech Enhancement.
☆95May 3, 2024Updated 2 years ago
jordanopensource / arabic-ocr-studygroup
View on GitHub
Arabic handwriting dataset and starter code for deep learning study group
☆15Oct 9, 2017Updated 8 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
tjhrulz / MessagePassingForRainmeter
View on GitHub
A websocket plugin intended for message passing with Rainmeter and other programs such as Wallpaper Engine
☆13Dec 6, 2017Updated 8 years ago
jiahuei / sparse-image-captioning
View on GitHub
Image captioning with weight pruning in PyTorch
☆22Jan 14, 2022Updated 4 years ago
valiakon / MultimodalAnalysis_SpeakerDiarization
View on GitHub
The project tries to solve a speaker diarization problem using audio features, face recognition and video feature extraction from face im…
☆16Feb 10, 2019Updated 7 years ago
thinwayliu / Watermark-Vaccine
View on GitHub
The code for ECCV2022 (Watermark Vaccine: Adversarial Attacks to Prevent Watermark Removal)
☆45Oct 1, 2022Updated 3 years ago
hfutami / distill-bert-for-seq2seq-asr
View on GitHub
☆24Jun 17, 2020Updated 6 years ago
nnyj / python-audio-separator-live
View on GitHub
Real-time vocal/instrumental separation using MDX-NET and MelBand Roformer models
☆27Updated this week
TakingFire / HueSaber
View on GitHub
Highly configurable Philips Hue integration for Beat Saber.
☆10Mar 26, 2023Updated 3 years ago
miralv / Deep-Learning-for-Speech-Enhancement
View on GitHub
Remove noise from sound clips by use of supervised training and an ideal ratio mask.
☆14Apr 2, 2019Updated 7 years ago
parthchadha / upsideDownRL
View on GitHub
Implementation of "Training Agents using Upside-Down Reinforcement Learning (https://arxiv.org/pdf/1912.02877.pdf)"
☆17Dec 17, 2019Updated 6 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
ujw0l / pic-crop
View on GitHub
Electron js app for Mac for picture cropping
☆11Apr 7, 2026Updated 3 months ago
danmic / av-se
View on GitHub
Deep-Learning-Based Audio-Visual Speech Enhancement and Separation
☆222Apr 16, 2023Updated 3 years ago
xingchensong / Speech-Transformer-tf2.0
View on GitHub
transformer for ASR-systerm (via tensorflow2.0)
☆114May 7, 2019Updated 7 years ago
NOTtheMessiah / scosk
View on GitHub
Steam Controller On-Screen Keyboard
☆10May 26, 2016Updated 10 years ago
scorpion004 / luna-voice-assistant
View on GitHub
This is a voice assistant program for windows made of python language. No need of IDE and command to run this program.
☆10Dec 8, 2022Updated 3 years ago
jpminor / mailspring-isaac-light-theme
View on GitHub
A Newton inspired light theme for Mailspring
☆12Nov 7, 2019Updated 6 years ago
weiran / watch-it-later
View on GitHub
Watch videos saved on Instapaper on your Apple TV.
☆11Dec 4, 2024Updated last year
Mentathiel / KMeansJava
View on GitHub
A K-Means implementation in Java created for a Stack Abuse article.
☆12Feb 3, 2021Updated 5 years ago
fediaFedia / Daylight
View on GitHub
Daylight for Rainmeter - Changing your skins to dark mode, along with the wallpaper, system theme and dock
☆12Aug 29, 2020Updated 5 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
waynamigo / VideoActionRecognize
View on GitHub
不良行为识别+身份检测，直接将从摄像头实时检测到的不良行为，将通过人脸检测对应到的学生id和取证照片上传至服务器集群
☆19Jul 8, 2019Updated 7 years ago
rasoulghaznavi / MLSEED
View on GitHub
Emotion Recognition using ML methods on the SEED dataset
☆22Aug 6, 2019Updated 6 years ago
SweetMNM / rainmeter-bluetooth
View on GitHub
bluetooth plugin to use in rainmeter
☆12Jan 1, 2019Updated 7 years ago
danielbraithwt / Speech-Enhancement-with-Variance-Constrained-Autoencoders
View on GitHub
Code and audio files associated with the paper "Speech Enhancement with Variance Constrained Autoencoders" presented at Interspeech 2019
☆15Oct 10, 2019Updated 6 years ago
kenshohara / video-classification-3d-cnn
View on GitHub
Video classification tools using 3D ResNet
☆25Sep 25, 2017Updated 8 years ago
fy378968174 / GAN-based-speech-enhancement-Keras-
View on GitHub
Keras implementation of speech enhancement based on LSGAN
☆20Dec 10, 2017Updated 8 years ago
toni-heittola / dcase2019_task1_baseline
View on GitHub
DCASE2019 Challenge Task 1 baseline system
☆20Oct 11, 2019Updated 6 years ago