Estimate gaze on computer screen
☆23Jul 28, 2025Updated 7 months ago
Alternatives and similar repositories for WebCamGazeEstimation
Users who are interested in WebCamGazeEstimation compare it to the libraries listed below.
- ☆13Apr 17, 2025Updated 10 months ago
- [INTERSPEECH 2025] The official implementation of DiEmo-TTS: Disentangled Emotion Representations via Self-Supervised Distillation for…☆16Sep 7, 2025Updated 5 months ago
- This repository contains frameworks for pre-processing, training and evaluating full face or multi-region (face, left and right eyes) gaz…☆11Mar 12, 2024Updated last year
- MNIST handwritten digit classification using a GCN☆13Oct 24, 2021Updated 4 years ago
- full camera-to-screen gaze tracking pipeline☆123Aug 9, 2024Updated last year
- Mason-Alberta Phonetic Segmenter☆15Feb 24, 2026Updated last week
- ☆14Jan 9, 2024Updated 2 years ago
- [IJCV 2025] The project is an official implementation of our paper "Learning Structure-Supporting Dependencies via Keypoint Interactive T…☆18Jul 16, 2025Updated 7 months ago
- [ICCV2025 Highlight] GazeGaussian: High-Fidelity Gaze Redirection with 3D Gaussian Splatting☆25Sep 10, 2025Updated 5 months ago
- ☆16Apr 24, 2025Updated 10 months ago
- Descript Audio Codec - VAE Variant (.dac-vae): High-Fidelity Audio Compression with Variational Autoencoder☆31Aug 30, 2025Updated 6 months ago
- ☆21Nov 10, 2023Updated 2 years ago
- ☆17Apr 5, 2024Updated last year
- Megatts2 uses HierSpeechpp's vocoder☆18Dec 2, 2024Updated last year
- [TASLP 2025] The pytorch implementation of BERP: A Blind Estimator of Room Parameters☆21Aug 16, 2025Updated 6 months ago
- ☆20Nov 3, 2024Updated last year
- Token-Level Ensemble Distillation for Grapheme-to-Phoneme Conversion☆20Jul 9, 2019Updated 6 years ago
- Multi-focus image fusion based on Laplacian pyramid.☆18Jul 20, 2023Updated 2 years ago
- ☆15Jul 6, 2023Updated 2 years ago
- A PyTorch implementation of the ECCV 2022 paper "Neural Image Representations for Multi-Image Fusion and Layer Separation"☆23Jul 10, 2022Updated 3 years ago
- ☆21Nov 1, 2018Updated 7 years ago
- Implementation of the subscale framework from the WaveRNN paper, building on top of Fatchord's WaveRNN repo☆19Oct 8, 2020Updated 5 years ago
- Skeleton Based Hand Gesture Recognition Using Data Level Fusion☆24Jan 21, 2025Updated last year
- Official implementation of paper: Shallow Flow Matching for Coarse-to-Fine Text-to-Speech Synthesis☆51Sep 20, 2025Updated 5 months ago
- Speech-To-Text forced-alignment Speech processing Universal PERformance Benchmark☆35May 7, 2025Updated 9 months ago
- This repo contains the source code of the first deep learning-based singing voice beat tracking system. It leverages WavLM and DistilHuBER…☆33Sep 4, 2022Updated 3 years ago
- GRAFX: An Open-Source Library for Audio Processing Graphs in PyTorch☆135Feb 3, 2025Updated last year
- wavenet vocoder using tensorflow☆26Feb 18, 2018Updated 8 years ago
- [CVPR 25] The official repository for paper 'ProbPose: A Probabilistic Approach to 2D Human Pose Estimation'☆51Jan 7, 2026Updated last month
- ☆30Aug 12, 2023Updated 2 years ago
- ⚠️ This repository is archived. Please use https://github.com/nbhr/pycalib . An Implementation of Takahashi, Nobuhara and Matsuyama "A N…☆33Dec 6, 2025Updated 3 months ago
- JATTS: A modern, research-oriented Japanese Text-to-speech Open-sourced Toolkit☆44May 26, 2025Updated 9 months ago
- Materials accompanying the paper "Phonological features for 0-shot multilingual speech synthesis"☆34Aug 11, 2020Updated 5 years ago
- mediapipe-hand, mediapipe-body, mediapipe-face, mediapipe-embedding, mediapipe-classifier and so on. MNN inference☆34Aug 23, 2023Updated 2 years ago
- PyTorch Implementation of Stepwise Monotonic Multihead Attention similar to Enhancing Monotonicity for Robust Autoregressive Transformer …☆39May 16, 2021Updated 4 years ago
- What Do You See in Vehicle? Comprehensive Vision Solution for In-Vehicle Gaze Estimation☆49Aug 12, 2024Updated last year
- Typing to Listen at the Cocktail Party: Text-Guided Target Speaker Extraction (LLM-TSE)☆42Oct 13, 2023Updated 2 years ago
- ☆42Oct 30, 2018Updated 7 years ago
- Skeleton-Based Action Recognition with Local Dynamic Spatial-Temporal Aggregation (Expert Systems with Applications 2023) (Previous name:…☆40Nov 20, 2023Updated 2 years ago