[WACV 2026] LASER: Lip Landmark Assisted Speaker Detection for Robustness official implemntation
☆25Feb 26, 2026Updated 2 months ago
Alternatives and similar repositories for LASER_ASD
Users that are interested in LASER_ASD are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The repository for Springer IJCV 2025 (LR-ASD: Lightweight and Robust Network for Active Speaker Detection)☆113Mar 23, 2025Updated last year
- ☆13Aug 24, 2023Updated 2 years ago
- INTERSPEECH2023: Target Active Speaker Detection with Audio-visual Cues☆59May 29, 2023Updated 2 years ago
- ☆68Sep 13, 2022Updated 3 years ago
- repo for active speaker detection for media videos.☆31Nov 19, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ViSpeR: Multilingual Audio-Visual Speech Recognition☆57Apr 17, 2025Updated last year
- ☆64Jul 1, 2025Updated 10 months ago
- ☆25Nov 17, 2025Updated 5 months ago
- Python library for finding similar content in videos.☆18Nov 29, 2023Updated 2 years ago
- Official source codes for the paper: EmoDubber: Towards High Quality and Emotion Controllable Movie Dubbing.☆37Jun 3, 2025Updated 11 months ago
- A tool for generating python inference pipeline☆59Mar 25, 2026Updated last month
- CLI for archiving pages and its all links to Wayback Machine☆14Mar 10, 2022Updated 4 years ago
- Recognize speech from an audio file and convert it into animation FBX☆24Mar 7, 2022Updated 4 years ago
- ☆24Jul 30, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- The project page repo for Neural Dubber.☆30Sep 20, 2023Updated 2 years ago
- Implementation of "Look, Listen and Recognise:character-aware audio-visual subtitling"☆20Nov 3, 2025Updated 6 months ago
- Make tool-calling schemas for existing tools☆14Mar 8, 2025Updated last year
- ☆10Feb 17, 2022Updated 4 years ago
- a set of erlang utilities for the Olson timezone database files☆34Dec 21, 2015Updated 10 years ago
- ☆13Dec 18, 2024Updated last year
- This is the official Python implementation repository for a paper entitled "Resolving Camera Position for a Practical Application of Gaz…☆12Jan 11, 2022Updated 4 years ago
- The Unreliability of Explanations in Few-shot Prompting for Textual Reasoning (NeurIPS 2022)☆16Feb 11, 2023Updated 3 years ago
- [ICLR 2026] SparseD: Sparse Attention for Diffusion Language Models☆64Feb 22, 2026Updated 2 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆11Apr 16, 2026Updated 2 weeks ago
- [🏆 IJCV 2025 & ACCV 2024 Best Paper Honorable Mention] Official pytorch implementation of the paper "High-Quality Visually-Guided Sound …☆29Mar 30, 2026Updated last month
- Official Repo for MoCha Towards Movie-Grade Talking Character Synthesis☆61Dec 27, 2025Updated 4 months ago
- A simple JavaScript library that uses jsPsych and Google Sheet for running behavioral experiments online☆15Jun 8, 2021Updated 4 years ago
- ☆12Apr 18, 2021Updated 5 years ago
- ☆12Nov 25, 2021Updated 4 years ago
- A Multi-Format Transfer Learning Model for Event Argument Extraction via Variational Information Bottleneck☆10Sep 9, 2022Updated 3 years ago
- The repository is no longer maintained. See https://github.com/s7tya/gakicc.☆16Jan 23, 2024Updated 2 years ago
- This repo is re-produce for Channel_pruning☆11May 17, 2018Updated 7 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- NATOPS Aircraft Handling Signals Database (FG 2011)☆16Aug 16, 2017Updated 8 years ago
- material about gaze estimation or gaze tracking for codes, papers and demos.☆10Jul 19, 2021Updated 4 years ago
- A guide to structured generation using constrained decoding☆18Jun 9, 2024Updated last year
- Computer Vision Paper Reading☆11Nov 21, 2019Updated 6 years ago
- Retrieval Augmented Generation, but no servers involved. Backed by S3☆12Nov 3, 2023Updated 2 years ago
- Wikidata Subsetting☆17Feb 26, 2023Updated 3 years ago
- ☆21Jun 20, 2024Updated last year