[WACV 2026 Oral] LASER: Lip Landmark Assisted Speaker Detection for Robustness official implemntation
☆28Feb 26, 2026Updated 3 months ago
Alternatives and similar repositories for LASER_ASD
Users that are interested in LASER_ASD are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The repository for Springer IJCV 2025 (LR-ASD: Lightweight and Robust Network for Active Speaker Detection)☆117Mar 23, 2025Updated last year
- RealisMotion: Decomposed Human Motion Control and Video Generation in the World Space (ICML2026)☆40May 12, 2026Updated last month
- ☆68Sep 13, 2022Updated 3 years ago
- ☆11Nov 5, 2021Updated 4 years ago
- Stealth browser automation that actually works. Runs Camoufox (custom Firefox) in Docker with zero Chrome DevTools Protocol exposure, rea…☆51May 28, 2026Updated 2 weeks ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Proof of concept for a reasoning model that runs locally in your browser with WebGPU acceleration☆25Jan 22, 2025Updated last year
- Vision Foundation Models: SAM, ViT, CLIP, DINOv2, object detection, segmentation, and multimodal AI for computer vision.☆22Nov 10, 2025Updated 7 months ago
- Source code of "Deep Rank Hashing Network for Cancellable Face Identification"☆12Jul 8, 2022Updated 3 years ago
- Official implementation of USR (NeurIPS 2024)☆40Dec 21, 2024Updated last year
- Automated Video Generation Solution☆24Jan 1, 2025Updated last year
- ☆64Jul 1, 2025Updated 11 months ago
- Official source codes for the paper: EmoDubber: Towards High Quality and Emotion Controllable Movie Dubbing.☆37Jun 3, 2025Updated last year
- AMI Meeting Parallel Corpus☆12Dec 11, 2020Updated 5 years ago
- The repository for IEEE CVPR 2023 (A Light Weight Model for Active Speaker Detection)☆176Mar 23, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Face detection algorithms in PyTorch.☆81Jan 27, 2022Updated 4 years ago
- A tool for generating python inference pipeline☆64May 29, 2026Updated 2 weeks ago
- CLI for archiving pages and its all links to Wayback Machine☆14Mar 10, 2022Updated 4 years ago
- Recognize speech from an audio file and convert it into animation FBX☆24Mar 7, 2022Updated 4 years ago
- AI agents weaving intelligence, execution, and automation into DeFi☆10Feb 28, 2025Updated last year
- The project page repo for Neural Dubber.☆30Sep 20, 2023Updated 2 years ago
- An ambiguous subtitles dataset for visual scene-aware machine translation☆14Oct 17, 2022Updated 3 years ago
- ☆56Aug 7, 2022Updated 3 years ago
- Make tool-calling schemas for existing tools☆14Mar 8, 2025Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆37Updated this week
- The Unreliability of Explanations in Few-shot Prompting for Textual Reasoning (NeurIPS 2022)☆16Feb 11, 2023Updated 3 years ago
- real-time transcription application☆12Jun 9, 2023Updated 3 years ago
- [ICLR 2026] SparseD: Sparse Attention for Diffusion Language Models☆66Feb 22, 2026Updated 3 months ago
- Make DB of Dojinvoice (DLsite)☆13Apr 25, 2026Updated last month
- A simple JavaScript library that uses jsPsych and Google Sheet for running behavioral experiments online☆15Jun 8, 2021Updated 5 years ago
- ☆21Apr 18, 2023Updated 3 years ago
- The repository is no longer maintained. See https://github.com/s7tya/gakicc.☆16Jan 23, 2024Updated 2 years ago
- NATOPS Aircraft Handling Signals Database (FG 2011)☆16Aug 16, 2017Updated 8 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Retrieval Augmented Generation, but no servers involved. Backed by S3☆12Nov 3, 2023Updated 2 years ago
- Official PyTorch implementation of "BroadFace: Looking at Tens of Thousands of People at Once for Face Recognition", ECCV 2020☆24Jun 14, 2021Updated 5 years ago
- A drag-and-drop-enabled, responsive, envelope graph that allows to shape a wave with attack, decay, sustain and release☆11Jan 5, 2023Updated 3 years ago
- ☆21Jun 20, 2024Updated last year
- Add n-gram and large language model (LLM) support to Whisper models.☆43May 6, 2025Updated last year
- automatic music transcription application written in java☆12Jan 13, 2013Updated 13 years ago
- A comfyui costume node by BillBum for using api gen (VLM LLM T2I API Tools)☆10May 26, 2026Updated 3 weeks ago