sieve-community / fast-asd
an optimized, production-ready implementation of active speaker detection
☆80 · May 29, 2024 · Updated last year
Alternatives and similar repositories for fast-asd
Users that are interested in fast-asd are comparing it to the libraries listed below
- ACM MM 2021: 'Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection' ☆449 · Oct 23, 2023 · Updated 2 years ago
- The purpose of this repository is to discuss on Audio transformers ☆14 · Aug 22, 2025 · Updated 5 months ago
- The repository for IEEE CVPR 2023 (A Light Weight Model for Active Speaker Detection) ☆165 · Mar 23, 2025 · Updated 10 months ago
- This repository demonstrates various examples using YOLO ☆13 · Feb 9, 2024 · Updated 2 years ago
- ☆33 · Nov 26, 2025 · Updated 2 months ago
- Face Verification API ☆11 · Sep 27, 2021 · Updated 4 years ago
- TheNZT is a powerful multi-agent finance query processing system designed to process and respond to finance-related queries efficiently. … ☆30 · Feb 3, 2026 · Updated last week
- Accurately locating each head's position in the crowd scenes is a crucial task in the field of crowd analysis. However, traditional densi… ☆21 · Mar 16, 2024 · Updated last year
- Implementation of MambaFormer in Pytorch ++ Zeta from the paper: "Can Mamba Learn How to Learn? A Comparative Study on In-Context Learnin… ☆21 · Updated this week
- ☆26 · Oct 15, 2024 · Updated last year
- Official repository of "TensorFlow Serving with Docker for Model Deployment" Coursera Project ☆23 · Aug 27, 2020 · Updated 5 years ago
- Identifying tumor affected scans using Fast.ai and detecting them using openCV ☆13 · Jan 18, 2021 · Updated 5 years ago
- ☆17 · Sep 1, 2024 · Updated last year
- repo for active speaker detection for media videos. ☆31 · Nov 19, 2023 · Updated 2 years ago
- Fine tune Gemma 3 on an object detection task ☆97 · Jul 14, 2025 · Updated 7 months ago
- Audio-Visual Active Speaker Detection with PyTorch on AVA-ActiveSpeaker dataset ☆72 · Jan 18, 2022 · Updated 4 years ago
- Eye exploration ☆31 · Nov 29, 2025 · Updated 2 months ago
- ☆16 · Jan 16, 2023 · Updated 3 years ago
- pix2pix and Cycle GAN architectures for image style transfer ☆13 · May 27, 2021 · Updated 4 years ago
- Reinforcement learning modular with pytorch ☆11 · Jan 18, 2021 · Updated 5 years ago
- This template demonstrates how to create a collaborative team of AI agents that work together to process, analyze, and generate insights … ☆53 · Jan 13, 2025 · Updated last year
- ☆29 · Dec 20, 2025 · Updated last month
- This is a simple example for face verification using facenet implemented by davidsandberg ☆10 · Apr 28, 2019 · Updated 6 years ago
- Convert Python scripts into a single line of code (oneliner). ☆13 · Jan 23, 2026 · Updated 3 weeks ago
- Pluralsight trainings code ☆10 · Jun 24, 2021 · Updated 4 years ago
- Edge Weight Prediction For Category-Agnostic Pose Estimation ☆45 · Feb 1, 2026 · Updated 2 weeks ago
- Local CLI tool that lets you write natural language instructions and get the corresponding shell commands generated by a small language m… ☆21 · Nov 18, 2025 · Updated 2 months ago
- https://demo-web.reflex.run ☆12 · Apr 25, 2024 · Updated last year
- ☆25 · Jan 15, 2026 · Updated last month
- Wikimedia Enterprise - client SDK in Python ☆20 · Nov 11, 2025 · Updated 3 months ago
- ☆13 · Nov 12, 2021 · Updated 4 years ago
- A local, voice-controlled AI assistant with the personality of HAL 9000 from 2001: A Space Odyssey. ☆20 · Aug 16, 2025 · Updated 5 months ago
- Russian phonetical transcription ☆11 · Nov 19, 2025 · Updated 2 months ago
- Image perspective transformation and text recognition ☆10 · Jun 26, 2020 · Updated 5 years ago
- Source code used in the blog ☆12 · Feb 6, 2024 · Updated 2 years ago
- Open Translator: Speech To Speech and Speech to text Translator with voice cloning and other cool features ☆13 · Jan 9, 2026 · Updated last month
- Browser extension for toggling fullscreen on any website. ☆10 · Feb 9, 2020 · Updated 6 years ago
- Implementation of a fast semantic chunker in C++, installable in python 3.7+ projects. ☆22 · Sep 20, 2025 · Updated 4 months ago
- ☆10 · Feb 22, 2025 · Updated 11 months ago