an optimized, production-ready implementation of active speaker detection
☆86May 29, 2024Updated 2 years ago
Alternatives and similar repositories for fast-asd
Users who are interested in fast-asd are comparing it to the libraries listed below.
- Incredibly descriptive audiovisual summaries for videos☆41Aug 2, 2024Updated last year
- This Repository demonstrates various examples using YOLO☆13Feb 9, 2024Updated 2 years ago
- A quality zero-shot lipsync pipeline built with MuseTalk, LivePortrait, and CodeFormer.☆52Sep 25, 2024Updated last year
- The repository for Springer IJCV 2025 (LR-ASD: Lightweight and Robust Network for Active Speaker Detection)☆123Mar 23, 2025Updated last year
- Adding a multi-text multi-speaker script (diffe) that is based on a script from asiff00 on issue 61 for Sesame: A Conversational Speech G…☆26Mar 28, 2025Updated last year
- Accurately locating each head's position in the crowd scenes is a crucial task in the field of crowd analysis. However, traditional densi…☆21Mar 16, 2024Updated 2 years ago
- ☆46Jun 26, 2026Updated last week
- The fastest Whisper optimization for automatic speech recognition as a command-line interface ⚡️☆10Dec 3, 2023Updated 2 years ago
- Streamlit-Based License Plate Recognition (LPR) App☆12Mar 26, 2025Updated last year
- ☆21Aug 21, 2024Updated last year
- Bilateral Cross-Modality Graph Matching Attention for Feature Fusion in Visual Question Answering☆11Feb 16, 2023Updated 3 years ago
- ☆15Mar 18, 2026Updated 3 months ago
- ☆15May 13, 2024Updated 2 years ago
- Content Aware Fill for Linux's Python☆10Jul 6, 2022Updated 3 years ago
- Example code - use word embeddings to make emoji prediction smarter with context☆11Sep 14, 2018Updated 7 years ago
- Optimized Syncnet and Chinese enhanced version, EN and CN checkpoints released☆11Nov 8, 2021Updated 4 years ago
- Runpod WhisperX Docker Container Repo☆16Mar 10, 2024Updated 2 years ago
- Open-source, modular cloud automation and billing system.☆17Jun 4, 2026Updated last month
- 🦙 Inference code for LLaMA models (modified for cpu)☆12Mar 4, 2023Updated 3 years ago
- Add Rain Streak Mask On Unpaired Image Using GAN☆10Sep 12, 2020Updated 5 years ago
- Simple playground chat app that interacts with OpenAI's functions with memory and custom tools.☆17Jul 11, 2023Updated 2 years ago
- Streamlit component like Microsoft Excel☆25Sep 7, 2022Updated 3 years ago
- We propose MMAD, a novel automated pipeline for precise AD generation. MMAD introduces ambient music alongside visual and linguistic, enh…☆17Dec 31, 2024Updated last year
- ShellSpeak translates natural language to shell commands, simplifying system interactions for non-tech-savvy users. With color-coded UI, …☆12Nov 26, 2023Updated 2 years ago
- repo for active speaker detection for media videos.☆31Nov 19, 2023Updated 2 years ago
- Implementation of MambaFormer in Pytorch ++ Zeta from the paper: "Can Mamba Learn How to Learn? A Comparative Study on In-Context Learnin…☆22Jun 22, 2026Updated last week
- ☆15May 25, 2024Updated 2 years ago
- UpToDateAI, an open source tool to help you help AI assist you with coding and debugging in lesser-known or newly released programming fr…☆12Sep 10, 2024Updated last year
- A python package of robust and effective defogging/dehazing method☆15Dec 30, 2018Updated 7 years ago
- An Agentic RAG starter that uses Swarm, Nemo Guardrails and SingleStore as a database☆29Dec 18, 2024Updated last year
- Module for the WHMCS system for managing Mikrotik secrets users as a VPN product.☆10Oct 11, 2023Updated 2 years ago
- This dataset is presented in the paper Merkel Podcast Corpus: A Multimodal Dataset Compiled from 16 Years of Angela Merkel's Weekly Video…☆12Sep 21, 2022Updated 3 years ago
- Repository for Nature Communications paper entitled "Sleep-like Unsupervised Replay Reduces Catastrophic Forgetting in Artificial Neural …☆15Oct 28, 2022Updated 3 years ago
- ☆14Jul 17, 2024Updated last year
- Code and dataset for NAACL 2022 paper "CoSIm: Commonsense Reasoning for Counterfactual Scene Imagination" Hyounghun Kim, Abhay Zala, Mohi…☆16Nov 26, 2022Updated 3 years ago
- Barebones Rust EVM Implementation☆12Feb 9, 2022Updated 4 years ago
- YouTube Assistant☆12May 15, 2023Updated 3 years ago
- A curated list of Story Ending Generation models; DASFAA'22: Incorporating Commonsense Knowledge into Story Ending Generation via Heterog…☆14May 12, 2022Updated 4 years ago
- This repository provides code and resources for Parameter Efficient Fine-Tuning (PEFT), a technique for improving fine-tuning efficiency …☆18Feb 23, 2024Updated 2 years ago