PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.
☆14Aug 11, 2020Updated 5 years ago
Alternatives and similar repositories for SlowFast
Users that are interested in SlowFast are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Codebase for VidHal: Benchmarking Hallucinations in Vision LLMs☆14Apr 19, 2025Updated 11 months ago
- ☆24Mar 16, 2026Updated last week
- Code for CVPR 2021 paper Exploring Heterogeneous Clues for Weakly-Supervised Audio-Visual Video Parsing☆24Dec 29, 2021Updated 4 years ago
- The repo for "Class-aware Sounding Objects Localization", TPAMI 2021.☆29Mar 4, 2022Updated 4 years ago
- ☆10Jun 28, 2023Updated 2 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- The official implementation of V-AURA: Temporally Aligned Audio for Video with Autoregression (ICASSP 2025) (Oral)☆33Feb 11, 2026Updated last month
- A paper list of Weakly Supervised Object Detection (WSOD) resources.☆13May 6, 2021Updated 4 years ago
- Revisiting Test Time Adaptation Under Online Evaluation☆35May 2, 2024Updated last year
- A reviewed paper list about applying deep learning models for smarter transportation systems☆12Sep 15, 2020Updated 5 years ago
- ☆14Jan 9, 2026Updated 2 months ago
- Pytorch implemenation of structure from motion using Libviso2, SIFT, SuperPoint, SPyNet and Sfm Learner.☆21Oct 12, 2021Updated 4 years ago
- ☆14May 16, 2021Updated 4 years ago
- Self-Supervised Learning by Cross-Modal Audio-Video Clustering (NeurIPS 2020)☆91Oct 24, 2022Updated 3 years ago
- MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation☆14Sep 2, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- The dataset consists of public social media url pairs and the corresponding entailment label for an external conference (ACL 2021). Each …☆14Aug 16, 2021Updated 4 years ago
- Crawl traffic data from PEMS☆10Jul 19, 2021Updated 4 years ago
- Unlocking the Essence of Beauty: Advanced Aesthetic Reasoning with Relative-Absolute Policy Optimization☆23Jan 27, 2026Updated last month
- Official repository for "Self-Supervised Video Transformer" (CVPR'22)☆108Jun 26, 2024Updated last year
- Unofficial implementation of Variational Diffusion Models in PyTorch (Lightning)☆11Aug 31, 2023Updated 2 years ago
- Generalized cross-modal NNs; new audiovisual benchmark (IEEE TNNLS 2019)☆31Apr 13, 2020Updated 5 years ago
- The repository contains the Pytorch Implementation of the paper Age invariant face recognition and retrieval by coupled auto-encoder netw…☆13Dec 17, 2022Updated 3 years ago
- Official PyTorch implementation of ReWaS (AAAI'25) "Read, Watch and Scream! Sound Generation from Text and Video"☆44Dec 13, 2024Updated last year
- ☆10Jul 27, 2019Updated 6 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆12Oct 23, 2021Updated 4 years ago
- ☆22Feb 13, 2026Updated last month
- EmoCapCLIP: Learning Transferable Facial Emotion Representations from Large-Scale Semantically Rich Captions☆20Jul 29, 2025Updated 7 months ago
- ☆11Jan 29, 2023Updated 3 years ago
- ☆20Jun 26, 2024Updated last year
- A MaskGIT port from JAX to PyTorch☆18Jun 18, 2022Updated 3 years ago
- A simple demo project of cmake and google protocol buffer.☆10Dec 3, 2013Updated 12 years ago
- Official implementation of CMMCoT: Enhancing Complex Multi-Image Comprehension via Multi-Modal Chain-of-Thought and Memory Augmentation☆12Dec 5, 2025Updated 3 months ago
- Shaping Visual Representations with Language for Few-shot Classification, ACL 2020☆16May 9, 2021Updated 4 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A "talking head" project capable of displaying emotions created using blender and python☆19Oct 14, 2018Updated 7 years ago
- PyTorch GPU distributed training code for MIL-NCE HowTo100M☆219Jul 5, 2022Updated 3 years ago
- Guide diffusion on ImageBind embedding similarity☆29May 27, 2023Updated 2 years ago
- [CVPR 2024] KEPP: Why Not Use Your Textbook? Knowledge-Enhanced Procedure Planning of Instructional Videos☆12Sep 24, 2024Updated last year
- [NeurIPS 2025] This is the official repository for "RAD: Towards Trustworthy Retrieval-Augmented Multi-modal Clinical Diagnosis"☆26Nov 21, 2025Updated 4 months ago
- [ICME 2025] DiffusionTalker: Efficient and Compact Speech-Driven 3D Talking Head via Personalizer-Guided Distillation☆24Mar 25, 2025Updated last year
- Source code for "Bi-modal Transformer for Dense Video Captioning" (BMVC 2020)☆230Apr 8, 2023Updated 2 years ago