PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.
☆14 · Aug 11, 2020 · Updated 5 years ago
Alternatives and similar repositories for SlowFast
Users who are interested in SlowFast are comparing it to the repositories listed below.
- ☆24 · Mar 16, 2026 · Updated 2 months ago
- Code for CVPR 2021 paper Exploring Heterogeneous Clues for Weakly-Supervised Audio-Visual Video Parsing ☆24 · Dec 29, 2021 · Updated 4 years ago
- The repo for "Class-aware Sounding Objects Localization", TPAMI 2021. ☆29 · Mar 4, 2022 · Updated 4 years ago
- Based on StackExchange.Redis that operates Tair For Redis Modules. ☆11 · Feb 28, 2025 · Updated last year
- ☆13 · Nov 28, 2021 · Updated 4 years ago
- The official implementation of V-AURA: Temporally Aligned Audio for Video with Autoregression (ICASSP 2025) (Oral) ☆33 · Feb 11, 2026 · Updated 3 months ago
- A paper list of Weakly Supervised Object Detection (WSOD) resources. ☆13 · May 6, 2021 · Updated 5 years ago
- ☆11 · Aug 27, 2018 · Updated 7 years ago
- A reviewed paper list about applying deep learning models for smarter transportation systems ☆12 · Sep 15, 2020 · Updated 5 years ago
- PyTorch implementation of structure from motion using Libviso2, SIFT, SuperPoint, SPyNet and Sfm Learner. ☆21 · Oct 12, 2021 · Updated 4 years ago
- ☆14 · May 16, 2021 · Updated 5 years ago
- Self-Supervised Learning by Cross-Modal Audio-Video Clustering (NeurIPS 2020) ☆91 · Oct 24, 2022 · Updated 3 years ago
- MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation ☆17 · Sep 2, 2024 · Updated last year
- [Arxiv2022] Revitalize Region Feature for Democratizing Video-Language Pre-training ☆22 · Mar 19, 2022 · Updated 4 years ago
- Crawl traffic data from PEMS ☆10 · Jul 19, 2021 · Updated 4 years ago
- The repository contains the PyTorch implementation of the paper Age invariant face recognition and retrieval by coupled auto-encoder netw… ☆13 · Dec 17, 2022 · Updated 3 years ago
- ☆19 · Aug 6, 2024 · Updated last year
- Official PyTorch implementation of ReWaS (AAAI'25) "Read, Watch and Scream! Sound Generation from Text and Video" ☆44 · Dec 13, 2024 · Updated last year
- HDU - A small script for submitting teacher evaluations at the end of term; it needs to be opened in the console ☆14 · May 21, 2016 · Updated 10 years ago
- ☆12 · Oct 23, 2021 · Updated 4 years ago
- ☆10 · Jul 27, 2019 · Updated 6 years ago
- This is an official implementation of GRIT-VLP ☆20 · Aug 8, 2022 · Updated 3 years ago
- [ACL2026] Uni-MMMU: A Massive Multi-discipline Multimodal Unified Benchmark ☆25 · Apr 13, 2026 · Updated last month
- EmoCapCLIP: Learning Transferable Facial Emotion Representations from Large-Scale Semantically Rich Captions ☆21 · Jul 29, 2025 · Updated 9 months ago
- ☆11 · Jan 29, 2023 · Updated 3 years ago
- ☆14 · Nov 13, 2023 · Updated 2 years ago
- A MaskGIT port from JAX to PyTorch ☆18 · Jun 18, 2022 · Updated 3 years ago
- LANCE: Stress-testing Visual Models by Generating Language-guided Counterfactual Images ☆31 · Nov 30, 2023 · Updated 2 years ago
- One Discrete Word for Visual Reasoning Overtakes Agentic and Latent Methods ☆118 · May 15, 2026 · Updated last week
- A "talking head" project capable of displaying emotions, created using Blender and Python ☆19 · Oct 14, 2018 · Updated 7 years ago
- Guide diffusion on ImageBind embedding similarity ☆29 · May 27, 2023 · Updated 2 years ago
- Philo: uniting modalities. A repository with adaptive fusion techniques for multimodal data ☆26 · Mar 16, 2025 · Updated last year
- [CVPR 2024] KEPP: Why Not Use Your Textbook? Knowledge-Enhanced Procedure Planning of Instructional Videos ☆12 · Sep 24, 2024 · Updated last year
- [ICME 2025] DiffusionTalker: Efficient and Compact Speech-Driven 3D Talking Head via Personalizer-Guided Distillation ☆24 · Mar 25, 2025 · Updated last year
- Analyzing Airline data to predict delays ☆19 · May 15, 2014 · Updated 12 years ago
- [ACL 2025] RuleArena: A Benchmark for Rule-Guided Reasoning with LLMs in Real-World Scenarios ☆26 · Jul 2, 2025 · Updated 10 months ago
- Collection of open datasets in computer vision. ☆13 · Jun 9, 2018 · Updated 7 years ago
- Source code for "Bi-modal Transformer for Dense Video Captioning" (BMVC 2020) ☆231 · Apr 8, 2023 · Updated 3 years ago
- The released data for the paper "Measuring and Improving Chain-of-Thought Reasoning in Vision-Language Models". ☆34 · Sep 16, 2023 · Updated 2 years ago